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UCR DIMINUTION IN TEMPORAL CONDITIONING 


AND HABITUATION * 


VIRGINIA E. PENDERGRASS Ax» H. D. KIMMEL 
Ohio University 


A study was done to compare diminution of the GSR to a 1,000-cps 
tone during regular and irregular temporal spacing of the tones, and 
to determine the influence of a judgment task on the GSR under 
these conditions. One group received a 90-db. tone every 40 sec. for 
40 trials, while one group received the series of 40 tones at irregular 
temporal intervals of 20, 30, 40, 50, or 60 sec, In addition, half of 
the Ss in each group were required to judge the intensity of the UCS 
after each stimulus presentation. No evidence was found for temporal 
conditioning in the nonjudgment groups, although there did appear 
to be anticipatory conditioned responding in the judgment groups. 
It was suggested that the task facilitated the development of antic- 
ipatory responding in temporal conditioning, and as a consequence 
also resulted in smaller GSRs to the tone. In habituation, however, 
the task appeared to disrupt the ongoing process, and eliminated 
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response attentuation. 


Increased arousal or attention induced by 


the task and/or by S’s perception of regular spacing was proposed 
as a possible explanation of these effects. 


The differences between classical 
conditioning and habituation are widely 
recognized but their similarities are 
rarely considered. Both procedures 
involve repeated presentations of stim- 
uli and both result in attenuation of 
the UCR. Diminution of the uncondi- 
tioned GSR across classical condition- 
ing trials, as compared with responding 
in control groups which received un- 
paired presentations of the CS and 
UCS, has been demonstrated by Bax- 

1 Based on the senior author's MA thesis 
done under the supervision of the junior au- 
thor and supported in part by United States 
Public Health Service Grant MH-12262-02. 
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ter (1966) and Kimmel (1967), and 
attenuation of the GSR across habitua- 
tion trials to light and auditory stimuli 
has been observed by Kimmel and 
Goldstein (1966). In addition to re- 
sponse attenuation, habituation shares 
with conditioning other characteristics 
of learning such as "savings" in rehabit- 
uation and disinhibition produced by 
changes in stimulus intensity. Re- 
cently, conditioning interpretations of 
habituation have been proposed (Kim- 
mel & Goldstein, 1966; Stein, 1966). 

The purpose of this experiment was 
to compare directly diminution of the 
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GSR under conditions of regularly and 
irregularly spaced trials. The regular 
spacing, of course, is tantamount to a 
temporal conditioning procedure. Tem- 
poral conditioning of the GSR, using a 
task in conjunction with background 
stimulation to disguise the regularity 
of UCS presentations, has been re- 
ported by Lockhart (1966). 

Some evidence has been found in 
classical GSR conditioning for acceler- 
ated diminution of the UCR when 
Ss were required to judge the intensity 
of the UCS, as compared with diminu- 
tion when no such task was required 
(Kimmel, 1967). Whether this effect 
of the judgment task is uniquely re- 
lated to the conditioning process, or 
whether the effect would be observed in 
any situation involving repeated stimu- 
lation (ie., habituation), is not known. 
Consequently, in this experiment half 
of the Ss in each group, fixed or vari- 
able stimulation, were required to 
judge the intensity of the stimulus. It 
was expected that Ss who received 
regular spacing of stimuli, whether 
judging intensity or not, would show 
more response attenuation than Ss who 
received the irregular spacing. In 
addition, it was expected that more re- 
sponse diminution would be shown by 
Ss who judged stimulus intensity, re- 
gardless of the spacing of trials. No 
interaction between these variables 
could be reasonably predicted. 


METHOD 


Subjects.—Seventy-one undergraduate stud- 
ents at Ohio University, randomly assigned 
to groups, served as Ss. Eleven of these 
were discarded because of E errors or equip- 
ment failure. 

Apparatus—The tone used throughout 
for all groups was a 90 db. (physical refer- 
ence), 1,000-cps tone of 1.0-sec. duration, 
produced by General Radio Co. equipment 
and delivered via Trimm earphones with 
MX-41/AR cushions. In the variable in- 


terval condition a randomly ordered set of 
intertrial intervals of 20, 30, 40, 50, or 60 
sec. duration, averaging 40 sec. overall, was 
presented. In the fixed interval condition a 
tone was delivered every 40 sec. 

Zinc-zinc sulphate electrodes (1 in.) in 
lucite cups filled with Fels GSR paste were 
used to pick up the exodermal GSR as a dc 
resistance change from the palm and back 
of the S’s left hand. Responses were ampli- 
fied and recorded using equipment previously 
described (Morrow, 1966). All responses 
were transformed to units of change in log 
conductance. 

Judgments were made by S using a plas- 
tic lever with his right hand. The lever 
could be rotated through 180*, and the judg- 
ments were automatically recorded. The de- 
vice and recording apparatus have been pre- 
viously described (Kimmel, 1967). Judg- 
ment responses were converted to logarithm 
of amount of lever movement. 

Design.—Fifteen Ss were assigned to each 
of the four groups, which were treated as 
follows: (a) the nonjudgment, variable inter- 
val (NJ-V) group received a series of 40 
tones presented at irregular intervals; (b) 
the judgment, variable interval (J-V) group 
received the same series of tones on an 
identical presentation schedule as the NJ-V 
group, but was also required to judge the 
intensity of each tone immediately after its 
presentation; (c) the nonjudgment, fixed 
interval (NJ-F) group received a series of 
40 tones presented at regular temporal inter- 
vals; (d) the judgment, fixed interval (J-F) 
group received the series of 40 tones at 
regular temporal intervals, and was required 
to judge the intensity of each tone. 

Procedure.—Each S was seated in a dark, 
soundproof, air-conditioned IAC series 1200 
audiometric chamber. The E and the equip- 
ment were in an adjoining room. After the 
GSR electrodes and earphones were properly 
placed, and the doors closed, instructions 
were read through an intercom. All Ss 
were instructed to relax and pay attention, 
and Ss in the judgment groups were addi- 
tionally instructed in the use of the judg- 
ment device. In the judgment groups, one 
preliminary tone was given to ensure that 
S understood the use of the judgment device. 
The acquisition series of 40 tones was then 
presented. Following acquisition, there was 
an extinction period in which no tones were 
presented but GSRs were recorded during 
10 consecutive 40-sec. intervals (ie, 400 
sec.). 
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RESULTS 


Temporal | conditioning.—Since it 
was assumed that temporal condition- 
ing might contribute to diminution of 
the GSR in the fixed interval groups, 
it was first necessary to determine 
whether temporal conditioning occur- 
red. During acquisition, a CR was 
defined as the first measurable GSR 
in the 35-40 sec. period of each 
40-sec. interval (referred to as acquisi- 
tion AR). In the fixed interval groups 
this constituted the 5 sec. preceding the 
delivery of the tone. Since stimulus 
presentations were irregular in the 
variable interval groups, this 5-sec. 
period fell irregularly in the intervals 
between stimuli. If this 5-sec. in the 
variable interval groups were preceded 
within 10 sec. by a tone, the responses 
were omitted on the grounds that they 
might either be responses to the tone or 
be attenuated by having closely fol- 
lowed a response to the tone. 

Mean acquisition ARs in blocks of 
two trials are presented for the two 
judgment and the two nonjudgment 
groups separately in Fig. la and 1b.* 
These figures show that the mean level 
of anticipatory responding in the J-F 
group was generally higher than that of 
any other group. In addition, there ap- 
peared to be virtually no difference in 
mean levels of responding among the 
other groups. Analysis of variance 
showed that the interaction between 
the task and interval variables ap- 
proached significance, F (1, 56) = 
3.57, error MS = .03,  Duncan's multi- 
ple-range test indicated that the J-F 
group mean differed from that of the 
J-V group, but that none of the other 


2 One acquisition AR measure (the first 
response of S# 12 in the J-F group) was 
excluded from consideration since it ex- 
ceeded the next highest measure in the en- 
tire distribution of 2,122 measures by more 
than four standard deviations. 
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Fic. la. Mean acquisition ARs in blocks of 
two trials for the judgment groups. 
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Fic. 1b. Mean acquisition ARs in blocks of 
two trials for the nonjudgment groups. 


differences between pairs of means 
were significant. 

During extinction, an anticipatory 
response (extinction AR) was defined 
as the first response occurring between 
35-40 sec. of each 40-sec. interval after 
the presentation of the last acquisition 
stimulus. The mean extinction AR in 
the J-F group was higher than in the 
other groups, and analysis of variance 
showed that the interaction between 
task and interval variables was signif- 
icant, F (1, 56) = 7.07, error MS = 
0116. Duncan’s multiple-range test in- 
dicated that the J-F and NJ-V groups 
did not differ significantly from one 
another, but that both differed signif- 
icantly from the other two groups. 

Then in the acquisition and extinc- 
tion AR data of the judgment groups 
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there was some evidence that temporal 
conditioning occurred, the J-F mean 
ARs exceeding those of the J-V group. 
The mean acquisition AR of the NJ-F 
group, however, did not differ signif- 
icantly from that of the NJ-V group. 
Moreover, during extinction the NJ-V 
group mean level of responding was 
significantly higher than in the NJ-F 
group, a difference opposite in direc- 
tion from what would be expected had 
temporal conditioning occurred. 

Diminution of the GSR.—The GSR 
to the tone was measured during the 
time beginning 1.0 sec. after the onset 
of the stimulus and ending 4 sec. later. 
Mean GSRs, transformed to change in 
log conductance, of the judgment and 
nonjudgment groups during acquisition 
are presented in Fig. 2a and 2b, in 
two-trial blocks. These responses 
were corrected for differing levels of 
initial responsiveness to the tone by 
subtracting the magnitude of each S’s 
GSR to the tone on the first trial from 
each subsequent response. 

Figures 2a and 2b show that the 
relative levels of responding according 
to task were reversed for variable and 
fixed intervals; ie., in the judgment 
condition, the variable interval group's 
responses were higher than those in 
the fixed interval while in the nonjudg- 
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Fic. 2a. Mean adjusted UCR magnitudes 
during acquisition in blocks of two trials 
for the judgment groups. 
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Fic. 2b. Mean adjusted UCR magnitudes 
during acquisition in blocks of two trials for 
the nonjudgment groups. 


ment condition, the fixed interval 
group’s responses were higher. The 
NJ-V group showed a decline in re- 
sponding characteristic of habituation, 
while this did not occur in either the 
NJ-F or J-V group responses. Anal- 
ysis of variance of the UCR data re- 
vealed a significant interaction for the 
task and interval variables, F (1, 
56) = 5.99, error MS = 11.22, and a 
significant three-way interaction, F 
(19, 1,064) = 2.27, error MS =2.27. 
The three-way interaction may be in- 
terpreted as indicating that the manner 
in which the fixed and variable inter- 
val groups changed across trials in re- 
lation to one another differed for the 
two task conditions and was, in fact, 
reversed in them. 

Judgments—There was no signif- 
icant effect of the interval variable on 
judgment responses and the average 
within-S correlation across trials be- 
tween the magnitude of judgment and 
the GSR was found to be zero overall. 


Discussion 


The primary purpose of this experiment 
was:to compare diminution of the GSR 
during temporally regular presentations 
of a loud tone with that occurring during 
temporally irregular stimulus presenta- 
tions, and to examine the effect that a 
judgment task might have on response 
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attenuation under these conditions. The 
main result was that a significant inter- 
action between the temporal spacing and 
task variables occurred. In the nonjudg- 
ment groups, considerably greater diminu- 
tion of responding occurred during tem- 
porally irregular presentations of the 
stimuli than during temporally regular 
presentations. But, in the judgment 
groups this relationship was reversed and, 
in addition, neither of the judgment 
groups showed as much reduction in re- 
sponding as was shown by the NJ-V 
group. 

One of the possible factors underlying 
this reversal from the judgment to the 
nonjudgment conditions was that a dif- 
ference in anticipatory responding (ac- 
quisition ARs) was found between the 
two judgment groups but not the two 
nonjudgment groups. To the extent that 
the occurrence of an AR might have di- 
rectly inhibited the orienting reaction 
(OR) to the tone, this would account for 
the fact that the two judgment groups’ 
ORs to the tone differed in the observed 
direction, That is, the larger the AR, the 
smaller the response to the tone (com- 
pare Fig. 2a and 2b). But, this explana- 
tion does not appear to apply to the very 
large difference in amount of response 
diminution shown between the two non- 
judgment groups, since no difference 
between their acquisition ARs was ob- 
served (Fig. 1b). Nor, in fact, does the 
direct inhibition explanation tell us much 
about why hardly any diminution to speak 
of was observed in either the J-V or 
NJ-F groups. 

The lack of response attenuation in 
the J-V group (as compared with NJ-V) 
may have simply been due to disruption 
of habituation of the OR by the judgment 
task. This agrees with results obtained 
by Garcia-Ausst, Bagacz, and Vanzulli 
(1964), who found that a task during 
habituation which called attention to the 
stimulus increased the magnitude of vis- 
ual evoked cortical responses. It may be 
that the judgment task had this effect 
because it was performed at the very 
moment that response-produced stimula- 
tion was occurring, since, as Kimmel 


and Goldstein (1966) have noted, reten- 
tion of habituation is not found until 
proprioception is present. 

The lack of diminution in responding 
in the NJ-F group cannot be explained 
as easily. No evidence was found in the 
acquisition ARs to support the conclusion 
that temporal conditioning occurred in 
this group; thus, no direct inhibition of 
responding could be produced by prior 
ARs. But this was true as well for the 
NJ-V group, which showed considerable 
response attenuation. Yet, the only dif- 
ference between these two groups’ pro- 
cedures was the regular vs. irregular 
spacing of the stimuli, It is possible that 
the regular occurrence of the stimuli 
created a situaton for S in which normal 
OR habituation mechanisms were over- 
come by attentional processes activated 
by S’s perception of the regularity. Since 
this would have influenced both groups 
receiving regular spacing, only the J-F 
group, which showed larger acquisition 
ARs, should have shown response at- 
tenuation directly due to the ARs (but 
not due to habituation). Thus, the re- 
sponses to the tone in the J-F group were 
intermediate between those of the NJ-F 
and NJ-V groups. 

The expectation that conditioned re- 
sponding would occur in both fixed in- 
terval groups was based primarily upon 
the results of one previous experiment, 
that of Lockhart (1966). In Lockhart's 
experiment, as a control measure to elim- 
inate active time estimation, Ss heard 
background music and were given the 
task of identifying the selections. There 
was no group in Lockhart's study which 
received regularly spaced stimulus pres- 
entations without the background stimula- 
tion and task. The groups in the present 
experiment whose procedures correspond 
most closely to that used by Lockhart are 
the judgment groups, and the judgment 
groups are also the groups which pro- 
duced evidence for some conditioned re- 
sponding. It seems, therefore, that the 
task may have facilitated the development 
of conditioned responding and that only 
under special conditions does the fixed 
time interval function as a CS for tem- 
poral conditioning. 


6 VIRGINIA E. PENDERGRASS AND H. D. KIMMEL 


The role of the task in temporal condi- 
tioning, however, is complicated by the 
observation that the task used by Lock- 
hart effectively disguised stimulus regu- 
larity, whereas the task used in this ex- 
periment should have emphasized stimulus 
regularity. There appears to be no spe- 
cific commonality between the character- 
istics of the tasks themselves. It is sug- 
gested, therefore, that the task may have 
increased the drive or arousal of S, which 
is known to facilitate classical eyelid con- 
ditioning (Taylor, 1951). Some support 
for this interpretation may be found in a 
GSR conditioning study done by Kimmel 
(1967). Using a shock UCS, Kimmel 
gave two groups of Ss, either with or 
without a judgment task, three preliminary 
sensitization trials. She found that after 
the sensitization trials, during a series of 
classical conditioning trials, the judgment 
group responses were uniformly and sig- 
nificantly higher than the nonjudgment 
group responses. Otherwise the course 
of conditioning was the same, No ex- 
planation was offered for this difference 
between judgment and  nonjudgment 
groups except a tentative suggestion that 
it might have been due to sampling errors. 
It is possible that the instructions regard- 


ing the judgment task may have been re- 
sponsible for the effect by elevating drive. 


REFERENCES 


Baxter, R. Diminution and recovery of the 
UCR in delayed and trace classical GSR 
conditioning. J. exp. Psychol., 1966, 71, 
447-451. 

Garcra-Ausst, E., Bacacz, J., & VANZULLI, 
A. Effects of attention and inattention 
upon visual evoked responses. EEG clin. 
Neurophysiol., 1964, 17, 136-143. 

Krumet, E. Judgments of UCS intensity 
and diminution of the UCR in classical 
GSR conditioning. J. exp. Psychol., 1967, 
73, 532-543. 

KmmeL, H. D, & GorposrEN, A. J. Re- 
tention of habituation of GSR to visual 
and auditory stimulation. J. exp. Psychol., 
1967, 73, 401-404. 

Lockmamr, R. A. Temporal conditioning of 
GSR. J. exp. Psychol., 1966, 71, 438-446. 

Morrow, M. C. Recovery of conditioned 
UCR diminution following extinction. J. 
exp. Psychol., 1966, 71, 884-888. 

SrErw, L. Habituation and stimulus novelty : 
A model based on classical conditioning. 
Psychol. Rev., 1966, 73, 352-356. 

TAYLOR, J. A. The relationship of anxiety to 
the conditioned eyelid response. J. exp. 
Psychol., 1951, 41, 81-92. 


(Received April 21, 1967) 


: 


a 


Journal of Experimental Psycholo, 
1908 Vol 77, No- d, T3 


EFFECTS OF NATURAL LANGUAGE MEDIATORS ON 


SHORT-TERM MEMORY * 


HAROLD O. KIESS ? 


University of Illinois 


2 experiments were run to assess the relationship of natural lan; 
mediators (NLMs) or idiosyncratic cies to recall in enel 
term memory (STM). 250 Ss each received 25 trials on a task 
similar to that employed by Peterson and Peterson (1959). Associa- 
tion value (low, medium, and high) of the stimulus CVCs or item 
presentation time (2, 3, and 4 sec.) were varied. NLMs were obtained 
by verbal report during presentation of the CVC and upon recall of 
the CVC at the end of the retention interval (0, 5, 10, 15, or 30 sec.). 
Items with NLMs were better retained than rote learned items at 
all retention intervals except 0 sec. Association value had a strong 
effect both on number of NLMs given and on recall of CVCs. Pre- 
sentation time was positively but weakly related to the number of 
NLMs given during the study period, and had no effect on the pro- 
portion of CVCs correctly recalled. 


Natural language mediators (NLMs), 
defined as the idiosyncratic associations 
or mediators that an S may impose 
upon verbal items when attempting to 
learn those items, and which are as- 
sessed by verbal report, have been 
shown to be strongly related to the 
level of recall in both long-term mem- 
ory (LTM) and short-term memory 
(STM) (Groninger, 1966; Montague, 
Adams, & Kiess, 1966). Montague et 
al. found recall in LTM of paired 
associates to be strongly related to 
presence of an NLM in acquisition 
and recall of that NLM at the time of 
the retention test. Groninger (1966) 
questioned his Ss about NLMs after 
recall on each trial of a STM task 
similar to that used by Peterson and 


1 These experiments are based on a dis- 
sertation submitted to the Graduate College 
of the University of Illinois in partial ful- 
fillment of the requirements for the PhD 
degree in psychology. The thesis was under 
the supervision of Jack A, Adams. The 
research was supported by Grant MH-12022 
from the United States Public Health 
Service. 

2Now at PRL, United States Army 
Natick Laboratories, Natick, Massachusetts. 


Peterson (1959). Natural language 
mediators were reported for a large 
number of items and those items were 
found to be better retained and more 
resistant to the effects of proactive 
inhibition (PI) than items without 
NLMs. To complement this finding, 
Adams and Montague (1967) have 
found that paired associates with 
NLMs are more resistant to retro- 
active inhibition than items without 
NLMs. These investigators have 
hypothesized that NLMs represent the 
transfer of well learned language habits 
to new verbal materials and may thus 
be effective in aiding retention of the 
new material. Although this hypoth- 
esis is provocative, the presently avail- 
able data do not permit it to be firmly 
accepted. 

The purpose of the present experi- 
ments was to investigate more fully the 
relationship of NLMs to item recall in 
STM by varying the association value 
(a) of the items to be learned and 
item presentation time (f). Associa- 
tion value (Exp. I) was chosen as a 
variable since it would be expected that 
as a increases S could more easily 
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apply his verbal habits to the item and 
report an NLM for the item. Mon- 
tague et al. (1966) have found this to 
be the case for increasing meaningful- 
ness in LTM. It has also been found 
that recall in STM is a positive func- 
tion of increasing meaningfulness or 
association value of the material to be 
learned (e.g. Groninger, 1966; Mc- 
Nulty, 1965; Peterson, Peterson, & 
Miller, 1961) and it is hypothesized 
that the increase in recall as meaning- 
fulness or association value increases 
may be related to a corresponding in- 
crease in number of NLMs. 

Presentation time (Exp. II) was 
chosen on the assumption that longer £ 
would permit S to form more NLMs 
(e.g., Bugelski, 1962; Montague et al., 
1966) and this increase in number of 
NLMs would also be accompanied by 
a corresponding increase in recall of 
the criterion items. Since Brown 
(1958), Peterson and Peterson (1959), 
and Sanders (1961) have found recall 
in STM to increase with longer t it is 
plausible to hypothesize that this in- 
crease may be related to a correspond- 
ing increase in number of NLMs as t 
lengthens. 


METHOD 
Experiment I 


Procedure.—Two experiments, both em- 
ploying the same procedure, were done to 
evaluate the effects of a and ¢ on NLMs 
and short-term retention of CVCs. The task 
employed was similar to that used by Peter- 
son and Peterson (1959). The S was pre- 
sented with a CVC via a slide projector for 
a brief period with instructions to learn it 
and recall was requested after a brief re- 
tention interval. If, during the learning time 
S had an NLM for the CVC, he was to 
report it, although he was not urged or 
required to have an NLM on each trial. 
If an NLM was reported, E recorded the 
latency of the beginning of the NLM by 
stopping an electric timer which was auto- 
matically started upon presentation of the 
CVC. Latency was recorded because it was 
expected that NLMs of lower associative 


strength would be forgotten more often, and 
that latency of response might reveal those 
lower strength NLMs. Since latency is an 
accepted measure of associative strength of 
responses (e.g, Adams, 1967; Eimas & 
Zeaman, 1963) it would be expected that 
latency of the NLM would also be related 
to its associative strength. 

The retention interval began immediately 
upon presentation of the next slide. If the 
retention interval was longer than 0 sec. 
this slide contained a series of randomly 
selected digits which S was required to read 
aloud at a fast pace for the duration of the 
retention interval. The purpose of reading 
the numbers aloud was to minimize possible 
covert rehearsal of the CVC by S. The 
digit reading was followed by a signal for 
recall of the CVC (a slide with an asterisk 
on it) which was followed by a signal for 
recall of the NLM (slide saying “Recall 
Aid”) which may have been reported during 
the acquisition period. Five seconds were 
allowed for recall of the CVC and 10 sec. 
for recall of the NLM. If the retention 
interval was 0 sec., reading the digits aloud 
was eliminated and the CVC slide was fol- 
lowed immediately by the signals for recall 
of the CVC and NLM. Recall of the NLM 
was required because Montague et al. (1966) 
found recall of the CVC strongly related to 
recall of the NLM and it was deemed im- 
portant to see if the same occurred in STM. 
Immediately upon conclusion of one trial 
another began. Each S received 3 practice 
trials and 25 criterion trials of the form 
described above. For the practice trials 
geometric forms were used as stimuli since 
they are relatively neutral materials which 
should provide no proactive inhibition yet 
permit S to make a verbal response to them 
as he had to in the experiment proper. All 
stimulus materials were presented via a 
slide projector timed by a paper tape 
reader. 

Experimental design.—Experiment I varied 
a of items over three levels: low, medium, 
and high. Each level of a was presented to 
an independent group of 50 Ss where each S 
was given 25 criterion trials on the task 
described above. Five trials were given at 
each of the following retention intervals: 0, 
5, 10, 15, and 30 sec. Retention intervals 
were counterbalanced over trials so each re- 
tention interval appeared an equal number of 
times at each trial. A different CVC was 
given on each trial and items were counter- 
balanced within blocks of five trials, Five 
different orders of retention-interval se- 
quences were used in each group and each 
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order was given to 10 Ss of the group. 
The same orders were used for each 
group. 

Materials.—The stimulus items were low, 
medium, or high a CVCs selected from 
Archer’s (1960) scaling. Average a for 
the items in the low, medium, and high 
groups was 12,5, 48.8, and 89.8%, respec- 
tively. 

Item presentation time.—The item pre- 
sentation time was 3 sec. (2-sec. slide pre- 
sentation and l-sec. interslide interval) for 
all groups. In calculating the presentation 
time for a CVC, the interslide interval (1 
sec.) was included as part of the presenta- 
tion time along with the actual "on" time 
for the CVC slide, Bugelski (1962) found 
that Ss use all the time available to them 
either for formation of an NLM or for rote 
practice, so all the time which S conceivably 
could use was counted. 

Subjects—The Ss were 150 paid male 
and female volunteers from the University 
of Illinois. Forty-three males and 107 fe- 
males were equally distributed among groups. 


Experiment II 


Experimental design—Experiment II util- 
ized the same procedures and design as Exp. 
I except that the three groups of 50 Ss 
each were assigned to one of three f’s and a 
was held constant over groups. 

Materials.—The CVCs for all groups were 
the low a items used in Exp. I. 

Item presentation time—The ts were 2 
sec. (l-sec. slide, l-sec. interslide), 3 sec. 
(2-sec. slide, l-sec. interslide), or 4 sec. 
(3-sec. slide, l-sec. interslide) for the re- 
spective groups. 

Subjects—The Ss were 150 paid male 
and female volunteers from the University 
of Illinois. Twenty-eight males and 72 fe- 
males were equally distributed among the f 
= 2- and 4-sec. groups. The 3-sec. condition 
was the low a group of Exp. I. 


RESULTS 
Experiment I 


Scoring of performance.—The cate- 
gories of response for the main analysis 
were four: presence or absence of an 
NLM in the verbal report during the 
syllable-presentation slide, and correct 
or incorrect recall of the CVC during 
the asterisk slide. However, in addi- 
tion to giving an NLM in acquisition, 


S may or may not have been able to 
recall the NLM during the Recall- 
Aid slide. The forgetting of NLMs 
will be treated in a later section. 

Any application of S's established 
language habits to the CVC was con- 
sidered to be an NLM and this will be 
referred to as mediated learning. In 
general, the most common NLMs were 
words and phrases containing the CVC. 
Phrases or sentences with initial letters 
of consecutive words forming the CVC 
were considered as NLMs. Lack of a 
response or responses which did not 
involve application of S's language ha- 
bits were considered to be rote learn- 
ing. Merely spelling the letters of the 
CVC, or attempting to pronounce it 
during the syllable-presentation slide 
were considered to be rote responses. 
Only CVCs that had their letters cor- 
rectly recalled in the proper sequence 
were scored correct. 

CVC recall.—In order to statistically 
analyze the number of CVCs correctly 
recalled (regardless of verbal report 
category) as a function of retention in- 
tervals, recall responses of each S were 
pooled over trials and items for each 
retention interval. The analysis em- 
ployed was a Case II design (Winer, 
1962, p. 337) with the total number 
correct for each S at each retention 
interval as the cell entry. Levels of a 
and the retention-interval sequences for 
subgroups of 10 Ss were between-Ss 
factors and rentention intervals were 
the within-Ss factor. Number of cor- 
rect recalls increased with increasing 
a, F (2, 135) = 50.52, p < .01, and de- 
creased with longer retention intervals, 
F (4, 540) = 45.40, p < 01. Thea x 
Retention Interval interaction was also 
significant, F (8, 540) = 992, p < .01. 
Other effects were nonsignificant. 
Table 1 presents percentage of items 
correctly recalled over retention inter- 
vals, for all groups. 
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NLMs given.—To analyze the total 
number of NLMs given as a function 
of a and retention intervals, the same 
analysis of variance used for the num- 
ber of correct recalls was employed ex- 
cept number of NLMs was the re- 
sponse measure used. Number of 
NLMs increased with a, F (2, 135) = 
76.44, p < 01, but all other main 
effects and the interactions were non- 
significant. Table 1 presents the per- 
centage of NLMs given for each group. 

NLMsrecalled.—Virtually all NLMs 
given during the syllable slide were re- 
called on the Recall-Aid slide. For 
all groups only 1.2% of the NLMs 
given were forgotten. Out of the 28 
instances where NLMs were forgotten 
in the three groups, only one CVC was 
correctly recalled. 

Learning mode.—In order to ascer- 
tain whether presence of an NLM in 
acquisition was related to superior re- 
call of the item, it is necessary to com- 
pare the recall of items with NLMs to 
recall of items without NLMs. Figure 
1 shows the retention of items as a 
function of learning mode (NLMs vs. 
rote) over retention intervals for each 
of the three groups. To test the sig- 
nificance of learning mode on retention, 
an analysis of variance was run com- 
paring recall as a function of learning 
mode for the low and medium a groups. 
For each S two scores were derived 


TABLE 1 


PERCENTAGE OF CVCs ConRECTLY RECALLED 
AT EACH RETENTION INTERVAL, PER- 
CENTAGE OF NLMs GIVEN, AND 
Mean LATENCY or NLMs as 
A FUNCTION OF ITEM Asso- 

CIATION VALUE, Exp. I 


Retention Interval 


Item Per- Mean 
Associa-| eo) centage | Latency 
Value |——— ———4——4—-| NLMs | (sec.) 
o |5 |10| 15 | 30 
Low | 99|80|73|68|65 32 2.09 
Medium | 98 | 93 | 90 | 84 | 85 61 1.91 
High 100 | 98 | 98 | 94 | 93 87 1.77 
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Fic. 1. Percentage of correct recalls over 
retention intervals as a function of report 
category and item association value, Exp. I. 


from the data of the 25 trials; the pro- 
portion of mediated items (items with 
NLMs) and the proportion of rote 
items correctly recalled. The Ss who 
used fewer than five NLMs and five 
rote reports were not included in the 
analysis, The high a group was 
not included in this analysis since very 
few Ss in this group gave the five rote 
reports set as a minimum for this 
analysis. These proportions were then 
entered into a two-factor repeated-mea- 
sures analysis of variance (Winer, 
1962, p. 302). Association value (low 
and medium a) was the between-Ss 
variable and correctness of recall as 
a function of learning mode was the 
within-Ss variable. This analysis em- 
ployed 27 Ss per group since this was 
the maximum number of Ss meeting 
the cut-off criterion for NLMs and 
rotes in the medium a group. Twenty- 
seven Ss were randomly selected from 
those meeting criterion in the low a 
group for inclusion in this analysis. 
Both a and learning mode effects were 
statistically significant, F (1, 52) = 
797,p < 01;F (1, 52) = 2756, p € 
.01, respectively. This indicates that 
mediated items were better retained 
than rote-learned items and that both 
mediated and rote-learned items were 
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better retained as a increased. All 
other effects were nonsignificant. 

NLM latencies.—The mean latency 
for each S who gave 10 or more NLMs 
was calculated and group means are 
presented in Table 1. Multiple com- 
parisons revealed that mean latencies 
of groups significantly increased with 
decreasing a and the latency of each 
group differed from the others at the 
.05 level. Within-S latencies for NLMs 
were unrelated to the correctness of re- 
call. 


Experiment II 


Scoring of performance.—The anal- 
yses run in Exp. II were the same as 
those of Exp. I, and thus the same cate- 
gorization of Ss' responses was used as 
in Exp. I. 

CVC recall.—The number of correct 
recalls increased only marginally with 
increasing t, F (2, 135) = 2.95,.05 < 
p< .10. Recall, however, did decrease 
with increasing length of retention 
interval, F (4, 540) = 76.07, p < .01. 
Other effects were nonsignificant. 
Table 2 presents percentage of items 
correctly recalled over retention inter- 
vals for all groups. 

NLMs given—The number of 
NLMs given increased with longer 
t, F (2, 135) = 5.49, p < .01, and the 
increase occurred mainly between the 
t = 3- and 4-sec. groups. Table 2 pre- 
sents the percentage of NLMs given 
for each group. 

NLMs recalled.—Again, virtually 
all NLMs given during the syllable 
slide were recalled on the Recall-Aid 
slide. For all groups only 1.3% of the 
NLMs given were forgotten. Out of 
the 18 instances where NLMs were 
forgotten in the three groups, only one 
CVC was correctly recalled. 

Learning mode.—To determine the 
relationship of NLMs to recall of 
CVCs, the categorization of items on 


TABLE 2 


PERCENTAGE OF CVCs CORRECTLY RECALLED 
AT EACH RETENTION INTERVAL, PER- 
CENTAGE OF NLMs GIVEN, AND 
Mean Latency or NLMs as a 
FUNCTION or ITEM PRESEN- 

TATION TIME, Exp. II 


Item | Retention Interval (sec.) 
Presen- 


tation | 


time | o | 5 | 10] 15 | 30 
(sec.) 


67 64 
3 99 | 80 | 73 | 68 | 65 33 
4 98 | 82 | 76 | 73 | 73 47 


the basis of presence or absence of an 
NLM in acquisition and whether the 
CVC recall was correct or incorrect 
was again employed. Figure 2 shows 
the recall of mediated and rote items 
over retention intervals for all Ss in 
each of the three groups. To test the 
significance of learning mode (NLMs 
vs. rote) on retention, an analysis of 
variance was run comparing retention 
as a function of learning mode for the 
three groups. Proportions of mediated 
and rote items correctly recalled for 
Ss giving five or more NLM and rote 
reports were determined and entered 
into the two-factor repeated-measures 
analysis of variance (Winer, 1962, p. 
302) discussed previously. This anal- 
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Fic. 2. Percentage of correct recalls over 
retention intervals as a function of report 
category and item presentation time, Exp. IL + 
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ysis employed 26 Ss per group since 
this was the maximum number of Ss 
meeting the cut-off criterion in the 
t = 2-sec. group. Twenty-six Ss were 
randomly selected from those meeting 
the criterion in each of the other two 
groups. The only significant effect was 
that due to learning mode, F (1, 75) = 
32.28, p, < 01, which indicates that 
mediated items were better retained 
than rote items but that increasing ¢ 
had no effect on either type of learn- 
ing mode. 

NLM latencies—The mean latency 
for each S who gave 10 or more NLMs 
was calculated and the mean for each 
group is presented in Table 2. Multi- 
ple comparisons revealed that latencies 
significantly increased with ¢ and the 
latency of each group differed from the 
others at the .01 level. Within-S la- 
tencies for NLMs were unrelated to 
the correctness of recall. 


Discussion 


These findings firmly extend to STM 
the relationship between item recall and 
NLMs previously found for LTM and 
are not subject to the criticism, as was 
Groninger’s (1966) study, that the NLMs 
may not have been active during the 
actual time the association was being 
formed. In the present study the NLMs 
were reported by S during the learning 
period and they were available to S at 
the time the association was being formed. 

The experiments, however, do not in- 
dicate the mode of operation of NLMs. 
Two possibilities arise, that NLMs are 
merely an indicant or correlate of high 
associative strength for an item that is 
well learned by rote repetition, or that 
NLMS are an actual determinant of 
level of recall in STM. Under the latter 
hypothesis, NLMs would be treated as an 
application of well learned language 
habits, which when applied to new items 
would transfer response strength to those 
items, Since retention is known to be 
a function of associative strength ( Under- 
wood, 1964) this approach would predict 


items with NLMs to be better retained 
than nonmediated items. Also, assuming 
the interference theory of forgetting, the 
NLM-item complex is a relatively unique 
verbal unit, and this uniqueness or dis- 
similarity may be expected to impart a 
resistance to experimental and extraex- 
perimental sources of interference that 
would produce less forgetting than would 
occur for items alone (Adams, 1967; 
Adams & Montague, 1967; Montague et 
al, 1966). The prevalence of NLMs in 
this study and their relationship to items 
that are better retained would seem to 
imply that further investigation is neces- 
sary before either the causal or correlate 
hypotheses may be accepted. 

Recall of both mediated and rote- 
learned items was strongly dependent on 
the level of a. The most plausible ex- 
planation of this seems to be to consider 
a as a relative indicant of how fre- 
quently S has experienced the item pre- 
viously in the written and spoken lan- 
guage, and thus how high the item is in 
S's repertoire and how likely it is to be a 
response. The strength of rote items 
would be a consequence of this experience 
and rote recall should be a positive func- 
tion of a, which was found. With re- 
spect to NLMs, it would seem that as a 
increases, the probability of an item hav- 
ing an NLM increases and that these 
items were better recalled either because 
the NLM was an indicant of high asso- 
ciative strength established by rote repe- 
tition or the NLM transferred associative 
strength to the item and the amount of 
strength transferred increased with a. 

The small effect of t on both CVC re- 
calls and NLMs is rather surprising in 
the light of previous research on the ef- 
fects of t on recall and NLMs. The basic 
assumption behind increasing ¢ in this ex- 
periment was that S would have more 
time to form an NLM, but that if he did 
not form one he could use the additional 
time for rote practice. If there was lit- 
tle difference in NLM production as a 
function of f, as was the case, then a 
difference in recall of rote-learned items 
as a function of f would be expected. In 
other words, ¢ is a variable for asso- 
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ciative strength whether the learning 
mode is rote or mediation. Under this 
assumption, the small effect of t on NLMs 
and CVC recall was unexpected. This 
result was probably due to the instruc- 
tional set that was given to Ss. The Ss 
were instructed to report an NLM if 
they had one, although they were not told 
they had to have one. It is likely that 
with this instructional set Ss used the 
additional time to find an NLM rather 
than devote it to rote practice as ex- 
pected, and this is supported by the in- 
crease in NLM latencies with increasing 
t. If S had been under a strong instruc- 
tional set to rehearse the item as many 
times as possible in the acquisition period, 
it is likely that ? would have an effect on 
rote learning. 
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TEMPORAL CHARACTERISTICS OF SENSORY INTERACTION 
IN CHOICE REACTION TIMES* 


LENORE K. MORRELL 
Division of Neurology, Stanford University School of Medicine, Palo Alto, California 


Reaction times (RTs) to a photic stimulus were evaluated as a 
function of the interval between a flash and a click which followed 
at varying intervals (ranging 20-120 msec.) in a choice RT task. 
RT to flash was found to be a linear function of the interstimulus 
interval between the flash and the click. The shorter the interval, 
the faster the RT. With choice RT, mean responses to flash-click 
pairs separated by as much as 120 msec. were reliably faster than those 


obtained to flash presented alone. 


Helson (1964) and  Hershenson 
(1962) have both reported a facilita- 
tory effect upon reaction time (RT) 
to a visual stimulus when an auditory 
stimulus was presented at brief inter- 
vals afterwards. Helson reported fa- 
cilitation effects at interstimulus inter- 
vals of 35 msec. or less and inhibitory 
effects at intervals greater than 50 
msec., when comparison was made with 
RT to the visual signal presented alone. 
Hershenson, using quite different meth- 
ods, found evidence for facilitation of 
visual RT by auditory stimuli coming 
as long as 65 msec. afterwards. In a 
recent study similar in format to that 
of Helson, only facilitative effects of 
the extraneous auditory stimulus could 
be observed (L. Morrell, 1967). RTs 
to paired visual auditory stimuli from 
20 to 80 msec. onset asynchrony were 
reliably shorter than RTs to the visual 
stimulus presented alone. Further, a 
linear function fit the set of mean data 
points obtained from interstimulus 
intervals ranging from 20 to 120 msec. 

All of the above studies were vari- 
ants of the Donders (1954) a-reaction. 
The Ss responded on every trial, for 


1This research was supported by Grant 
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every trial in the series contained at 
least the signal defined as the reaction 
stimulus. 

There is no published evidence of 
which the author is aware on such 
intersensory effects for choice RT. It 
was decided to study the effects of ex- 
tranenous auditory stimuli coming after 
a visual stimulus upon visual RT in 
experiments in which occasionally the 
auditory stimulus would be presented 
alone, and response to the latter was to 
be withheld. This is similar to the 
classic c-reaction defined by Donders 
(1954), except that there are a variety 
of positive reaction stimuli (all the 
light-click pairs and light alone), and 
only one nonreaction stimulus (sound 
alone). Would the additional require- 
ment of signal discrimination and the 
tagging of the auditory stimulus as one 
for which response was to be withheld 
alter the temporal characteristics of 
the effect ? 

There has been much recent interest 
in the neural mechanisms involved in 
processing polysensory information. 
Using electrographic criteria, neural 
inhibition and facilitation have been 
demonstrated for both single-unit re- 
sponse and evoked potentials to paired 
stimuli in different modalities (Brazier, 
1961; Buser & Imbert, 1959; Jung, 
Kornhuber, & Da Fonseca, 1963; F. 
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Morrell, 1967). These studies have 
been restricted in that no known be- 
havioral significance was attached to 
the stimuli, and anesthetized and/or 
curarized animal preparations were em- 
ployed. Therefore, one of the motiva- 
tions of the present investigation was to 
assess the relationship between behav- 
ior in a cross-modal task and the aver- 
aged cortical evoked potentials elicited 
by compound stimuli at varying inter- 
vals of temporal separation. Only the 
behavioral data will be considered here. 


METHOD 


Subjects—The Ss were seven adults 
(ranging in age 20-38), who were either 
graduate students or laboratory technicians. 
All were naive to the procedures of this 
laboratory. All were right-handed. 

Stimuli —The photic stimulus consisted of 
a 10-msec. flash delivered to both eyes from 
a pair of Sylvania glow modulator tubes 
(R1131C). These were mounted in a spe- 
cially constructed frame with a viewing 
hood. The tubes were behind diffusing 
glasses, which were 1 in. from Ss eyes. In- 
tensity of light measured 50 ftl with a 
Spectra brightness spot meter. With the 
eyes closed, the brightness of the light after 
dark adaptation was well above threshold 
(approximately 2 log units). The auditory 
stimulus was a click of 70 db. delivered from 
a tweeter placed 4 ft. directly in front of 
Ss head. A warning tone of 130 cps and 
20-msec. duration occurred 1-2 sec. before 
each trial. 

Procedure.—The Ss were seated in a 
lounge chair in a dark, sound-deadened room. 
Each was instructed to respond as rapidly 
as possible only to a flash signal by depress- 
ing a small switch lightly taped to his 
right hand. i 

The Ss were instructed to keep their eyes 
closed throughout the experiment. This was 
done in part to minimize possible eyeblink 
artifacts in electroencephalograms which 
were being recorded simultaneously. One 
S was studied on two separate occasions, 
once with eyes kept open and once with eyes 
closed. 

Each experimental session consisted of 288 
trials with 36 each of eight stimulus condi- 
tions. Six of these were flash followed by 
click at the following interstimulus intervals 
after flash onset: 20 msec., 40 msec., 60 msec., 


80 msec., 100 msec., and 120 msec. The 
other two conditions were flash presented 
alone and click presented alone. The 
eight conditions were presented in quasi- 
random order with the constraint that no 
pattern was repeated on the immediately fol- 
lowing trial Intertrial times ranged 48 
sec. A warning tone preceded each trial by 
12 sec. All experimental parameters and 
their sequencing were controlled by a stored 
program with the LINC computer. 

All Ss were given a 5-min. warmup before 
each session, which familiarized them with 
the task and established dark adaptation. In 
each session, a 10-min. rest period was given 
at the halfway point. 

A 14 channel DAS-100 Ampex system ser- 
ved for recording of all data on FM tape, 
including the visual and auditory event mark- 
ers and the motor response pulse. The RTs 
were measured to 1-msec. accuracy by means 
of a time-interval measuring program on the 
LINC. 


RESULTS 


Figure 1 shows the relationship be- 
tween RT to flash and the interstimulus 
interval separating the flash from the 
following click, and RT to flash alone 
in the choice setting. The points plot- 
ted are the means of each S’s median 
RT for each condition. The values ob- 
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Fic. 1. Mean reaction time in relation to 
flash-click interval; also RT to flash alone. 
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TABLE 1 
Parg-WisE CoMPARISONS OF RTs 
Choice RT (N = 7) Nonchoice RT (N = 6) 
Msec. Flash-Click Intervals (msec.) en Flash-Click Intervals (msec.) A 
40 60 80 100 | 120 40 60 80 100 | 120 
20 E ees =. ** + *» a. * * ** ^x »* 
40 3s WE ** *» »* ah ae aa, * » 
60 m. ** »* E EE [oi * 
80 wre * ** vert a * 
100 — s5 it zs 
120 * "A 
Note.—Statistically significant differences between RTs to the various stimulus configurations, testing all 
possible pairs, 
* p «i 
> <.01 


tained from the earlier nonchoice series 
are presented for comparison (L. Mor- 
rell, 1967). 

Two-way analysis of variance 
showed that treatment conditions were 
significant (p —.005). Table 1 shows 
the results of pair-wise comparisons 
of RTs for the different stimulus con- 
ditions using the T method of multiple 
comparison (Scheffé, 1959). With 
choice RT, RTs to each of the paired 
stimuli from 20-msec. to 120-msec. sep- 
aration was significantly faster than 
RT to flash given alone. Pair-wise 
tests of difference for the earlier non- 
choice series (also presented here for 
comparison) indicated that only the 
flash-click pairs at interstimulus inter- 
vals up to 80 msec, were reliably faster 
than RT to flash alone. With choice, 
RTs to stimulus pairs ranging from 20- 
msec. to 80-msec. onset asynchronies 
could be differentiated from RT to 
pairs separated by 120 msec. Further, 
RTs to both the 20-msec. and 40-msec. 
flash-click pairs were reliably faster 
than RTs to the 100-msec. pair. 

Using only the paired-stimuli obser- 
vations, linear regression analysis of 
the grouped data was performed. 
Choice RT to flash was a linear func- 


tion of the interstimulus interval be- 
tween the flash and the following click 
over the 20-120-msec. range. The 
slope of the function relating RT to 
interstimulus interval was .33 (p= 
01). Analysis of each S's data indi- 
cated that individual slopes did not dif- 
fer significantly from the group mean, 
although the intercept values did vary 
significantly among Ss. 

False responses.—The incidence of 
switch-depression responses on the 36 
trials in each session in which only 
bm was presented ranged 0-4 among 

s. 

Judgments of simultaneity.—Two Ss 
were rerun through the same experi- 
mental schedule used for choice RT 
on later occasions, with the instruction 
to indicate by manual response when 
the flash and click were adjudged si- 
multaneous. This pilot study indicated 
that simultaneity judgments were given 
9576 of the time for both the 20-msec. 
and 40-msec. flash-click pairs. The 
percentage of simultaneity response 
thereafter dropped steadily with in- 
creasing stimulus separation; at the . 
120-msec. interval, it was 2596 for one 
S and 0% for the other. 

Eyes closed vs. eyes open.—One S 
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was rerun in choice RT with his eyes 
kept open. Although his median RTs 
were generally faster in this condition 
as compared with eyes closed, the 
slopes of the function relating the RT 
to the interstimulus interval were not 
significantly different. 


Discussion 


These experiments demonstrate that RT 
to a photic signal may be facilitated by 
an auditory stimulus presented shortly 
afterward. In a choice RT a significant 
effect may be demonstrated with inter- 
stimulus intervals as long as 120 msec. 

The findings are similar to those re- 
ported by Reynolds (1966) in a uni- 
sensory task, which had the further dif- 
ference that S was also to respond to 
the second of the two visual stimuli. 
Reynolds further found the facilitative 
relation between the first RT and inter- 
stimulus interval to hold under varying 
conditions of temporal and event un- 
certainty. However, Helson and Steger 
(1962) reported that RTs were delayed 
in a task in which S was only to respond 
to the first of two visual stimuli; delay 
in RT as a function of interstimulus inter- 
val was found to be an inverted U func- 
tion, with maximum delay at 90 msec. 
Others have failed to replicate this in- 
hibitory effect (Koplin, Fox, & Dozier, 
1966; Lappin & Eriksen, 1964). 

The results are at variance with those 
reported by Helson (1964), who found 
that RT to a visual stimulus, while facili- 
tated at short intervals between that stim- 
ulus and an after-coming auditory one, 
was prolonged over the single signal con- 
trol value at intervals over 50 msec. One 
possible source of this difference in 
temporal range of effect may be that 
in the present series the photic signal was 
presented through closed lids to the dark- 
adapted eye. However, the findings on 
the S rerun under both conditions, with 
no difference in the obtained function, 
suggest that this may not be a critical 
factor. 

Reynolds (1964) suggested that psycho- 
physical summation may be an explana- 


tory factor in the observation of faster 
RTs to the first of two stimuli at shorter 
interstimulus intervals, even in the poly- 
sensory case. Some recent studies on 
auditory generalization gradients indi- 
cate that simultaneous presentation of 
auditory and visual stimuli may alter the 
gradient in the same manner as when the 
tone in fact was increased in intensity 
(Carter, Salive, & Zajonc, 1966; Dorf- 
man & Miller, 1966). The pilot explora- 
tion of perceptual simultaneity in the 
present study is also compatible with this 
interpretation. 

Hilgard (1933) has well described the 
differences in physiological time vs. 
physical time, in a study of visual-audi- 
tory interaction effects on the eyelid re- 
flex. His hypotheses, later used by 
Hershenson in a study more comparable 
to the present one, was that facilitation 
(in terms of latency or amplitude mea- 
sures) should be maximal when hetero- 
sensory stimuli were presented at inter- 
vals equal to the difference between their 
respective response latencies. Facilita- 
tion is attributed to a summation of “reac- 
tion tendencies.” Hilgard also noted 
that the response tendency is dispersed in 
time, and therefore the period of summa- 
tion would show a concomitant spread in 
time.  Hershenson observed that maxi- 
mum facilitation may be obtained at inter- 
stimulus intervals beyond the calculated 
physiological simultaneity. 

Whether the facilitation effects are 
largely mediated at earlier stages of signal 
processing (as in the psychophysical sum- 
mation hypothesis) or at later stages 
(closer in time to the elaboration of motor 
response), the present data indicate that 
the process may be graded, showing grad- 
ual decrement with increasing temporal 
separation. 
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VISUAL SEARCH IN A LETTER-CANCELING 
TASK REEXAMINED * 


DONALD A. SHURTLEFF ann MARION Y. MARSETTA 
The MITRE Corporation 


The relationship between search time and number of different targets 


searched for was reexamined in a letter-canceling task. 


4 Ss 


searched letter arrays and crossed out target letters when the target 


subsets included 1, 2, 3, 4, or 
present study, which showed 
various target subsets, 


5 different letters. 
little increase in search time for the 


The results of the 


were not in agreement with a previously 


reported finding in which search time increased markedly and in 


proportion to the number of tar; 
the earlier study interpreted the inci 


gets in a subset. The authors of 
rease in search time to represent 


a change in the temporal sequence of search operations for target 
letters committed to immediate memory. This interpretation was 


questioned. 


Recently, many experimenters con- 
ducting investigations of visual search 
have focused their attention on hypo- 
thetical, cognitive, and perceptual op- 
erations that are postulated to go on as 
an O is actively searching for a 
target in a visual field. In particular, 
these investigators have tried to assess 
the temporal sequence of these opera- 
tions during search and how these se- 
quences may be affected by the number 
of different targets which the searcher 
must find. Neisser, Novick, and Lazar 
(1963) had Ss search through ran- 
domly generated lists of alphanumerics 
and locate a target symbol where the 
subset of target symbols included itso; 
or 10 different alphanumerics. When 
S was well familiarized with the target 
letters and search routines, the authors 
found that the rate of searching non- 
target symbols was the same regardless 
of the number of different symbols in 
the target subset. Kaplan, Carvellas, 
and Metlay (1966) had Ss search 
through newspaper copy and cross out 
1-5 different target letters. They re- 

1 The research reported in this paper was 
sponsored by the Air Force Electronic Sys- 
tems Division, Air Force Systems 
under Contract AF19(628)-5165. 
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port that when S was less familiar with 
the target letters (target letters specified 
prior to each trial), the rate of search- 
ing nontarget letters slowed as the 
number of different targets in a subset 
increased. 

To account for the differences be- 
tween Neisser's and his findings, Kap- 
lan (1966) theorizes that search opera- 
tions for symbols committed to imme- 
diate memory (just-learned targets) 
are undertaken sequentially, while 
search operations for symbols com- 
mitted to long-term memory (well- 
learned targets) are undertaken simul- 


taneously. Kaplan et al, (1966) 
states, 
Neisser and his associates demonstrate that 


search [operations] for well-learned targets 
is simultaneous; the present study indicates 


that search [operations] for targets just spe- 
cified is sequential [p. 493]. 


Sequential presumably means that it 
takes twice as long to search nontarget 
letters with two different targets, and 
three times as long with three different 
targets than it does with one target. 
While Kaplan did not offer a formal 
definition of operations, it seems that 
he is using this term in the same way 
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as Neisser (1963), who defines it as 
the perceptual and cognitive processes 
by means of which S is able to recog- 
nize or differentiate one letter from 
another. 

While the theory is plausible, and, if 
verifed, would increase our under- 
standing of factors related to visual 
search, there are features of Kaplan's 
et al. (1966) search situation which need 
to be considered before this theory may 
be accepted. One feature is the 
arrangement of target and nontarget 
letters in the lists through which S 
searches. In the Kaplan et al. study, 
search lists consisted of newspaper copy 
in which the number of nontarget let- 
ters between successive occurrences of 
target letters decreased progressively 
as the number of different target letters 
in a subset increased. In order to 
obtain a quantitative estimate of how 
the number of nontarget letters may 
have varied in the Kaplan et al. study, 
a sample of newscopy was taken and 
an estimate of the median number of 
nontarget letters between successive 
target letters was made. The results of 
this sample are plotted in Fig. l. As 
one can see, there is a consistent de- 
crease in the number of intervening 
nontarget letters as the number of 
target letters in a subset increased. 

The number of nontarget letters in- 
tervening between adjacent target let- 
ters was confounded, therefore, with 
the number of targets in a subset in 
the Kaplan et al. study and could ac- 
count for the reported linear increase 
in search time. A linear increasing 
function may be obtained by dividing 
a nonlinear increasing function (total 
search time) by a nonlinear decreasing 
function (number of nontarget letters), 
provided the two nonlinear functions 
are of opposite slopes and the proper 
form. Such confounding of factors 
could not have occurred in the Neisser 
et al. (1963) study since S searched 
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Fic. 1. Median number of intervening 
nontarget letters between adjacent target 
letters estimated from a sample of newscopy. 


through the same number of nontarget 
letters in all conditions (1, 5, or 10 
targets). 

One way to assess the potential effect 
of the confounded factor on search be- 
havior is to set up a situation similar 
to the Kaplan et al. (1966) in which 
the number of different target letters in 
a subset and the numbers of nontarget 
letters intervening between target let- 
ters do not covary. The present study 
was concerned with S's search time in 
locating and canceling subsets of 1, 2, 3, 
4, or 5 different target letters with the 
number of intervening nontarget let- 
ters held constant for all subsets of 
targets. 


METHOD 


Subjects—The Ss were four employees of 
The MITRE Corporation. All Ss had 20/20 
near acuity and none of them had previously 
participated in an experiment on visual 
search. 

Search lists—Arrays of 5 rows and 10 
columns of letters were typed on standard 
white bond paper. There were two type- 
writer spaces between letters in a row and 
two typewriter spaces between successive 
rows of letters. An IBM Selectric type- 
writer with a MITRE font (similar to 
standard light gothic) was used to type 
the arrays. By use of a table of random 
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numbers, five letters, G, I, K, R, and S, 
were selected from the alphabet to be used as 
the target letters, i.e., those letters which Ss 
were to search for in the arrays. These 
letters were used to make up subsets of 
targets which included 1, 2, 3, 4, or all 5 
of the letters. 

Ten arrays were made up with a subset 
composed of only one target letter. Each 
of the five letters was used an equal number 
of times; thus each letter was a target in 
2 different arrays. In 3 of the 10 arrays 
(with only a single letter to look for) the 
letter appeared five times, in 4 of the arrays 
the target letter appeared six times and in 
the remaining 3 arrays, the target letter 
appeared seven times. With different num- 
bers of target letters to find, S would be 
less likely to use the number of letters so 
far located in a trial as a basis for ending 
his search of the array. 

The locations of the target letters in the 
arrays were selected by use of a table of 
random numbers with the restriction that at 
least one target letter appear in the last 
row of the arrays. The remaining locations 
in the arrays were filled with nontarget let- 
ters, which were selected by a table of ran- 
dom numbers from the other 21 letters of 
the alphabet. Ten arrays were made up in 
a similar manner for each target subset of 
2, 3, 4, or 5 letters. For each of these sub- 
sets of target letters, the 10 arrays were 
similar to those described for a target subset 
of a single letter in that the locations of 
the target letters, the nontarget letters, and 
the frequency of occurrence of target letters 
were the same. The only difference was 
that the particular letters making up target 
subsets of 2, 3, 4, or 5 letters were inserted 
in the locations in the arrays provided for 
target letters. A procedure was worked 
out so that each of the five letters was used 
equally often as a target when the subset 
included 2, 3, 4, or 5 letters. For example, 
the letter I was used just as frequently as 
the letters, G, K, R, and S when $ had to 
search for 2, 3, 4, or 5 letters in a subset. 
A second set of 50 arrays was made which 
were identical to the first 50 arrays with 
the exception that the letter sequence of 
each array was reversed. 

The letters had a brightness of 40 ftl. and 
a background brightness of 210 ftl. when 
illuminated from a distance of 12 in. by a 
100 w. incandescent bulb. The height of 
the letters subtended an angle of approxi- 
mately 16 min. of arc at the eye. Each 
sheet on which a letter array Was typed was 
covered with a clear plastic jacket. The 


letters showed clearly through the plastic 
and Ss canceled the target letters by mark- 
ing the plastic with a china marking pencil 
directly on the positions in the array where 
the target letters appeared. 

Procedure—Each S searched through 
all 100 arrays. The order with which the 
arrays were given to Ss was arranged so 
that potential effects arising from practice 
or fatigue were distributed equally over the 
five subsets of target letters. 

The arrays were mounted in a reading 
stand approximately 20 in. from S. The 
S sat facing the reading stand with his fore- 
head against a rest and was given a china 
marking pencil to use for canceling the tar- 
get letters. An array was covered by a 
white piece of cardboard which was pulled 
away by E to expose the letter array at 
the start of a trial. The E started a stop- 
watch coincident with the exposure of an 
array and stopped the watch when S had 
finished searching. The S indicated he had 
finished searching by saying the word “stop.” 
The S was instructed to work as quickly as 
possible without making errors. 

At the start of each trial, E read aloud 
the letter or letters to be located in the 
array. The time required to search an array 
and the errors made were recorded. 


RESULTS 


The estimated search times per letter 
for each of four Ss are shown in Fig. 
2B & 2C. These search times were ob- 
tained for arrays with a target fre- 
quency of five only. Analysis of 
variance showed that the interaction 
between target frequency and target 
subset was not a significant source of 
variance indicating that the slopes of 
the curves for arrays with a total of 5, 
6, or 7 target letters did not differ. 

The time required to search through 
an array was converted to search time 
per letter so that the findings of the 
present study could be compared more 
easily with those of Kaplan et al. 1966, 
(Fig. 2A), who reported their findings 
jn search time per nontarget letter. 
In the present study, the search times 
per letter were derived by dividing the 
search time for an array by the total 
number of letters in the array. These 
search times are merely a convenient 
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Fic. 2. A. Search time per nontarget let- 
ter for the Kaplan study. (The data points 
were estimated from Fig. 4 of his report.) 
B.. Search time per letter for the first trial 
of the present study. C. Search time per 
letter for all trials of the present study. 


fiction derived for this comparison, 
since they do not represent the actual 
time spent per letter. Components of 
the time to search an array, such as 
the time needed by E to start and stop 
the timer, and the time required by S 
to make cancel marks, were not re- 
moved from the total search time. 
However, these spurious components 
of search time have no effect on the 
slope of the curves of Fig. 2B and 2C, 
which are of primary interest, since 
they may be assumed to be the same for 
each of the five target subsets. Al- 
though it is not possible to compare the 
absolute values of search time per letter 
of the present study with those reported 
in the Kaplan et al. (1966) study, it 
is possible to compare the slopes of 
curves derived from each study. 

The primary point of interest of 
Fig. 2 is that the slopes of the curves 
for the present study are much less 
steep than those reported by Kaplan; 
these differences are clearly evident by 
visual inspection alone. The curves 
showing search time per letter for the 
first trial in each subset is perhaps the 
most interesting since this represents 
S’s search ability before he has become 
well familiarized with the letters mak- 
ing up the subset. In a few instances, 
omissions were made on the first trial 
of a subset so the next trial had to be 
used. As Fig. 2 shows, there is only 
a slight increase in search time per 
letter during the first trial of each 
subset. 

The slope of the curves for all trials 
(Fig. 2C) in the present study is simi- 
lar to those obtained for the first trial; 
however, the slope of the curve does 
change with practice, as is shown in 
Fig. 3, where the search time for one 
S given extended practice in this task 
is plotted. Each practice session is 
based on 100 trials (search through 
100 arrays), 20 for each number of 
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Fic, 3. Search time per letter for one S 
who was given additional practice in the 
search task. 


targets in a subset. It may be seen 
that by the fifth practice session his 
search time is approximately the same 
for each number of targets in a subset. 


Discussion 


When the number of intervening non- 
target letters is controlled, some slowing 
in the rate of search occurs as the number 
of targets in a subset increases, but the 
increase is not linear nor of sufficient 
magnitude to postulate the occurrence of 
sequential search operations. It may be 
concluded from this finding that the linear 
search function reported by Kaplan et al. 
(1966) was caused by an artifact in their 
procedure, namely, a systematic decrease 
in the number of intervening nontarget 
letters. 

A more direct comparison is possible 
between the present data and those of 
Kaplan et al. (1966) if the latter are cor- 
rected to show search time per letter (the 
same scale used in the present study) 
rather than search time per nontarget 
letter. The conversion of the Kaplan 
data is accomplished by calculating the 
proportion of nontarget letters from Fig. 
1 of the present study. Figure 1 reveals 


the ratio of NT/NT +T. The Kaplan 
data (Fig. 2A) are in search time ( ST 
per nontarget letter or ST/NT. When 
the Kaplan data are multiplied by the 
proportions derived from Fig. 1, search 
time per letter is obtained (ST/ 
NT/NT + T=ST/NT+T). These 
corrected search-time scores for the esti- 
mated averages of the Kaplan data are 
presented in the fourth column of Table 
1. The corrected scores are markedly 
similar to those of the present study 
(Fig. 2B), and the corrected function is 
clearly not linear. 

The findings of the present study are 
like those reported by Neisser, Novick, 
and Lazar (1963), in that while the rate 
of searching for a subset of five different 
letters was initially slightly slower than 
that for a single letter, these small differ- 
ences in search rate were eliminated with 
practice. 
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45 male apparatus-naive undergraduate Ss were assigned to 1 of 3 
displays associated with performance of a modified rotary pursuit 
task. Performance of a control group, practicing the reference task, 
was compared with that of groups which received either mechanical- 
visual guiding cues or augmented feedback during training, and during 
transfer, when all 3 groups practiced the reference task. Both 
forms of supplemental cues produced superior performance during 
training, but augmented feedback produced best results. During trans- 
fer, the augmented feedback group retained its superiority, but the 
group trained by means of mechanical-visual guidance lost approxi- i 
mately 50% of its pretransfer score. The results are discussed in 
the light of hypothesized influences on learning resulting from cue 


states present during training. 


Recently, reviews by Adams (1964), 
Annett (1961), Bilodeau and Bilodeau 
(1961), and Bilodeau (1966) have de- 
voted considerable attention to the role 
alterations in the information condi- 
tions accompanying the performance of 
motor tasks play in influencing skill. 
Despite considerable disagreement in 
experimental findings, it has been gen- 
erally demonstrated that supplementing 
S’s knowledge of his own error can 
produce both performance and learning 
benefits, provided that the augmented 
stimuli do not in themselves become 
sources upon which S can depend for 
primary guidance of his responses. 
Gordon and Gottlieb (1967) have re- 
cently demonstrated that off-target 
visual signals presented whenever S 
was off target, produced performance 
benefits both during training and dur- 
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ing transfer on a modified RP task 
when the aiding cues were removed. 

While studies of the effects of aug- 
menting stimuli have been focused on 
efforts to enhance or alter the nature 
of the information received by S as a 
consequence of responding, consider- 
ably less attention has been directed to 
the effect of alterations in guiding cues 
presented in advance of any response 
by S. Guiding cues differ from feed- 
back in that they are always present 
regardless of S's action, whereas feed- 
back cues vary as a function of S’s 
response, although they may be con- 
sidered to constitute a form of inter- 
mittent guidance as well. On the other 
hand, the provision of added guiding 
cues designed to augment input cues, 
can have the effect of altering feedback 
information if the nature of the guid- 
ance is such as to markedly affect re- 
sponses. 

Although it has been pointed out that 
attempts at cue augmentation which 
provided a source of primary guidance 
as well as enhanced feedback informa- 
tion generally lead to performance dec- 
rements when the extra cues are re- 
moved (viz., Annett, 1961; Micheli, 
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1966) while nonguiding feedback does 
not (viz, Gordon & Gottlieb, 1967), 
relatively few studies have directly ex- 
amined this question. Bilodeau and 
Bilodeau (1958), using an intertask 
transfer design, studied performance on 
a self-paced tracking task, in which S’s 
responses on one version were re- 
stricted to the criterion path, and on 
another were mechanically unguided. 
Performance under guided conditions 
proved to be markedly superior. How- 
ever, no evidence of differential trans- 
fer was reported between the two con- 
ditions, thus leaving indeterminate the 
question of learning differences on the 
two tasks. A reanalysis of their data, 
however, shows considerable: evidence 
of the possibility of differential learn- 
ing. 

The present research was conducted 
to compare the effect of two alterations 
in display of information on perform- 
ance of the RP task. One display 
modification was a form of mechanical- 
visual guidance, and the other, visual 
off-target augmented feedback. In an 
effort to determine the relative effects 
on learning, of training under the two 
display conditions, both forms of cue 
augmentation were removed after a 
period of practice, and the performance 
of the previously aided groups was 
compared with that of the reference 
task group. 


METHOD 


Subjects —Forty-five apparatus-naive, male 
undergraduate students between the ages © 
18 and 22 were assigned by an unsystematic 
schedule to one of the three experiment 
groups. The Ss were students at Nassau 
Community College, Garden City, N. Y. and 
were paid volunteers. 

Apparatus—The apparatus has already 
been described in Gordon and Gottlieb 
(1967). It was constructed by Shaw Labo- 
ratories, Syosset, N. Y. It consists of a 
light box display unit which provided a 
lighted target square 75 X 25 in. which ro- 
tated on a 3.50-in. orbit. An E control unit 


provided controls to set target rpm, The 
target was set to rotate clockwise at 60 rpm, 
thus generating a 22-in/sec rate of movement 
of the center of the target. A transparent 
glass plate covered the rotating target disc. 
The S’s manipulandum consisted of a rigid 
stylus which was held in the preferred hand. 
The tip of the stylus was § in. and had a 
Pin. aperture into which was fitted a 
photoelectric cell. The sensitivity of the 
photocell was set so that when the center 
of the. stylus passed over the edge of the 
target square, the scoring circuits were ac- 
tivated and deactivated in the opposite direc- 
tion. Performance was recorded on a 6 v. 
dc clutch-operated Standard Electric timer 
and a Sodeco impulse counter. A Sodeco- 
type TCeZ4PE predetermining impulse 
counter was used to control trial lengths. 
The target motor was kept on throughout the 
running of individual Ss to insure speed 
constancy. 

Displays—The tracking display was hori- 
zontal to the floor and 38 in. above the plat- 
form on which S stood. Three task ver- 
sions were used and provided the group 
names to which Ss were assigned : 

1. Reference task (NOTAF) consisted of 
an 18-ip. diameter black field bounded by a 
larger gray square (20 X 20 in. less than 
the black cutout). The black field was a 
portion of a rotating Lucite disc painted 
black, from a portion of which the paint was 
masked, revealing the target which was a 
J5-in. X 75-in. square. The target square 
was illuminated from beneath, making it 
visible to S and, at the same time, providing 
the light which the stylus photocell was 
sensitive to. 

2. The 
(OFTAF) pro ided the identical informa- 
tion described in 1 above, but in addition, 


after S lost contact with the target and 


get. 

i3 Mechanical-visual guidance (MG) con- 
sisted of a .75-in. thick clear Lucite plate 
mounted on top of the display surface. A 
path approximately 75 in. wide was cut 
from the Lucite plate, thus exposing the 
target orbit, and allowing the stylus to main- 
tain contact with the glass plate over the 
target. As a result, S was restricted to 
lag or lead errors during tracking, but had 
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Mechanically quided 
MG 


Reference task 
NOTAF 


Fic. l. Schematic representation of the 


task displays. 


a total of 8 in. of free play within the 
target path ring. The MG Ss saw the 
same target as Ss in the other groups. The 
two basic displays are illustrated in Fig. 1. 

Procedure,—The three groups of Ss were 
tested on two consecutive days in testing 
periods approximately 30 min. each. They 
were randomly assigned to each treatment 
group each day a series of treatments were 
commenced. The entire experiment was 
conducted over a 6-wk. period. Twenty-one 
20-sec. trials were given during the first 
session, and 21 during Session 2. Each 
three-trial block was separated by a 1-min. 
rest, and there were 20-sec. pauses between 
adjacent trials within a block for score-re- 
cording purposes. Trial spacing was se- 
lected so as to minimize reactive inhibition. 
During the second session the same work- 
rest schedule was used as in Session 1. Ss 
received four more three-trial training blocks 
for a grand total of 33 20-sec. trials, and 
then after a 1-min. rest Groups OFTAF 
and MG transferred to the NOTAF condi- 
tion. Group NOTAF simply continued the 
same task practiced during the training 
period. 

The Ss were given initial instructions 
which explained that they were participating 
in an experiment on motor skill learning and 
were given instruction in the use of the 
stylus. They were asked to keep the stylus 
in contact with the glass plate surface at all 
times, and to maintain the tip in a horizontal 
position, Stylus position was continuously 
monitored by E to see to it that instructions 
were followed. The S was also told that 
he would be informed concerning his per- 
formance when the experiment was com- 
pleted and the standard response to any in- 
quiry during the progress of the experi- 
ment was, “You are doing quite well.” The 
walls of the MG condition were lubricated 
with a silicon-graphite compound to prevent 


binding, and all of the Ss were able to move 
the stylus quite freely around the circle of 
the display. All Ss were told to try to keep 
within the target square, and the augmented 
and guided groups were told the purpose of 
the extra information. When Groups 
OFTAF and MG were transferred to the 
reference task display, they were told: “We 
want to see how well you can perform with- 
out the guide (or light) from now on.” 


RESULTS 


A comparison of the time-on-target 
(TOT) performance of the three treat- 
ment groups is presented in Fig. 2. 
On the first day of practice, both MG 
and OFTAF surpass the reference 
group by about 12%. During the 
training trials on the second day, 
OFTAF exceeds both of the other 
groups by about 12%, and MG and 
NOTAF perform at about the same 
level. During transfer, when all groups 
practiced the reference task, Group 
OFTAF, though showing a downward 
trend, is superior to NOTAF, and MG 
has experienced a drop of about 1296 
from its pretransfer level. 

A somewhat different picture is pre- 
sented in Fig. 3, where mean numbers 
of target contacts (hits) per three-trial 
block are compared. Hits of Group 
MG Ss are lowest during training and 
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Fic. 2. Comparison of time on target 
on three RP displays. 
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Fic. 3. Comparison of target contacts 
on three RP displays. 
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during the second session the hits of 
Ss practicing MG and OFTAF are 
equivalent and lower than those of 
NOTAF. On transfer, Ss in Group 
MG experience a 4596 increase in hits 
whereas the remaining two groups 
maintain their performance levels ex- 
cept for exhibiting a slight rising trend 
probably associated with fatigue. 

It is noted that during training, MG 
Ss achieve an early superiority to 


TOT (viz, Fig. 2). It was desired 
to examine the relative development of 
performance of the three groups. 
Trend analyses of variance of the TOT 
and hits data, using orthogonal poly- 
nominals were conducted for the 11 
three-trial training blocks. The results 
of these analyses are presented in Table 
1. 

Overall significant differences in 
mean TOT between NOTAF, OFTAF 
and MG were found, the means being 
34.71, 40.30, and 39.80 sec., respec- 
tively, and the critical difference at 
p< .05— 4.76 sec. (Dunnett), On 
the hits measure, NOTAF accumulated 
the greatest mean hits (67.9/min), 
OFTAF was next lowest (63.2) and 
MG least (53.2). Only the overall dif- 
ferences between NOTAF and MG 
were significant (critical difference, 7.6 
hits). The presence of overall signif- 
icant interactions, and significant inter- 
actions in the trend components led to 
closer examination of the data. With 
respect to TOT, it can be seen that it 
is Group MG (viz., Fig. 2) which ex- 


NOTAF Ss. However, OFTAF Ss ( : 7 
also achieve an initial superiority in hibits the major difference in trend 
TABLE 1 
REPEATED-MEASURES ANALYSES OF VARIANCE AND TESTS FOR TRENDS DURING 
ELEVEN THREE-TRIAL TRAINING BLOCKS 
TOT HITS 
Source 
df MS F af MS F 
close Hai 
Displ 2 1,615 3.84* 2 9,330 143** 
Error (a) 42 421 42 1,256 
Trials 10 3,257 171.53** 10 2,844 28.16 
Displays X Trials 20 130 6.84** 10 384 3.80 
Error (b) 420 19 420 101 
creer 1 29,837 1,570.37** 1 12,455 123.3277 
Displays X Trials 2 1,202 63.26** 2 1,502 14.87: 
o pi 1 | 1506 19.26** 1 | 12,074 | 119.54** 
Displays X Trials 2 14 x 2 2416 | 20.95 
Nr eis 1 1,119 58.80** 1 3,709 36.72** 
Displays X Trials 2 39 -— 2 198 — 


* p c. 05. 
** p <.01. 
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Fic. 4. Comparison of mean-time-per-target 
contact scores on three RP displays. 


in relation to the other two groups. 
The OFTAF and NOTAF groups 
seem parallel throughout training, al- 
though at the end of training, OFTAF 
diverges somewhat from NOTAF. In 
examining the hits measure (Fig. 3), 
it can be seen that the trend for MG 
is fairly linear and downward, but that 
OFTAF first incurs more hits than 
NOTAF, and later crosses over and 
achieves significantly fewer hits. Both 
TOT and hits must be taken into ac- 


8 During Trial Block 1, OFTAF has a 
mean TOT of 20.33 sec/min and NOTAF, 
15,33 sec/min. During Trial Block 11, 
OFTAF has 5230 sec/min and NOTAF, 
47.60. This amounts to a fairly constant ad- 
vantage in TOT from the outset for Group 
OFTAF. However, OFTAF acquires 62.9 
hits/min and NOTAF, 56.6/min in Trial 
Block 1. In Trial Block 11, OFTAF hits 
have reduced to 46.6/min and NOTAF to 
55.1 hits/min. This represents a crossover 
in hits, indicating a change in relative skill. 
"The combination of both measures to indicate 
mean time per hit reveals that during Trial 
Block 1, NOTAF had a mean time of 29 
and OFTAF a mean time of .30 sec/hit and 
were thus about equal in skill. At Trial 
Block 11, NOTAF had .86 and OFTAF 
1.12 sec/hit, representing a gain of .26 sec/hit 
in favor of OFTAF, (the equivalent of + 
of a target revolution) which indicates that 
Ss in Group OFTAF had gained in ability 
to stay on target for 5.7 in. per target rev- 
olution more than Ss in Group NOTAF. 
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count in order to obtain an appropriate 
understanding of the factors that con- 
tribute to performance. This is illus- 
trated in Fig. 4, where TOT has been 
divided by total hits during each trial 
block, and the derived meantime per 
hit curve has been plotted. Taking 
these data into account, it can be seen 
that MG achieves early superiority in 
mean target contact time, whereas 
OFTAF gradually achieves superiority 
to NOTAF over a number of trials. 
After the seventh trial block, OFTAF 
diverges rapidly from NOTAF and 
achieves greatest mean time per target 
contact. Thus viewed, the trends ob- 
served on the individual measures take 
on added meaning, for the number of 
hits required to achieve a given level 
of TOT has an important bearing on 
actual skill. 

Table 2 presents a comparison of the 
mean performance per three-trial block 
during the last nine trials of training 
and the nine transfer trials. Differ- 
ences among the groups were tested for 
significance by means of an analysis of 
variance during the training and sep- 
arately during the transfer periods, and 
the display effects were clearly signif- 
icant. Differences between training 
and transfer within each group were 


TABLE 2 


ComPaRISON OF MEAN TOT AND MEAN Hits 
DURING THE Last NINE TRAINING AND 
NINE TRANSFER TRIALS 


Train- Trans- 
ing SD ka SD t 
TOT 
OFTAF 17.065 | +1.10 | 17.214| +1.11 = 
NOTAF | 15.125 | +1.62 | 15.424 | +1.65 — 
MG 14.84 | +2.11 | 13.044 | +2.29 | 5.65* 
Fs 7.61 21.73 
HITS 
OFTAF 16.16 | +3.9 15.9. | 44.3 — 
NOTAF | 19.8. | +3.5 | 19.8. | +3.5 — 
G 15.4, | 23.6 | 2322, | +4.1 7.65* 
Fe 6.30 13,13 


Note.—Individual means marked by common sub- 
scripts differ significantly at p < .05 (2-tailed E when 
tested by Dunnett's ¢ statistic (Winer, 1962, p. 89). 

eae significant below the .01 level, df = 2, 42. 

? 


GUIDANCE AND AUGMENTED FEEDBACK IN ROTARY PURSUIT 29 


TABLE 3 


COMPARISON or MEAN Time (SEC.) PER 
TARGET CONTACT DURING TRAINING 
AND TRANSFER 


; BU pur 
Group Training Block Transter Block 
OFTAF 1.06 1.08 
NOTAF 0.76 0.78 
MG 0.96 0.56, 


tested by t tests. Only the perform- 
ance of Group MG differed before 
and after transfer on both measures, 
differences between the other groups' 
pre- and posttransfer scores were not 
significant. At the end of training, 
OFTAF is superior to NOTAF, but 
NOTAF and MG do not differ in 
TOT. During transfer NOTAF is 
significantly poorer than OFTAF, but 
superior to MG. With respect to hits, 
NOTAF is inferior to OFTAF (more 
hits) during training and transfer, and 
inferior to MG during training but 
superior (fewer hits) during the trans- 
fer period, in the sense that greater 
number of hits imply a lower order of 
skill. Taking into account both TOT 
and hits, Table 3 presents the mean 
time per each target contact during the 
three three-trial blocks before and after 
transfer. These data illustrate that 
both extra-cue groups achieved mean 
times per hit that were almost identi- 
cal during training and clearly superior 
to the reference group. Removal of the 
extra cues resulted in a marked 
drop in mean time per hit for Ss in 
Group MG (a drop of about 50%), 
whereas Group OFTAF maintained its 
pretransfer superiority to NOTAF. 


Discussion 


The findings of the present experiment 
confirm the previous findings of Gordon 
and Gottlieb (1967) to the effect that a 
discrete off-target signal presented when 
S was in error improved performance 


during training, and left a superior resid- 
ual effect on performance after the aiding 
signal was removed. A further step was 
taken in an effort to compare the effects 
of the added error signal with the effects 
of augmenting external guiding cues 
achieved by mechanical-visual response 
guidance (cf. Bilodeau & Bilodeau, 1958). 
During the training period, the perform- 
ance of MG Ss (given mechdnical-visual 
guidance) achieved early superiority, but 
Group OFTAF (augmented feedback) 
Ss overtook and exceeded them. By the 
end of training, both augmented groups 
were superior to the unaided reference 
task, as seen in Fig. 4. MG Ss’ TOT 
scores though lower than OFTAF's, were 
bolstered by a low level of hits, which 
allowed them to achieve a level of skill 
superior to NOTAF Ss. When the 
augmenting cues were removed, Ss in 
Group MG experienced a 1296 loss in 
TOT and a 50% rise in hits, despite the 
fact that prior to transfer they were 
performing movements which Ammons, 
Ammons, and Morgan (1958) have 
pointed out were associated with high de- 
grees of skill. The OFTAF Ss’ on the 
other hand, maintained their relative su- 
periority to NOTAF Ss, indicating that 
they acquired responses which MG Ss 
failed to. 

These results require the explanation of 
two kinds of effects: (a) Why does aug- 
mented feedback appear to lead to su- 
perior learning ? (b) Why does mechan- 
ical-visual guidance not lead to greater 
transfer of training after its removal? 

In summary, the advantage of aug- 
mented feedback appears to derive from 
its tendency to enhance S's motivation 
and more importantly in that it directs S’s 
attention to the information which stems 
from matching his responses with already 
available exteroceptive cues (cf. Annett, 
1961; Bilodeau, 1966). The poorer 
transfer results achieved by MG Ss ap- 
pear to stem from the fact that guiding 
of responses may increase the likelihood of 
a more correct response set, but does not 
necessarily foster acquisition of such re- 
sponses. The Ss operating under the 
MG condition made fewer errors than 
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NOTAF Ss, but were undoubtedly pay- 
ing less attention to their responses than 
to the guiding cues. The added cues in 
Task MG were invariant, and as a result 
this may have tended to interfere with 
the development of responses designed to 
reduce S's dependence on the continual 
flow of information from the task dis- 
play. The OFTAF Ss (and NOTAF 
Ss) were provided with guiding cues 
which were alone insufficient to lead di- 
rectly to a high level of performance. As 
a result, Ss in group OFTAF appear to 
have been led to a trial and error learn- 
ing process superior to that of NOTAF 
Ss because of the presence of discrete 
feedback signals which diminished in 
frequency as performance improved. 

The present findings suggest that two 
interrelated strategies are important de- 
terminants of skill in tracking tasks which 
have predictable and/or repetitive re- 
sponse elements. One strategy which ap- 
pears to dominate early responses is 
aimed at the prediction of external events. 
The second strategy appears aimed at the 
chaining of discrete responses to form 
response units of increasing length and 
probably consists of a trial and error 
learning process in which knowledge of 
error plays an important role. In the 
present task, the response chaining 
Strategy occurs because the speed of rota- 
tion of the target prevents full and im- 
mediate utilization of the standard set 
of exteroceptive guiding cues. This is 
evident in the training differences be- 
tween MG and NOTAF. The acquisi- 
tion of longer units of correct response se- 
quences serves the purpose of filling in the 
gaps created by Ss’ inability directly to 
utilize intrinsic task guidance. The end 
results of the utilization of the two strate- 
gies is less frequent scanning of the ex- 
teroceptive cue set for prediction attempts, 
made possible by longer chains of correct 
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responses. In time a certain balance in 
the utilization of the two strategies is 
achieved. 

The extra cues of task MG appear to 
have caused S$ to utilize the initial 
strategy at the relative expense of at- 
tention to response sequence learning. 
On the other hand, augmented feedback 
(OFTAF) appears to have promoted the 
response sequence learning process, prob- 
ably by directing S's attention to his 
errors. It is also tenable that OFTAF, 
by providing a cue-set which diminished 
with improved performance, may have en- 
hanced S’s motivation. Further work is 
required to separate associative from 
motivational factors. 
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COLOR-WORD INTERFERENCE: 


IL. AN INVESTIGATION OF THE ROLE OF VOCAL CONFLICT 
AND HUNGER IN ASSOCIATIVE PRIMING* 
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Associative priming was studied under conditions of color-word con- 


flict and hunger. 
in high- 
Hungry and sated Ss, 
cognitive controls, 
conditions. Priming of food 


were required 
and neutral words was measured by the 


Food-related and neutral words were presented 
and low-conflict modifications of the Stroop color-word task. 
characterized by either constricted or flexible 


to name colors in these modified 


frequency of their occurrence on a subsequent word-association test. 
Results showed that significant priming effects occurred only follow- 


ing performance on 


Directive effects of hunger were also noted 


the high-conflict form of the color-word task. 


only in this condition. 


Furthermore, analysis revealed that individual differences in suscepti- 


bility to color-word 
also contributed to 
results may have been 


conflict and associative style (ie, 
the priming effects. It was suggested that these 
due to arousing properties of conflict which 


commonality) 


caused a heightened potential for discharging the interfering words, 


The color-word task consists of color 
names printed in noncongruous 
colored inks (eg. the word “red” 
printed in the color green). The Ss 
must name the colors, ignoring the 
words in which they are embedded. 
Most Ss find this difficult to do and 
typically performance is marked by 
stumbling, slowness of response, and 
color-naming errors, all characteristic 
of vocal-response conflict. This phe- 
nomenon, first reported by Stroop 
(1935), is also known to occur when 
words other than color names are 
printed in the color-word format and 
has been attributed to semantic char- 
acteristics of the embedded words 
(Klein, 1964). Considering this 
source of color-naming interference, it 
is surprising to find that few of the 
distracting words can be remembered 


1This paper is based upon a PhD dis- 
sertation submitted to the Department of 
Psychology, New York University, 1966. 
The author wishes to express his apprecia- 
tion to George S. Klein for his guidance and 
criticism during all phases of this research. 
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(ie. recalled or recognized) imme- 
diately following task performance 
(Gardiner, Holzman, Klein, Linton, 
& Spence, 1959; Klein, 1954). 
Recently, Grand and Segal (1966) 
explored whether colored words could 
be recovered by a procedure more in- 
direct than recall or recognition. They 
used a word-association technique of 
“priming” (Cramer, 1965; Segal, 
1962; Segal & Cofer, 1960; Storms, 
1958). In this method, associates with 
known frequencies of occurrence to cue 
stimuli (Russel & Jenkins, 1954) are 
presented to Ss in a pretask (priming 
condition). Following this, a word- 
association test is administered which 
includes the cue stimuli for the pre- 
viously primed associates. Priming 
has the effect of increasing the fre- 
quency with which these associates 
are given to their cue stimuli on the 
word-association test. This increased 
frequency provides an indirect measure 
of the effect of the word’s prior ex- 
posure and may be a more sensitive 
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measure of this effect than is recall or 
recognition. 

Grand and Segal (1966) showed 
that associates exposed via a color- 
word conflict task were primed signif- 
icantly better than when they were ex- 
posed in a task which stimulated rela- 
tively little conflict, that is, one in 
which Ss were instructed to merely 
cross out all vowels in the exposed 
associates. As expected, recall of the 
colored words was very poor. How- 
ever, recall following the vowel-cross- 
ing task was significantly better. It 
was suggested that these effects were 
due to vocal conflict between the word- 
naming and color-naming response. 

. That is, the conflicting nature of the 
task resulted in subthreshold activation 
of the words, insufficient to facilitate 
their recall, but sufficiently potent to 
indirectly facilitate their emergence on 
the word-association test. 

The present study was designed, in 
part, to explore this assumption more 
systematically. If the color-word con- 
flict facilitates priming, then varying 
the degree of conflict should alter the 
strength of the priming effect. To 
achieve this purpose, the color-word 
task was modified to vary its conflict- 
inducing potency. 

A second goal of this study was to 
explore the effect of hunger on color- 
word priming. Previous studies have 
shown that hunger increases sensitiv- 
ity to hunger-related stimuli (Atkin- 
son & McClelland, 1948; Chein, Levine, 
& Murphy, 1942; Epstein & Levitt, 
1962; Lazarus, Yousem, & Arenberg, 
1953; Sanford, 1936, 1937; Wispé, 
1954; Wispé & Drambarean, 1953). 
Might hunger selectively sensitize Ss 
to food words in a color-word context? 
Tf so, it would be reasonable to expect 
increased interference with color nam- 
ing from these words. Might it also re- 
sult in heightened food-word priming? 
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Klein (1954) has shown that Ss whose 
color-word performance is especially 
disrupted by the words (i.e., constricted 
Ss) also show the effects of need in 
behavior. Would such individual dif- 
ferences (ie., constricted and flexible 
cognitive controls) be related to the 
priming effect of hunger ? 

In the present study, selected neutral 
and hunger-related associates were ex- 
posed in color-word priming tasks 
modified to vary conflict. Hungry and 
sated Ss, characterized by either con- 
stricted or flexible cognitive controls, 
performed under conditions of high and 
low conflict. Priming of neutral and 
hunger-related associates was subse- 
quently measured on a word-associa- 
tion test. A third group of Ss, shown 
nonsense words in the color-word for- 
mat, acted as a control for the effects 
of color naming on word associations 
and provided base-line data against 
which to evaluate experimental effects. 


METHOD 


Fourteen weak to medium strength asso- 
ciates to Kent-Rosanoff (K-R) cue stimuli 
(from Segals Brooklyn College norms?) 
were selected for priming. Half the words 
formed a food category, and half formed a 
nature category. Table 1 lists the proper- 
ties of these associates. 

The 14 associates were typed in two forms 
of the color-word task: a high-conflict 
Standard color-word sheet, on which the 
words were typed in the colors red, yellow, 
green, and blue; a low-conflict Segregated 
color-word sheet, on which the words were 
typed in black on background patches of 
the same colors. A third form of the color- 
word task consisting of nonsense syllables 
(consonants) printed in the four colors 
(Klein, 1964) was used as a control for 
effects of color naming on word associations. 
The 14 words were each repeated six times 
on the experimental pages and arranged 
randomly. A page of colored asterisks, com- 
parable in color arrangement and number 
of letters was used to measure base-line 
color-coding efficiency. ^ Special scoring 


?Segal S. J. Personal communication, 
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TABLE 1 
KrNT-RosaNorr CUE WORDS, PRIMED ASSOCIATES AND THEIR PROPERTIES 
Cue Word Associate Pru Type Thorndike-Lorge 
ndex 
Food Category 
Fruit Vegetable 6.6 Tertiary A 
Salt Sugar 9.3 Secondary AA 
Cheese Bread 11.3 Primary A 
Sour Grape(s) 15.0 Secondary 34 
Stove Cook 19.0 Primary AA 
Sheep Lamb 19.6 Primary 45 
Stomach Food 20.6 Secondary AA 
Nature Category 
Moon Stars 1.6 Tertiary AA 
Dark Night 10.6 Secondary AA 
Earth Ground 12.0 Primary AA 
Cold Warm 14.0 Secondary AA 
River Stream 16.6 Secondary AA 
Music Sound 18.0 Primary AA 
Deep Shallow 21.3 Primary 27 


sheets enabled E to track S’s performance 
and record errors and reading time for each 
task. Speed of color naming and errors in 
color naming (ie. misidentifications of 
colors, breakthrough of words, false starts, 
stuttering, omissions, articulated pauses), 
corrected for speed and error rates in naming 
colors alone, were used to measure conflict. 

Constricted and flexible cognitive controls. 
—Individual differences in susceptibility to 
color-word interference were determined by 
subtracting S’s base-line color-coding time 
from his color-word page reading time 
(CW-CA). The mean difference score for 
each condition was used to divide Ss into 
high-conflict constricted and low-conflict 
flexible subgroups. 

W ord-association test.—This consisted of 
14 K-R stimulus words for the associates 
which appeared on the color-word task, and 
14 buffer stimuli from the K-R list, each 
of which had a strong primary associate (a 
frequency of 50% or more) in the norms. 
Since previous studies (eg. Segal & Cofer, 
1960) have shown that priming is related to 
the extent to which S’s associations agree 
with the norms (ie, commonality), re- 
sponses to these buffer items provide a con- 
trol for this S' variable. Two arrangements 
of K-R stimuli were used in the association 
test to control for effects of order of presen- 
tation. Position of critical stimuli was, kept 
constant, but stimulus order was varie 

A color word was considered primed only 
when it was given in response to its own 
K-R cue stimulus on the word-association 
test. Instances in which a word from the 


color-word page appeared as an associate 
to some other cue stimulus on the word- 
association test were not considered in this 
report. Two priming scores were assigned 
to each S: one score reflected the number 
of food associates primed; a second score re- 
flected the number of nature associates 
primed. 

Measure of hunger.—Four indexes of hun- 
ger were measured: (a) number of hours 
since last eating (estimated to the nearest 
15 min.); (b) subjective hunger (rated on 
a 7-point scale anchored by the points "not 
hungry at all" and "extremely hungry"); 
(c) S's estimate of the amount of his 
favorite food he would be able to eat at the 
time of testing (rated on a 6-point scale 
anchored by the points “none at all" and 
“as much as I could get"); (d) S's esti- 
mate of time until next expected meal (to 
nearest 15 min). Scores on the four in- 
dexes were combined to form a composite 
hunger index, The Ss were divided at the 
mean of the composite index for the entire 
sample to form hungry and sated subgroups 
within each conflict condition. Hunger rat- 
ings on each of the four indexes were com- 
parable within each condition. 

Subjects and design—One hundred and 
eighty undergraduate students of New York 
University served as Ss. They were both 
male and female, between the ages of 16 
and 38, The Ss were assigned at random 
to the Standard, Segregated, and Control 


3I am indebted to Joseph DeRivera for 
this suggestion. 
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conditions, and tested individually. After 
testing was completed the 60 Ss in each con- 
dition were assigned to hungry and sated 
subgroups, on the basis of their scores on 
the composite hunger index, and constricted 
and flexible subgroups, on the basis of their 
color-word interference scores. Since this 
procedure resulted in subgroups of unequal 
size, an unweighted means analysis of vari- 
ance for repeated measures (Winer, 1962, 
p. 337) was required to study the effects 
of conflict conditions, hunger, and cognitive 
controls on the priming of food and neutral 
associates. 

Procedure,—The Ss were asked to com- 
plete the hunger rating scales immediately 
prior to testing. Following this they were 
handed a 9X ll-in. card on which the 84 
color words, or color-nonsense word com- 
binations, were printed and told to name 
the colors quickly and accurately and to 
ignore the words. Reading speed was timed 
by a stopwatch visible to S. Errors were 
recorded on the specially prepared scoring 
sheet. Following this task, Ss were given 
an oral word-association test containing the 
14 cue stimuli known to elicit the items 
which appeared on the conflict page, and 
the 14 buffer cue stimuli. At the conclusion 
of this test, Ss were asked if they could re- 
call any of the words which appeared on 
the color-word page. The Ss were en- 
couraged to guess. Following the recall 
period the page of colored asterisks was 
shown to Ss and they were asked to name 
colors on this page quickly and accurately. 
Time and errors were recorded. Finally, 
Ss were asked to name the words on the 
experimental page as quickly and accurately 
as they could. (This procedure was omitted 
for control Ss.) ^ Time and errors were 
again recorded. At the conclusion of the 
experiment Ss were informally interviewed 
about their experience and the nature of the 
experiment was explained to them. 


RESULTS 


Analysis of conflict.—In order to 
determine whether the Standard and 
Segregated conditions differed in their 
conflict-inducing potency, an analysis 
of covariance was applied to the color- 
word page reading-time scores. This 
analysis controls for differences be- 
tween conditions in base-line speed of 
color coding. A test of the within- 
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TABLE 2 


ADJUSTED MEAN READING TIME FOR 
STANDARD, SEGREGATED, AND 
CONTROL CONDITIONS 


Cond. 
Standard | Segregated Control 
M 72.06* 61.49 60.79 
SD 15.70 8.82 6.96 
* p = .005. 


class regression indicated that the indi- 
vidual regression lines were homo- 
geneous, F (2, 175) = 2.65, p> .10, 
permitting the use of a pooled regres- 
sion line for this analysis. Table 2 
shows the adjusted mean reading times 
for the three conditions. As expected, 
the difference between conditions in 
speed of color naming is highly signif- 
icant, F (2, 175) = 27.11, p< .005. 
Differences among the means of the 
conditions were assessed by Duncan’s 
new multiple-range test (Edwards, 
1960). Comparisons reveal that read- 
ing time in the Standard cendition is 
longer than in the Segregated condi- 
tion and the Control condition (p = 
001). However, the Segregated con- 
dition is not significantly different from 
the Control condition.* 

"The presence of the word retards the 
speed of color naming only in the 
Standard condition where the physical 
Structure of the word also carries the 
color. The Segregated condition pro- 
duces little more difficulty than non- 
sense words embedded in the colors. 
Thus, the data confirm the expectation 
that the Standard condition is more 
conflict inducing than the Segregated 
condition. 

Priming.—Preanalysis of the prim- 
ing data indicated that S's style of re- 
sponding to the association test (i.e., 


* Separate analysis of the error scores re- 
vealed essentially the same results. 
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commonality) was significantly cor- 
related with priming in each condition, 
r (58) 42, p € 0l: in both the 
Standard and Segregated conditions. 
Therefore, the analysis of variance de- 
sign was expanded in order to factor 
out this S variable. A commonality 
score was assigned to each S by total- 
ling his number of primary associates 
to the buffer items on the word-associa- 
tion test. Within each condition Ss 
were divided at the mean commonality 
score to form high- and low-commonal- 
ity subgroups. 

The results of the analysis of the 
priming data reveal that the only main 
effects which reach significance are due 
to the color-word conflict conditions, 
F (2,156) = 8.15, p< .005, and com- 
monality, F (1, 156) = 14.22, p< 
005. The main effects of hunger and 
cognitive controls were not significant. 

Conflict and priming.—Table 3 
shows the mean number of primed as- 
sociates in each condition. The order 
of effect is consistent with the conflict- 
inducing potency of the conditions. 
Duncan's new multiple-range test re- 
veals that the Standard condition yields 
significantly more primed associates on 
the word-association test than the Seg- 
regated condition and the Control con- 
dition (p —.01). While the low-con- 
flit Segregated condition shows à 
trend in the same direction, it does not 
prime words significantly better than 
the Control condition. 

Since conflict is indicated by in- 
creased reading time, it is possible that 
priming merely reflects longer exposure 
to the words in the Standard condition. 
To test the effect of exposure time, cor- 
relations between reading time on the 
color-word pages and the summed 
priming scores for food and nature 
words were computed in the Standard 


5 The conditions did not differ significantly 
on this commonality index, F (2, 176) = 83. 


TABLE 3 


PRIMING FREQUENCY IN THE STANDARD, 
SEGREGATED, AND CONTROL 


CONDITIONS 
Cond. 
Standard | Segregated | Control 
M 3.43* 2.82 2.27 
SD 1.67 1.48 1.57 


! Note—The frequency of occurrence of these key 
associates in the Brooklyn College Norms (N = 300) 
is 2.02. The Control Ss in this study gave these asso- 
ciates we slightly higher frequency. 

p «.0t. 


and Segregated conditions. In neither 
condition was a significant relationship 
observed, r (58) = 04; r (58) = 15, 
respectively. Furthermore, since er- 
rors reflect momentary deflections of 
attention to the interfering words, 
priming might have been due to height- 
ened attention to words in the Standard 
condition. Chi-square analyses of the 
relationship between primed words and 
those words on the color-word pages 
associated with color-naming errors 
also failed to reach significance, 
X: (1) =2.09, p> 10; x? (1)= 
48, p > .30 for the Standard and Seg- 
regated conditions. As an additional 
check on whether the conditions pro- 
vided unequal opportunities for learn- 
ing, recall of the exposed words was 
compared in the Standard and Seg- 
regated conditions. Mean recall in the 
Standard condition amounts to 177 
words (SD — 1.37), and in the Seg- 
regated condition, 1.55 words (SD = 
93). This difference was not signif- 
icant, £ (118) = 1.04, p> 30. 
Commonality and priming.—Since 
the primed words were all primary, sec- 
ondary, or tertiary associates in the 
norms (Segal, 1962) it is not surpris- 
ing that these associates would be more 
readily available in high-commonality 
Ss. The importance of commonality 
for this study lies, however, in the fact 
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that it interacts significantly with hun- 
ger and cognitive controls, F (1, 
156) = 5.03, p < .01. Table 4 shows 
that for hungry Ss this associative 
tendency in interaction with a tendency 
to be constricted in naming colors 
on the conflict page produced the 
maximum priming effect for the total 
sample of Ss. 

Not only does the high-commonality, 
hungry, constricted subgroup surpass 
all low-commonality groups in amount 
of priming, but this group primes sig- 
nificantly more words than its flexible 
counterpart, F (1, 156) — 829, p< 
.005, and approaches significance when 
compared to its sated counterpart, F 
(1, 156) = 3.59, p « .10. 

Hunger and selective  priming.— 
Hunger not only contributes to the 
overall priming effect, it also contrib- 
utes to the selective priming of food 
words. Table 5 shows a significant 
interaction of Conditions x Hunger x 
Stimuli, F (2, 155) = 5.58, p < .005. 

Hunger produces a significant posi- 
tive trend for food-word priming in the 
Standard condition, r (58) = 27, p < 
.05. Both the Segregated and Control 
conditions show nonsignificant trends 
in the opposite direction. The only 
comparisons of interest which reach 


TABLE 4 


PriMING FREQUENCY AS A FUNCTION OF 
Commonatity, HUNGER, AND 
COGNITIVE CONTROL 


High 
Commonality 


Low 
Commonality 


Cond. 
M SD | N M|SD|N 
Hungry 
Constricted | 4.18, | 2.27| 17 | 1.93| 1.14| 14 
Flexible 2.86, | 1.22| 22 | 2.59| 1.40| 22 
Sated 
Constricted | 3.22, | 1.51| 18 | 2.61| 1.56| 23 
Flexible 3.31 | 1.35| 26 | 2.37| 1.59| 37 


Note.—a vs. c significant at .10 level; a vs. b signifi- 
cant at .005 level, S 


TABLE 5 


FREQUENCY OF Foop AND NATURE Worps 
GIVEN BY HUNGRY AND SATED Ss IN 
EACH CONDITION 


Food Nature 
Cond. 
M SD M SD | N 
Standard 
Hungry | 2.11, | 1.39 | 1.56, | 1.34 | 27 
qoae 1.39, | 1.32 | 1.854 | 1.03 | 33 
regated 
Hungry | 1.26 .84| 1.65 | 1.28 | 23 
Sated 149 | 1.12} 1.27 |1.02|37 
Control 
Hungry .80 .81| 1.28 | 1.27 |25 
Sated 121 |1.02| 1.21 | 1.07 |34 


Note,—d vs. b significant at .10 level; a vs. c sig- 
nificant at .05 level; a vs. b significant at .025 level. 


significance occur in the Standard con- 
dition. Food-word priming, in this 
condition, is significantly greater 
among hungry Ss than sated Ss, F (1, 
155) = 5.35, p < .025. Furthermore, 
hungry Ss prime more food words 
than nature words, F (1, 155) = 4.20, 
p < .05, while sated Ss tend to prime 
more nature words than food words, 
PSL, 155): = 3,36, 5 « 10. 

A significant four-way interaction of 
Conditions x Hunger x Cognitive 
Controls X Stimuli, F (2, 155) = 3.73, 
p < .05, pinpoints the source of these 
hunger effects in the constricted Ss in 
the Standard condition. The results 
of this condition are presented in 
Table 6. 

Note that the significant correlation 
between hunger and food-word priming 
is mainly attributable to differences be- 
tween sated and hungry constricted Ss, 
F (1, 155) = 7.63; p<'.01. Con- 
stricted Ss also mainly account for the 
difference between food and nature 
words noted above; hungry, constricted 
Ss prime more food than nature words, 
F (1, 155) =861, p< .025, while 
sated, constricted S's prime more na- 
ture than food words, F (1, 155).— 
8.53, p< .005. Flexible Ss remain 


COLOR-WORD INTERFERENCE, VOCAL CONFLICT, AND DRIVE 37 


TABLE 6 


FREQUENCY OF FOOD AND Nature Worps 
GIVEN BY HUNGRY AND SaTED, CON- 
STRICTED AND FLEXIBLE SS IN 
THE STANDARD CONDITION 


Food Nature 

Cond. MOTS 
M | sD | M |SD|N 

Hungry 
Constricted | 2.70, | 1.94 | 1.50, | 1.27 | 10 
Flexible 1.76 | .83 |1.59 |141| 17 

Sated 

Constricted | 1.31, | 1.18 | 2.544 | .97 | 13 
Flexible 1.45 | 1.23 | 1.40 | .82| 20 


Note.—a vs. c significant at .025 level; a vs. b 
significant at .01 level; d vs. b significant at .005 level. 


totally unaffected by the priming condi- 
tions. Thus, not only is the overall 
priming effect restricted to the high- 
conflict Standard condition, but the di- 
rective effects of hunger, as well, are 
limited to Ss in this condition who ex- 
perience greatest interference from the 
conflicting words. 


DISCUSSION 


Most striking, in these results, is the 
consistency of effects in the Standard 
condition contrasted with the almost total 
lack of effect in the Segregated condi- 
tion; this, in spite of the similarity of 
stimulus conditions and the identical na- 
ture of the task requirements (ie, to 
name colors and disregard the words). 
Klein (1964) has attributed the words' 
power to produce interference with color 
naming to its capacity to arouse a dis- 
position to say it, that is, to the arousal 
of vocal-motor antagonism. Performance 
of the correct color-naming response 
requires an effort to “hold back” 
the competing word-naming response. 
Clearly, this requirement was more diffi- 
cult to fulfill in the Standard condition. 
It is in this sense that the Standard condi- 
tion was more conflict inducing than the 
Segregated condition. Possibly, priming 
resulted from this increased effort to in- 
hibit vocalization of the interfering words. 

Alternatively, one might consider atten- 


tional factors to underlie the priming ef- 
fect. It will be recalled that the Segre- 
gated condition consisted of words typed 
in black on colored backgrounds. Con- 
fronted with such configurations, Ss may 
have ignored the words by focusing their 
attention on the background patches of 
color. If so, priming may simply have 
been due to Ss’ attention to the words in 
the Standard condition. 

Although statistical tests of the rela- 
tionship between attentional factors (e.g., 
performance time and errors) and prim- 
ing failed to support this assumption, 
these tests do not provide a direct check 
upon Ss’ attention to the words in the 
Segregated condition. However, a more 
direct check of this is provided by the 
fact that recall in the Standard and 
Segregated conditions was equivalent ; 
moreover, the amount of recall in both 
conditions was no different from previ- 
ous studies of color-word recall (Gardner 
et al., 1959; Grand & Segal, 1966; Klein, 
1954). While this finding does not rule 
out the possibility that attentional factors 
played a role in producing the priming 
effect, it does indicate that Ss did not 
completely ignore the words in the Segre- 
gated condition. 

In the absence of adequate control of 
the attention variable, interpretations of 
the present data in terms of conflict are 
speculative. Yet, the possibility of recon- 
ciling the effects of attention with those 
of conflict does warrant some comment. 
Consider, for example, the fact that when 
Ss are confronted with color-word con- 
figurations they experience a strong tend- 
ency to say the word rather than the 
color names. The Ss report that this 
is so for both the Standard and Segre- 
grated conditions. Clearly, looking at 
the word activates the word-naming re- 
sponse. In order to activate the color- 
naming response $s must not only “hold 
back” the word, but must shift their atten- 
tion to the color aspect of the configura- 
tion (Klein, 1964). Although this could 
be done in the Segregated condition, 
thereby minimizing the tendency to name 
the words, the overlap of color and word 
in the Standard condition precludes this 
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possibility. ^ Squinting, abortive break- 
through of the words, increased voice vol- 
ume, and pitch, all attest to the persistent 
pressure to name the words.  Priming 
may be rooted in this persistent "tension 
for discharge" (Lewin, 1935). In this 
view attention per se may serve to trigger 
the word-naming response, but priming 
would be due to tension resulting from 
the effort to inhibit this response. 

Since cognitive controls reflect individ- 
ual differences in the ability to overcome 
word interference, the present interpreta- 
tion is called upon to account for the 
fact that this variable failed to yield a 
significant main effect. An explanation 
for this may lie in Klein's (1964) find- 
ing that color-word (CW) time is ac- 
curately predicted by the combined colors- 
alone (CA) and words-alone (WA) 
times. This indicates that the interfer- 
ence measure (CW-CA) contains effects 
of S differences in word-reading speed 
(ie, the time required to name the 
words) and is not a pure measure of the 
effort to "hold back" the competing 
words. Indeed, when the influence of 
variation due to coding colors (CA) and 
reading words (WA) was held constant, 
color-word time and priming scores were 
found to be significantly correlated in the 
Standard condition, partial r (58) = 28, 
P «.03. This finding supports the pres- 
ent view and furthermore suggests that 
the interference measure used here (CW- 
CA) may not have been sensitive enough 
to show these effects. 

Considering this possibility, the inter- 
action of cognitive controls with hunger 
and commonality may have been due to 
an intensification of the effects of interfer- 
ence under the influence of heightened 


ê It should be noted in this regard that 
most color-naming errors reflect a momen- 
tary breakdown of inhibitory control over 
the word-naming response. In this sense, 
they may be considered to be partial release 
phenomena. If attention produced the prim- 
ing effect it might have been expected that 
errors and priming would have been related. 
The fact that they were not creates some 
difficulty for an attentional interpretation of 
these data. 
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drive. Cognitive controls (Broverman, 
1960; Gardner et al., 1959; Klein, 1954) 
are assumed to reflect relatively enduring 
response tendencies. Brown (1953) has 
suggested that drive energizes dominant 
habit tendencies. Since constricted Ss 
are those who are particularly coerced 
by the interfering words (Klein, 1954) 
intensification of this style under height- 
ened drive could have led to increased 
priming, especially among Ss who were 
already prone to the priming effect (i.e, 
high-commonality Ss). 

Similarly, the directive effects of 
hunger may be understood in terms of 
a heightened tendency for hungry con- 
stricted Ss to respond to food words on 
the color-word page. It is interesting to 
note in this regard that when separate 
correlations were computed for the indi- 
vidual scales comprising the hunger in- 
dex, only Time Until Next Meal and the 
rated Amount of Food that Could be 
Eaten at the Time of Testing were sig- 
nificantly correlated with food-word prim- 
ing, r (58) —.32, p <.02; r (58) — 24, 
P X.06. This suggests that a commit- 
ment to eat may be associated with an 
increased tendency to respond both to food 
and its verbal surrogates. Considering 
the fact that Time Since Last Eating and 
Subjective Hunger were not related to 
food-word priming suggests that readi- 
ness to respond to food or its verbal sur- 
Togates may be a more potent index of 
hunger in laboratory studies than either 
deprivation or subjective feelings. A 
similar point has been made by Saug- 
stad (1966) and is consistent with studies 
of hunger under conditions of cognitive 
dissonance (Brehm, 1962; Brehm, Back, 
& Bogdonoff, 1964). 

The foregoing interpretations of the 
priming effect are tentative and require 
further study. One particularly in- 
triguing direction for future research is 
the possibility of linking color-word prim- 
ing to measures of physiological arousal. 
Such a possibility is suggested by re- 
cent work relating conflict to arousal 
(Berlyne, 1960; Johnson, 1963). If 
color-word performance is accompanied 
by arousal the priming effects obtained 
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here could be consistent with work by 
Lanier (1941a, b) and Kleinsmith and 
Kaplan (1963, 1964) showing remi- 
niscence for verbal material associated 
with measures of physiological arousal. 
Kleinsmith and Kaplan (1963) attribute 
their results to the perseverative consoli- 
dation of reverberating neural traces. If 
color-word performance also results in 
heightened arousal, traces established 
during this time may remain sufficiently 
activated to emerge subsequently in 
response to association-test ^ cues. 
Furthermore, the paradoxically poor re- 
call following color-word performance 
could be attributed to the fact that rever- 
berating traces would be relatively un- 
available for direct refiring during con- 
solidation (Kleinsmith & Kaplan, 1963). 
Future studies measuring physiological 
arousal during color-word performance 
are required to establish these specula- 
tions more firmly. 
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The hypothesis was advanced 


that prompt CS termination on CR 


trials reinfórces avoidance acquisition because it serves as an excellent 
discriminative cue for US avoidance rather than because it leads to 


fear reduction. It was shown 


termination could serve 


I that a cue other than prompt CS 
this function both in delayed (Experiment I) 


and in trace (Experiment II) avoidance conditioning paradigms. The 


important role played by anticipatory 


responses—CRs which are 


independent of the avoidance contingency—in avoidance acquisition 


was pointed out. 


The most widely held view of avoid- 
ance conditioning is the two-factor or 
two-process theory, as originally for- 
mulated by Mowrer and Lamoreaux 
(1942) and later elaborated by Solo- 
mon and his associates (Solomon & 
Wynne, 1954; Turner & Solomon, 
1962). The two factors or processes 
referred to are the classical condition- 
ing of an emotional (fear) response, 
upon which hinges the instrumental 
conditioning of the avoidance response 
itself. One of the major accomplish- 
ments of this theory was that it circum- 
vented the necessity of accepting avoid- 
ance of the unconditioned stimulus (US 
avoidance) as the reinforcing event on 
an avoidance trial; fear reduction, in- 
stigated by termination of the CS, was 
made to carry the burden of reinforce- 
ment. As Mowrer (1951) put it, pt 
response which is instrumental in avert- 
ing an impending noxious event gets 
reinforced, not by the nonoccurrence 
of that event, but by reduction in the 
intensity of the fear which foreshadows 
it [pp. 350-351].” Teken literally 
two-factor theory reduces avoidance 
learning to escape conditioning, i.e. 
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escape from the fear generated by the 
CS or related stimuli. 

This is not the place to marshall the 
considerable evidence which runs 
counter to escape interpretations of 
avoidance conditioning. Suffice it to 
say that the experimental evidence 
points increasingly to a lack of con- 
gruence between fear processes and 
avoidance conditioning (€-g., Kamin, 
Brimer, & Black, 1963), and to the 
fact that US avoidance can of itself 
serve as a reinforcing event on avoid- 
ance trials (e.g, Bolles, Stokes, & 
Younger, 1966; Taub, Bacon, & Ber- 
man, 1965). 

The present experiments are ad- 
dressed to the second point. One of 
the important sources of support for 
two-process theory is the undeniable 
fact that, in animals at least, avoidance 
conditioning is facilitated, often mark- 
edly, by allowing the instrumental re- 
sponse to terminate promptly the CS 
on avoidance trials. Perhaps, however, 
CS termination (on avoidance trials) 
facilitates avoidance conditioning not 
because it results in fear reduction but 
rather because it serves as an excellent 
discriminative cue for US avoidance. 
Put somewhat differently, prompt Cs 
termination on avoidance trials pro- 
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vides S with a highly discriminable cue 
for the contingency operating between 
the instrumental response and US 
avoidance. Our experiments will at- 
tempt to show that a cue other than CS 
termination can be made to serve this 
function, a cue furthermore that is 
never paired with shock onset or offset. 

Our second purpose is to call atten- 
tion to a distinction between anticipa- 
tory and avoidance responses. Under 
favorable experimental conditions a fair 
percentage of avoidance responses may 
be maintained even though the avoid- 
ance contingency has been specifically 
excluded, as in the classical aversive 
conditioning paradigm. Such responses 
are perhaps better called anticipatory 
rather than avoidance, responses. En- 
larging on this distinction we shall use 
the term conditioned response (CR) 
to refer to any instrumental response 
which occurs prior to the termination 
of the established CS—US interval; 
an avoidance response is a CR per- 
formed while the avoidance contingency 
is actually in effect; and finally, by an 
anticipatory response the authors shall 
usually mean a CR which occurs in 
the absence of the avoidance con- 
tingency. 


EXPERIMENT I 


The design of this experiment is 
similar to that of Kamin (1956), which 
was the first attempt to evaluate the 
Separate contributions of CS termina- 
tion (on avoidance trials) and US 
avoidance to the acquisition of an 
avoidance response. 


Method 


Subjects and apparatus—The 76 Ss that 
completed the experiment were female al- 
bino rats weighing 223-285 gm., purchased 
from Blue Spruce Farms, Altamont, N. Y. 
Eleven Ss were lost because of programming 
errors and apparatus failures. 

The apparatus consisted of two commercial 
rat chambers, Grason-Stadler E3125A-300 
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£ 
converted to single-bar boxes by removal of 
the right bar. Programming of trials and — 


data recording were automatic. 
Main experiment.—In Groups 1, 2, 3N, and 
3L escape responses produced prompt CS 


termination; CRs resulted in US avoidance, — 
but the discriminability of the latter event - 
was manipulated. Group 1 was a standard | 


discriminated avoidance group which re- 
ceived CS termination as well as US avoid- 


ance with each CR. Group 2 differed from ` 
Group 1 only in that prompt CS termina- - 


tion did not accompany avoidance responses; 
rather, the CS persisted for 8 sec. beyond 
the execution of the CR. Groups 3N and 3L 
were treated in the same manner as Group 2 
with the important difference that each 
avoidance response produced a distinctive 
avoidance cue which remained in effect for as 
long as the CS, ie, for 8 sec. In the case 


of Group 3N the CS was a white noise and ` 


the avoidance cue was provided by the illum- 
ination of three pilot lights; the white noise 


and visual cue exchanged roles for Group 


SL. It is important to observe that since 
the avoidance cue occurred only on avoidance 
trials it was never associated with either 
shock onset or shock offset. 

In Groups 4 and 5 avoidance was not pos- 
sible. In Group 4, prompt CS termination 
Occurred on escape trials but when a CR 
(ie, an anticipatory response) was made 
nothing at all happened. The US came on 
at its appointed time and-S had to respond 
again in order to escape the shock, at the 
same time terminating the CS. Group 5, 
on the other hand, was never allowed prompt 
CS termination. On escape trials the CS 
persisted for 8 sec. beyond the escape re- 
sponse, On anticipatory responses (CRs) 
the US was applied at the end of the CS— 
US interval and S had to respond again in 
order to terminate shock, the CS persisting 
beyond the latter response for 8 sec, Con- 


sequently, Ss of Group 5 never experienced ` 


US avoidance or prompt CS termination. 
There were 14 Ss in all groups but 3N and 
3L, which contained 10 Ss each. 

Control experiment —After completion of 
the main experiment two groups of Ss were 
run under identical experimental conditions 
except for the source of the Ss, who came 
from litters bred in our laboratory. One 
group of 8 Ss (Group 5R) was a replication 
of Group 5 above. A second group of 8 Ss 
(Group 5S) differed only in that on each 
trial that S made an anticipatory response, 
the CR, although it did not lead to US 
avoidance, resulted in the illumination of 
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three pilot lights for 8 sec. Thus the se- 
quence of events following a CR was presen- 
tation of the visual stimulus, onset of the 
US, execution of the escape response, ter- 
mination of the visual cue, termination of 
the CS. These groups were rum in an 
attempt to evaluate the effect on anticipatory 
responding of the avoidance cue provided 
Groups 3N and 3L. 

Training procedure.—T! he relevant training 
parameters were as follows. The CS— 
US interval was 5 sec. The CS for all 
groups but 3L was a white noise of approxi- 
mately 80 db. (cf. D'Amato, Keller, & 
DiCara, 1964). In Group 3L the CS was 
comprised by the illumination of three white- 
jeweled pilot lights on the front wall of the 
chamber (cf. D'Amato et al, 1964). In 
Group 3N this stimulus served as the avoid- 
ance cue, while the white noise served the 
same function for Group 3L. The ITI, 
programmed by a film tape, averaged ap- 
proximately 35 sec. The discontinuous 
shock employed had a 2 sec. on time and 
2.0 sec. off time and was delivered by a 
Grason-Stadler grid scrambling device. 

On Day 1 Ss were given 60 “shaping” 
trials under the experimental conditions ap- 
propriate to their group membership. On 
the first 30 or so of these trials continuous 
shock, set at approximately 25 ma., was em- 
ployed, followed by discontinuous shock of 
approximately the same intensity. On Days 
2 and 3, 300 training trials were adminis- 
tered, shock level being 3 ma. for most Ss 
on Day 2 and .6 ma. for all Ss on Day 3. 


Results 


Main experiment.—Asis the authors' 
custom, the 60 shaping trials of Day 1 
will not enter the analysis; also, since 
intertrial responding is extremely low 
in the experimental situation it has 
never been necessary to adjust CR 
scores to take this factor into account. 
The main results of the experiment are 
depicted in Fig. 1, which presents the 
300 trials of Days 2 and 3 in blocks of 
50 trials. Separate analyses of vari- 
ance based on the percentage of CRs 
achieved over the 300 trials of Day 2 
and Day 3 showed highly 
between-groups effects for both days. 
F (5,70) 2741, P< 001 and F = 
11.54, p < .001, respectively. 


MEAN * CRs 


DAY 3 


Daye f 
BLOCKS OF 50 TRIALS 


Fic, 1. Mean percentage CRs during 


Days 2 and 3 of Exp. I. 


Day 2.—It is apparent that the dis- 
criminative cue provided Groups 3N 
and 3L had little effect on Day 2. 
These groups are indistinguishable 
from Group 2, which did not have the 
benefit of an avoidance cue. Group 1, 
the controls, performed significantly 
better than Group 2 (= 2.11, p< 
05) ; however, the comparison between 
Group 1 and Groups 3N and 3L com- 
bined fell a trifle short of accepted sig- 
nificance levels (t= 192, p~ 06). 
On the other hand it is clear from Fig. 
1 that, even in the absence of prompt 
CS termination, the avoidance contin- 
gency facilitated avoidance acquisition, 
inasmuch as Groups 2, 3N, and 3L all 
performed at a higher level than Group 
4 (22537, P< 01, comparing the 
first three groups combined with the 
last.) 

The next result of note is the sur- 
prisingly high level of avoidance re- 
sponding observed during Day 2 in 
Groups 4 and 5, in which there was 
in fact no avoidance contingency oper- 
ating. Since the bar press response 
has a very low operant level in our 
situation, it is certain that the CRs 
observed in these groups arise from 
specific features of the experimental 
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manipulations. Apparently, prompt 
CS termination following escape re- 
sponses is not a necessary condition, 
inasmuch as a substantial level of antic- 
ipatory responding is evident in 
Group 5. The superiority of Group 4 
over Group 5 may reflect a genuine 
contribution from prompt CS termina- 
tion; however, in the present case the 
difference between the groups is far 
from significant (t = 1.36). 

Day 3.—The marked depression in 
avoidance performance evident during 
the early trials of Day 3 is probably 
due to the increase in shock level plus 
warm-up effects prevalent in situations 
such as the present one. In any event, 
it is perfectly clear that during the 300 
trials of this session the avoidance cue 
takes hold in both Groups 3N and 3L, 
raising their performance to the level 
achieved by Group 1. Performance 
in all three of these groups is statis- 
tically superior to that of Group 2 (p < 
-O1 in all cases). Group 2, in turn, 
is significantly better than Groups 4 
and 5 combined (t= 2.26, p< .05), 
but not Group 4 alone (t = 1.55). 

In spite of the raised shock level, 
Groups 4 and 5 maintained a fair level 
of CRs throughout the 300 trials of 
Day 3, Group 4 once again performing 
a bit better than Group 5. 

Responding during the 8-sec. post- 
CR interval—In our situation a re- 
sponse records upon bar depression, re- 
lease of the bar not being required. 
The question arises as to whether the 
superior performance of Groups 3N 
and 3L over Group 2 could be ascrib- 
able to Ss in the former groups holding 
down on the bar for substantial por- 
tions of the 8-sec. CS interval which 
followed each avoidance response, 
thereby profiting from a foreshortening 
of the interval separating response 
completion and CS termination. 

The authors calculated for each S the 


mean number of seconds of bar holding 
occurring during such post-CR inter- 
vals for all CRs performed during Day 
2 and Day 3. A simple analysis of 
variance based on this measure is sig- 
nificant for both days, F (2, 31) = 
4.04 and 8.89, p < .05 and < .001, re- 
spectively. On both days, however, 
Group 2 bar-holds significantly more 
than Groups 3N and 3L (p < .05 on 
Day 2 and < .01 on Day 3), with the 
difference between the latter two 
groups being far from significant. 

We also determined the mean num- 
ber of responses executed during the 
8-sec. post-CR interval for the 300 
trials of Day 2 and Day 3. Analysis 
of variance of the Day 2 data was sig- 
nificant, F (2, 31) = 4.33, p < .025, 
with Group 3L making significantly 
more responses than either of the other 
two groups, which did not differ sig- 
nificantly. On Day 3 these differences 
vanished (F = 1.51). It is difficult 
to fathom the meaning of the larger 
number of post-CR responses in Group 
SL on Day 2, although it may possibly 
be related to the fact that an auditory 
rather than a visual stimulus served as 
the avoidance cue in this group. In 
any event, it is clear that the superior 
performance of Groups 3N and 3L 
over Group 2 on Day 3 cannot be attrib- 
uted either to a larger amount of post- 
CR responding in the first two groups 
or to greater proclivity toward bar 
holding. 

Control: experiment.—It is conceiv- 
able that the facilitating effect of the 
avoidance cue provided to Groups 3N 
and 3L arises not from its discrimina- 
tive function but rather from some such 
process as stimulus novelty, admittedly 
a difficult hypothesis to evaluate 
cleanly. In any event the conditions 
of Group 5 were replicated in Group 
SR. Group 5S differed from 5R only 
in that each anticipatory response pro- 


AVOIDANCE DISCRIMINATION IN AVOIDANCE LEARNING 45 


duced the visual stimulus (for 8 sec.) 
that served as the avoidance cue in 
Group 3N. We assumed that any re- 
inforcing effect of the response-contin- 
gent stimulus ought to be revealed in 
a greater percentage of CRs in Group 
5S. This turned out not to be the 
case. The percentage of CRs on Day 
2 were 30 and 27 for Groups 5R and 
5S, respectively; on Day 3 the cor- 
responding figures were 34 and 27%. 


EXPERIMENT IT 


Although the results of Exp. I pro- 
vide evidence for the hypothesized dis- 
criminative role of CS termination, 
their generality will be extended by 
employing a trace conditioning para- 
digm, which often results in poorer 
avoidance acquisition than delayed con- 
ditioning. 


Method 


Subjects and apparatus —The Ss, 21 male 
albino rats born of dams purchased from 
Edgewood Breeders of Colonia, N. J., were 
110-150 days of age at the start of the 
study. They had participated in an experi- 
ment in which, as infants, their dams had 
been removed from them a few minutes each 
day; later, at 50 days of age, they served in 
open field tests. Five Ss were lost because 
of illness and technical difficulties, and two 
(one each in Groups C and T) were dis- 
carded because the authors were unable to 
shape them properly. The apparatus was the 
same as employed in Exp. I. y 

Procedure —Except where otherwise indi- 
cated the experimental conditions were as 
those of the previous experiment. The CS 
in all cases was the white noise stimulus em- 
ployed in Exp. I, and for all groups the 


ing. In Groups T and TS a trace condi- 
tioning paradigm was em] loyed. Group TS, 
however, had the advantage of a 5-sec. 
avoidance cue (illumination of the three 
pilot lights) which occurred each time an 
avoidance response was made. Note again 
that this cue was not presented on escape 
trials. During the 60 shaping trials of Day 
1 the duration of the CS in the two trace 


groups was 2 sec.; thereafter it was fixed 
atlsec. There were seven Ss in each group. 

In order to preclude the possibility in 
the trace groups that rapid avoidance re- 
sponses would be reinforced by CS termina- 
tion, responses which occurred during the 
presence of the CS (ie, responses with 
latency 2 sec. or less during shaping and 1 
sec. or less on subsequent trials) were not 
counted as CRs. The trial proceeded as 
though these responses were never made, 
and S had to respond again to effect either 
an avoidance or an escape response. To 
maintain conditions comparable across all 
groups the same arrangement prevailed for 
Group C. 

On Day 2, 500 training trials were given, 
the shock level being set at .3 ma. for the 
first 200 trials and .5 ma. thereafter. On 
Day 3, 200 further trials were given, 


Results 


Figure 2 presents the 700 condition- 
ing trials of Days 2 and 3. Analysis of 
variance of the percentage of avoid- 
ances over the 500 trials of Day 2 
proved significant, F (2, 18) = 5.53, 
p<.025. The difference between 
Groups C and T was highly significant 
(t= 3.29, p < 01), while the compari- 
son between Groups C and TS fell 
somewhat short of accepted significance 
levels (t= 204, p~ 06), Consist- 
ent with the results obtained in Exp. I. 
Groups C and TS converged during 
the 200 trials of Day 3, and both groups 
performed significantly better than 
Group T (p < 01 in both cases). 


MEAN % AVOIDANCES 


3 1 
pave. pay 3 
BLOCKS OF 100 TRIALS 


Fic. 2. Mean percentages of avoidance re 
sponses in the three groups of Exp. II. 
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Discussion 


The results of both experiments support 
the view that the major function of 
prompt CS termination on avoidance 
trials is serving as a discriminative cue 
for US avoidance. The fact that in both 
experiments the controls were superior 
on Day 2 to the grounds provided an 
avoidance cue is understandable when one 
takes into account the circumstance that, 
in the latter groups, the significance of 
the avoidance cue could be learned only 
on CR trials. In the control groups, on 
the other hand, absence of the CS (the 
product on CS termination) is associated 
with absence of the US throughout every 
intertrial interval; hence, CS termination 
should more quickly come to serve as an 
effective cue for US avoidance, 

Inasmuch as the avoidance cue was 
never correlated with shock onset or off- 
Set it seems rather remote to interpret its 
demonstrated reinforcing ability to a pre- 
sumed fear-reducing function, as that 
process has been elaborated within con- 
ventional two-factor theory. Perhaps one 
might enlarge on the recent results of 
Rescorla and LoLordo (1965) and main- 
tain that the avoidance cue had become 
a conditioned inhibitor of fear. The 
authors think this a tenuous reprieve, 
however. For one thing, in their study 
inhibition of fear was largely inferred 
from the avoidance behavior itself. For 
another, assuming that fear reduction had 
been independently verified there is the 
further problem of establishing that it 
alone was responsible for the observed 
changes in avoidance behavior, i.e., that it 
Was not a mere epiphenomenon. F; inally, 
it should be pointed out that it was the 
elicitation of established avoidance be- 
havior that was investigated in the Res- 
corla and LoLordo study, while our con- 
cern was with the reinforcing function of 
avoidance cues. 

Although anticipatory responses were 
early recognized as playing an important 
role in avoidance conditioning (Mowrer 
& Lamoreaux, 1942), they never have 
been adequately stressed, and indeed most 
relevant research has been designed in 


such a fashion that the contribution of 
these responses to avoidance conditioning 
is impossible to assess. Nevertheless, the 
extent and duration of anticipatory re- 
sponding observed in Exp. I suggests that 
it performs a central function in avoid- 
ance conditioning. Possibly many of the 
variables known to influence avoidance 
acquisition do so because of their effect 
on this process. For example, perhaps 
discontinuous shock (D'Amato et al, 
1964), relatively low shock intensities 
(D'Amato & Fazzaro, 1966), and one- 
way conditioning apparatuses (Theios, 
Lynch, & Lowe, 1966) all facilitate avoid- 
ance acquisition because they stimulate 
anticipatory responding. 

Other variables, particularly those as- 
sociated with the physical characteristics 
of the CS, may affect avoidance condition- 
ing through the role that they play in dis- 
criminating US avoidance, Thus, a more 
intense CS may facilitate avoidance ac- 
quisition, not because it stimulates an- 
ticipatory responding, but because its off- 
set serves as a more salient avoidance cue. 
The recent results of Bower, Starr, and 
Lazarovitz (1965), demonstrating a posi- 
tive relation between the amount of re- 
Sponse-produced change in the CS and 
avoidance acquisition, may be so inter- 
pretable. 
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15 Ss received light-on reinforcement and 15 
reinforcement for 25 min. each with yoked con 


was a red light (on for .1 sec. 


was the spontaneous GSR. After 
ly group that increased in responding was 
group. The experimental light-on group” 
ing. There was a significant interaction 
t-off variable and minutes of acquisition, the 
n responsiveness during acquisition while the 
In extinction, the experimental light-on 
the other 3 groups and the 2 light-off 
response persistance. The results were 


period was run. The onl: 
the experimental light-off 
showed the least respondi 
between the light-on/ligh 
light-off Ss increasing i 
light-on Ss went down. 

group responded less than 
groups showed substantial 


or 


Ss received light-off 
trols. Reinforcement 
off for .1 sec.). The response 
training a 15-min. extinction 


interpreted under the assumption that the red light was an aversive 


stimulus. 


The results of recent experiments 
purporting to demonstrate operant 
GSR conditioning (Fowler & Kimmel, 
1961; Kimmel & Kimmel, 1963; Rice, 
1966; Van Twyver & Kimmel, 1966) 
with visual reinforcement raise a num- 
ber of important theoretical issues. In 
these studies human Ss are run in 
total darkness and brief presentations 
of dim light are made contingent upon 
either responding or nonresponding. 
The Ss are told merely to relax and re- 
main alert. 

If it is assumed that the procedure of 
running Ss in the dark is analogous to 
depriving them of food or water, it 
would follow that a drive-reduction 
hypothesis could explain the obtained 
effects. However, it may be just as 


likely that the visual reinforcer has its’ 


effect merely because it interrupts the 
monotony of total darkness. Here the 
assumption is made that variation in 
stimulation is what is needed, rather 
than absolute stimulation of a par- 
ticular quality. 


1 Done under United States Public Health 
Service Grant MH-12262-02. 
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The present study was intended to 
comparatively evaluate the drive-reduc- 
tion and monotony hypotheses by rein- 
forcing some Ss (run in the dark) with 
light onset and others (run in the 
light) with light offset. A drive-reduc- 
tion explanation would require that 
better conditioning be achieved in the 
light-on condition, while the monotony 
explanation predicts no difference be- 
tween the two, since light onset and 
light offset both serve to interrupt the 
monotony of previously unchanging 
stimulation. 


METHOD 


Subjects and instructions —Eighty-two 
undergraduate students at the University of 
Florida served as Ss. The Ss of the same 
sex were run in yoked pairs and the propor- 
tion of males and females was constant among 
conditions. The Ss were otherwise assigned 
randomly to the four treatment groups. 
They were instructed to relax and remain as 
motionless as possible, while avoiding be- 
coming drowsy. 

Apparatus—Data were collected from 
pairs of Ss seated in the inner booths of an 
IAC 1200 series audiometric chamber. The 
GSR was picked up as a dc resistance 
change from the palm and back of S's right 


ry 
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by i-in. zinc electrodes in Lucite cups filled 
with saline electrode paste. The response 
was fed into a Biophysical Instruments Co. 
amplifier and, from this to a Texas Instru- 
ments Co. RectiRiter with a paper speed of 
3 in/min. 

The reinforcing stimulus was a circular 
red light, 1 in. in diameter, produced by a 
multiple stimulus projector) In the light-on 
condition, Ss were run in total darkness ex- 
cept when a reinforcement occurred. In 
the light-off condition, the red light remained 
on except when a reinforcement occurred, 
when it went off. In either type of rein- 
forcement, the duration of stimulus change 
was .1 sec. A 3-sec. time out period auto- 
matically followed each reinforcement, so 
that elicited responses would not be rein- 
forced, 

Procedure.—Each pair of Ss was seated 
in separate experimental booths when they 
arrived at the laboratory. Then, S's right 
hand was cleaned with acetone and the GSR 
electrodes attached. The lights were ex- 
tinguished and the doors closed, leaving 
half of the Ss in total darkness and half 
in the presence of the red light. Instructions 
were read by E over the intercom and ques- 
tions answered. 

A 10-min. period of no stimulus change 
was then run. During this period, the ampli- 
tude of spontaneous GSRs emitted by the 
experimental S was monitored and the 
automatic reinforcement control device was 
programmed so that only responses equal 
to or greater than 1% of the basal resistance 
would be reinforced. Following this habitua- 
tion period, a 25-min. acquisition period was 
run in which both Ss of the pair received 
appropriate reinforcement whenever the ex- 
perimental § emitted criterion responses. 
Following the acquisition period, 15 min. of 
extinction was run. At the conclusion of the 
extinction period, the electrodes were re- 
moved, a brief interview was conducted with 
each S separately, Ss were thanked, asked 


2 The assumption that changing from the 
previously used dim white light to a bright 
red light would be insignificant was, as 1s 
amply shown by the results, inaccurate. The 
change had been made for convenience. 

3 Eight pairs of light-on Ss were rejected, 
two pairs for failure to follow instructions, 
two pairs for equipment failure, two pairs 
due to E errors, and two pairs because one 
member of the pair failed to make any rein- 
forceable GSRs. Three pairs of light-off 
Ss were rejected, one pair for failure of one 
member to make any reinforceable GSRs 


not to discuss the experiment with anyone, 
and dismissed. 


RESULTS 


The basic dependent variable was the 
number of criterion responses per min- 
ute, transformed to VX + 1 and ex- 
pressed as a percentage of the average 
transformed response frequency during 
the last 5 min. of habituation. Figure 
1 presents the averages of these mea- 
sures in 2-min. blocks for the four 
groups during acquisition (omitting the 
first minute). The graph indicates 
that only the experimental light-off 
group showed a systematic tendency 
toward increased responding, while the 
experimental light-on group responded 
lowest of the four. 

Analysis of variance of these data 
revealed that the interaction between 
the light-on/light-off variable and 
blocks of reinforcement minutes was 
significant, F (11, 616) — 3.15 error 
MS = 320.54. This interaction stem- 
med from the fact that the two light-off 
groups combined showed a steady in- 
crease in responding while the two 
light-on groups combined remained es- 
sentially unchanged during acquisition. 


RELATIVE FREQUENCY OF RESPONDING 


Relative frequency (percentage) 


Fie. 1. 
of responding of the four groups during ac- 
quisition, in 2-min. blocks omitting the first 
minute. 


and two pairs because of equipment failure. 
Both Ss of a pair had to be rejected when 
either one was. There were, thus, a total 
of 60 Ss remaining, 15 in each group. 
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Fic. 2. Relative frequency (percentage) 
of responding of the four groups during 
extinction, in 2-min. blocks omitting the last 
minute, 


The extinction data, similarly trans- 
formed and grouped into 2-min. blocks 
(last minute omitted), are shown in 
Fig. 2. The most striking feature of 
these data is the low level of respond- 
ing throughout extinction of the experi- 
mental light-on group. Beyond that, 
the persistence of responding through- 
out extinction in both light-off groups 
is noteworthy. 

Analysis of variance of the extinc- 
tion data revealed no significant effects. 
Duncan's multiple-range test on the 
overall extinction group means, how- 
ever, showed that the experimental 
light-on group responded significantly 
less frequently than each of the other 
three groups. ($ < 01). 


Discussion 


On the basis of the results of several 
earlier studies, it was assumed that the 
experimental light-on group would dem- 
onstrate a higher level of responding than 
its yoked controls. Yet, as was shown in 
Fig. 1 and 2, the two light-on groups 
hardly differed during acquisition and 
they differed Significantly in the reverse 
direction from that expected during 
extinction. 

Among the possible factors that could 
have been responsible for this unexpected 


outcome, none was more likely than the 
fact that a different visual reinforcer 
had been used in the present study. 

It is possible that the red light was an 
aversive stimulus to S. It is apparent 
from the data in Fig. 1 and 2, particularly 
the latter, that the behavior of the experi- 
mental light-on group accords with the 
assumption that the light was aversive, 
Their controls, Occasionally receiving an 
aversive stimulus but unable to influence 
its occurrence, showed no appreciable 
change in rate of response during 
acquisition, 

The assumption that the red light may 
have been an aversive stimulus is in even 
greater accord with the acquisition and 
extinction behavior of the experimental 
light-off group. For these Ss each re- 
sponse emitted during acquisition would 
produce a brief respite from the aversive 
red light. A separate analysis of vari- 
ance of the acquisition data of only the 
two experimental groups showed that 
they diverged significantly across acquisi- 
tion minutes, F (11, 308) = 2.13, error 
MS = 375.20, showing that the light-on 
and light-off contingencies had opposite 
effects. Even the light-off control group 
showed some tendency to increase in re- 
sponding during acquisition, presumably 
because of fortuitous coincidences of re- 
sponses and light-off reinforcements, The 
two light-off groups’ persistent respond- 
ing during extinction, exceeds that of any 
extinction behavior previously observed 
in this laboratory (Kimmel, 1967). Even 
after 15 min. of extinction both groups 
were still responding at a rate higher than 
their initial rate some 40-45 min. earlier. 

The argument that the red light was 
aversive is also supported by noting that 
there was an overall tendency for the 
basal conductances of the Ss to increase 
during the course of the experimental 
procedure, on the average an increase of 
about 5 micromhos having occurred from 
the first minute of habituation to the last 
minute of extinction. This is rather dif- 
ferent from the conductance decreases 
typically observed in previous studies- of 
instrumental GSR conditioning with vis- 
ual reinforcement (eg, Van Twyver & 
Kimmel, 1966). 
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The question may be raised whether 
instrumental autonomic conditioning using 
aversive stimulation has been reported 
elsewhere. Aside from the early failure 
to achieve GSR shock-avoidance by 
Mowrer (1938), the evidence tends to 
support the possibility. Kimmel and 
Baxter (1964), Kimmel, Sternthal, and 
Strub (1966), and Kimmel and Sternthal 
(1967) have all reported evidence of suc- 
cessful shock-avoidance using the GSR. 
Grings (1964) has reported both shock 
avoidance and punishment effects, and 
Senter and Hummel (1965) have shown 
the suppressing effects of punishment 
training on the emitted GSR. The sug- 
gestion that punishment and escape train- 
ing may have operated in this study, thus, 
is in accord with these recent findings. 
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both dimensions, 
steep monotonic 


In previous studies of conflict asso- 
ciated with sport parachuting (Epstein, 
1962; Epstein & Fi enz, 1962; Fenz, 
1964), parachutists were tested with 
word-association tests containing built- 
in dimensions of words varying in rele- 
vance to parachuting. Almost invari- 
ably, the novice parachutists produced 
monotonic gradients of GSR and reac- 
tion time on a control day and steeper 
gradients on the day of a jump. With 
equal consistency; the experienced 
parachutists produced monotonic gradi- 
ents on a control day and inverted V- 
shaped curves on the day of a jump. 
Cross-sectional comparison of para- 
chutists with different levels of experi- 
ence as well as longitudinal investiga- 
tion of individual Ss revealed that the 
peak of the inverted V was displaced 
toward the low relevant end of the di- 
mension with increasing experience and 
mastery. The results could not be ex- 
plained by familiarity with words, as 
the inverted V-shaped curve appeared 
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from the Medical Research Council of 
Canada to Walter D. Fenz; and by Grant 
MH 01293 from the -National Insti 
Mental Health, to Seymour Epstein. 


tute of 


AND 


52 


SEYMOUR EPSTEIN 


University of Massachusetts 


only on the day of a jump, and mis- 
haps were observed to reverse the di- 
rection of development of the curve, 
The phenomenon was explained by 
assuming that with repeated, successful 
experience in coping with a source of 
threat, two developments take place: a 
broadening and steepening of a gradi- 
ent of conditioned anxiety and the de- 
velopment of a gradient of inhibition of 
anxiety, the latter becoming increas- 
ingly steeper than the former. Asa 
result, a highly adaptive warning Sys- 
tem is brought into existence which 
utilizes increasingly early anxiety reac- 
tions at increasingly low intensity 
levels. In the complete absence of 
danger, 
eventually be inhibited out of existence. 
The theory is presented in detail else- 
where (Epstein, in press) and need not 
be reviewed here, other than to note 
that one of its assumptions is that the 
inverted V-shaped curve results from 
inhibition that is cue specific for the 
area of threat. 

In the previous experiments with 
word-association tests, only a dimen- 
sion of Parachuting cues was used. 


The aim of the Present study was to 
“examine novice and experienced para- 


Y 


the anxiety would presumably» | 
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chutists with two dimensions, a para- 
chuting-relevant dimension and an 
emotion-arousing dimension unrelated 
to parachuting, in order to determine 
to what extent the inhibitory reactions 
of the experienced parachutist are cue 
specific and to what extent, general. 


METHOD 


Subjects —The Ss consisted of 16 inex- 
perienced and 16 experienced sport para- 
chutists. The former had made between 
one and five jumps at the time of testing, 
and the latter had made over 100 jumps. 
The Ss were recruited at the Orange Sport 
Parachuting Center at Orange, Massachu- 
setts, and were asked to take part in the 
experiment for the payment of the costs of 
one parachute jump. 

Materials —Two parallel forms of a word- 
association test were constructed, each con- 
taining a dimension of words at four levels 
of relevance to parachuting and a dimension 
of words at four levels of emotion-inducing 
value, but unrelated to parachuting. The 
words were selected from a larger group by 
two judges who rated the paracuting-rele- 
vant words on a 4-point scale for the de- 
gree to which they were relevant to para- 
chuting, and the emotion-inducing words on 
a similar scale for the degree to which they 
were presumed to be emotion-arousing. 
Only words rated the same by both judges 
were retained. Each word list contained three 
words per stimulus category, that is, three 
neutral, three low, three medium, and three 
highly parachute-relevant words, and three 
neutral, three low, three medium, and three 
highly emotional ‘words. The words were 
arranged randomly among 2 number 
buffer words. Five practice words were pre- 
sented at the beginning of each list. Fol- 
lowing are the word lists, with critical words 
capitalized, emotional words followed by the 
letter "e", parachuting words by the letter 
“p”, and increasing levels along the dimension 
by the numbers 14. 

Word List A*: FUN, DARK, WISH, SMOOTH, 


2]t was of critical importance that the 
emotion-arousing dimension unrelated to 
parachuting elicit reactions as great or 
greater tham the parachuting-relevant di- 


mension for experienced parachutists on à 


control day, as otherwise inverted V-shaped 


curves for the parachuting dimension could 
be attributed to an intensity factor, the 


semer HEIGHT (p-3), RIVER (p-1), 
GIRLS (e-2), WRITE (e-1), COVER- 
ALLS (p-2), TABLE, LIGHT, LOVE (e-3), 
WALL (e-1), NIPPLE (e-4) sart, DAY 
(p-1), EMBRACE (e-2), CHICKEN, SKY- 
DIVER (p-4), HUNGRY, TAXI (p-2), 
rapraror FIGHT (e-3), BICYCLE (p-1), 
HAPPY, COCK (e-4), GREEN, ALTITUDE 
(p-3), BEAUTIFUL, KISS (e-3), HEAVY, 
TAKE-OFF (p-2), CHILD, BAIL-OUT 
(p-4), BREAD (e-1), SUCK (e-4), roor- 
BALL, NUMBER, PILOT (p-3), RHINOCEROS, 
TOILET (e-2), ren, JUMP-MASTER 
(p-4). 

Word List B: WALL, RED, HAPPY, DESK, 
rox, AIRPLANE (p-3), UNCLE (p-1), 
ANGRY (e-2), TABLE (e-1), WINGS 
(p-2), RIDE, MEMORY, SHIT (e-3), SOAP 
(e-1), FUCK (e-4), CHAIR, sea (p-1), 
WOMAN (e-2), TIGER, PARATROOPER 
(p-4), ELEPHANT, SKY (p-2), READ, 
DANCE (e-3), PENCIL (p-1), KING, 
MASTURBATE (e-4), BLUE, STREAMER 
(p-3), SCHOOL, MARRY (e-3), CHEESE, 
AIRPORT (p-2), man, JUMP (p-4), 
BUTTER (e-1), BOWEL MOVEMENT 
(e-4), SOFT, TOWN, ALTIMETER (p-3), 
terrer MOTHER (e-2), KING, FREE- 
FALL (p-4). 

After having been screened for clarity 
of pronunciation, words were presented by 
a tape recorder at 30-sec. intervals. The Ss 
were instructed to respond as quickly as 
possible with the first word that occurred 
to them. 

Apparatus.—Skin resistance Was recorded 
by a Grass "Model 5 polygraph. Grass finger 
electrodes were attached to the index and 
middle fingers of the nondominant hand, and 
Bentonite electrode paste was used. Reac- 
tion time was recorded by a penmarker con- 
trolled by E. 

Office space was provided at the Orange 
Municipal Airport, at Orange, Massachusetts. 

Design and procedure.—The choice of an 
experimental design required consideration 
of the effects of (a) sequence, since all Ss 
were tested twice, (b) time of testing, i.e., 
whether the testing was done on the day of 
a jump, or on a control day a week from the 
jump, and (c) differences associated with 
word lists. 


stronger response tendencies being more 
likely to evoke inhibitory reactions than the 
weaker response tendencies. Thus it was 
necessary to select taboo words that could 
be expected to elicit. unusually strong emo- 
tional responses. 
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All effects were counterbalanced in an 
orthogonal factorial design. Eight Ss in 
each group were tested first on the day of a 
jump, and again one week after a jump; 
eight others were tested first on a week 
before a scheduled jump, and again on the 
day of the jump. The Ss agreed to refrain 
from jumping for one week preceding the 
“before jump” testing and for one week fol- 
lowing the “after jump” testing. Testing on 
the day of a jump took place approximately 
one hour before the scheduled jump. Half 
of the Ss in each subgroup were tested with 
List A first and half with List B first. 

Upon entering the experimental room, S 
was seated facing the examiner, a brief ex- 
planation was given him about the word- 
association test, and he was told that the 
purpose of the test was to measure emotional 
reactions to different words. Electrodes were 
then attached to the right hand index and 
middle fingers, After about 10 min., the 
tape recorded list of words was introduced 
with the following comment: 

Now we will begin a test of speed of 
association to words. After you have 
heard a word—and not before you have 
heard the whole word—say the first 
word that occurs to you as quickly as 
possible. If you are not sure of a word, 
respond to what you think it was. Please 
do not comment or ask questions be- 
tween words, but wait for the end of 
the test. Any questions? 


REsutts 


Scores consisted of the mean re- 
sponse for the three words at each of 
the levels in a dimension. Figure 1 
presents the GSR data for experienced 
and inexperienced parachutists for the 
parachuting dimension on the day of a 
jump and on a control day. The re- 
sults support the findings of earlier 
Studies. Inexperienced jumpers pro- 
duce a monotonic gradient of increasing 
GSR as a function of increasing stimu- 
lus relevance. Each of the 16 inex- 
perienced Ss produced a gradient on a 
control day and a steeper gradient on 
the day of a jump. Experienced jump- 
ers produce a monotonic gradient on a 
control day and an inverted V-shaped 
curve on the day of a jump. Fifteen 
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Fic. 1. Conductance change to a parachut- 
ing dimension for experienced and novice 
parachutists. 


of the 16 experienced jumpers pro- 
duced an inverted V-shaped curve on 
the day of a jump. For 9, the largest 
response was to low-relevant words, 
for 6 to medium-relevant words. 

The analysis of variance indicates 
that the interaction between groups, 
time of testing, and stimulus dimension 
is highly significant, F (3, 84) — 11.21, 
$ < 001, substantiating the above ob- 
servations. 

Reaction time produced results simi- 
lar to GSR. Figure 2 shows that in- 
experienced jumpers produced mono- 
tonic gradients along the parachuting 
dimension on both testing occasions, 
and that their gradients on the day of 
a jump were steeper than on a control 
day. Experienced jumpers produced 
a monotonic gradient to the parachut- 
ing dimension on the control day and 
an inverted V-shaped curve on the day 
of a jump. Twelve experienced jump- 
ers produced their longest reaction time 
to low-relevant words, three to me- 
dium-relevant words, and one to high- 
relevant words. The analysis of vari- 
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Fic. 2. Reaction time to a parachuting 
dimension for experienced and novice para- 
chutists. 


ance indicates that the interaction be- 
tween groups, time of testing, and 
stimulus dimension is highly significant, 
F (3, 84) 277, p < 901. 

Figure 3 presents the average GSR 
of experienced parachutists to the non- 
parachuting-relevant stimulus dimen- 
sion on the day of a jump and on the 
day remote from a jump. Both experi- 
enced and inexperienced jumpers pro- 
duce gradients of increasing GSR as a 
function of increasing stimulus rele- 
vance on both days of testing. However, 
while the gradient for novice jumpers 
is steeper on the day of a jump than 
on a control day (as was the case for 
the parachuting dimension), for the ex- 
perienced jumpers the gradient is 
steeper on a control day than on a 
jump day. The analysis of variance 
reveals that the interaction of groups, 
dimension, and day of testing is sig- 
nificant, F (3, 84) = 460, P< 01. 

Figure 4 presents the results for re- 
action time to the nonparachuting-rele- 
vant stimulus dimension. The same 
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trend as for GSR is observed for both 
inexperienced and experienced jump- 
ers. For inexperienced jumpers the 
gradient for reaction time is steeper on 
the day of a jump than on a control 
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day, while for experienced jumpers, the 
reverse holds. While the direction of 
findings is the same as for the GSR, 
the analysis of variance fails to yield 
significance for the triple interaction 
reflecting these trends. 


Discussion 


It may be concluded that the inverted 
V-shaped curve of experienced parachut- 
ists on the day of a jump is specific for 
„a dimension of parachuting-relevant cues. 
It is noteworthy that the experiment suc- 
ceeded in producing a dimension of non- 
parachuting, emotion-arousing cues that 
elicited steeper anxiety gradients from 
experienced parachutists on a control day 
than the parachuting-relevant dimension. 
"Thus, to the extent that strength of initial 
reaction tendency could be a factor in 
evoking inhibition, the nonparachuting 
dimension had an adavantage in eliciting 
inverted V-shaped curves. Yet, not a 
single experienced parachutist produced 
an inverted V-shaped curve to the non- 
parachuting dimension, while 15 out of 
16 experienced parachutists produced such 
curves to the parachuting dimension. 

In addition to evidence for specific in- 
hibitory reactions associated with cues of 
parachuting, the data of the experienced 
parachutists provide evidence for a gen- 
eral inhibitory factor associated with 
anxiety produced by cues unrelated to 
parachuting. Thus, the novices demon- 
strated an increase in steepness of their 
gradients of GSR and reaction time to 
the nonparachuting dimension on the day 
of a jump, while the experienced para- 
chutists demonstrated a reverse effect, 
ie, the steepness of their gradients to the 
nonparachuting dimension declined from 
the control to the jump day. 

The overall findings on the parachut- 
ing dimension, namely that both groups 
produced similar monotonic gradients of 
GSR and reaction time on a control day, 


and that on the day of a jump the novice 
parachutists produced steeper gradients 
and the experienced parachutists inverted 
V-shaped curves, confirm the results of 
previous studies (Epstein, 1962; Epstein, 
in press; Epstein & Fenz, 1962; Fenz, 
1964). The extremely high reliability in 
all studies, with the data generally apply- 
ing to all individuals within groups, as 
well as the generality of the findings for 
different experimental conditons and dif- 
ferent response measures, such as self- 
ratings of fear along a time dimension 
(Epstein & Fenz, 1965), and physiolog- 
ical recordings obtained during ascent in 
the aircraft (Fenz & Epstein, 1967) sug- 
gest the operation of an extremely funda- 
mental mechanism associated with the 
mastery of anxiety and stress. 
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This experiment 


tested the hypothesis 
probabilities comprising the likelihood 
conservatism evidenced in a subjective probability revision task. 


that the magnitudes of the 
ratio influence the degree of 
2 


sets of sampling distributions were constructed so that while the 


the revisions were more conservative if the component 


were small than if 


probabilities 


constituting the ratios 
other. Then 2 groups of Ss 
sets of distributions. The 
for a given ratio 
probabilities 


of the conservatism reported in previous experiments. 


When people revise their subjective 
probabilities in light of relevant data, 
the amount of revision is less than the 
amount prescribed by the normative re- 
vision model, Bayes’ Theorem (e.g. 
Edwards, Lindman, & Phillips, 1965 ; 
Peterson, Schneider, & Miller, 1965; 
Phillips & Edwards, 1966; Phillips, 
Hays, & Edwards, 1966; Vlek, 1965). 
The orderliness of the experimental re- 
sults makes it unlikely that the sub- 
optimality, called conservatism, is sim- 
ply due to confusion resulting from an 
unfamiliar task. And, while the degree 
of conservatism responds to variations 
in the experiments, such as sample size . 
(Peterson, Schneider, & Miller, 1965) 
and payoffs (Phillips & Edwards, 
1966), its causes appear to lie deeper 
than mere procedural variables. Moti- 
vated by the belief that optimal deci- 
sions require accurate subjective prob- 
abilities, a good deal of research has 
been devoted to trying to uncover these 
deeper causes. 

Two somewhat conflicting hypoth- 
eses have been advanced to account 
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for conservatism. The first (Phillips, 
1966) is that Ss correctly apprehend 
the probabilistic aspects of the experi- 
ment, but they are unable to perform 
the required subjective mathematical 
manipulations, ie. Bayes’ theorem, in 
order to revise their subjective prob- 
abilities optimally. The orderliness 
of the data rules out the possibility that 
the manipulative errors are random, 
but within this limit the hypothesis has 
merit. 

The second hypothesis is just the 
opposite of the first; that Ss are able 
to perform the appropriate subjective 
manipulations but they fail to appre- 
ciate correctly the probabilistic aspects 
of the experiment (Beach, 1966 ; Peter- 
son, Ulehla, Miller, Bourne, & Stilson, 
1965). This hypothesis is supported, 
e.g., by the finding that substitution of 
Ss’ erroneous estimates of sampling 
distributions into Bayes' theorem per- 
mits prediction of subsequent errone- 
ous (conservative) subjective probabil- 
ity revisions ( Peterson, DuCharme, & 
Edwards, 1967). Moreover, appropri- 
ate training about the kinds of data to 
be expected under various experimental 
conditions leads to increased accuracy 
of the estimated sampling distributions 
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and corresponding decreased conserv- 
atism in subjective probability revisions 
(Wheeler & Beach, 1967). 

Both of these hypotheses probably 
account in part for conservatism. 
However, they are not necessarily the 
only possibilities. Vlek (1965) has 
proposed a third hypothesis which, if 
true, would have important theoretical 
and methodological implications for the 
use of Bayes' theorem as a psycholog- 
ical model. The hypothesis is that Ss 
violate the likelihood principle. 

To understand Vlek's point, consider 
Equation 1, which is Bayes' theorem in 
ratio form. 


P(Hi[D) |P(D|H:) P(H)) 


P(E,|D) ~ PDA.) PH, M 


The optimal revision of subjective 
probabilities should proceed as fol- 
lows: The S begins with prior prob- 
abilities P(H,) and P(H,) about 
which of two hypotheses (H, and H,) 
is true. Their ratio is indicative of 
which he favors and how much. Then 
he observes a datum D and, consulting 
his subjective sampling distributions 
for the two hypotheses, forms the likeli- 


P(D|H) 
P(D|[H3) 
of the likelihood ratio indicates the 


"impact" the datum has on S’s prior 
probabilities and dictates the ratio of 


of his posterior probabilities Pane 


hood ratio The magnitude 


That the impact of the datum is de- 
pendent only upon the magnitude of the 
likelihood ratio, not upon the magni- 
tudes of its component probabilities, is 
called the likelihood principle. Vlek’s 
(1965) suggestion is that for Ss, equal 
likelihood ratios may not always have 
the same impact. When a datum has 
a low probability for both hypotheses, 
the probabilities that comprise the 


likelihood ratio will be small. If Ss 
regard these small probabilities as rela- 
tively uninformative, they may be dis- 
inclined to rely upon the ratio in order 
to modify their subjective prior prob- 
abilities. 

It is not difficult to see the implica- 
tions of Vlek's (1965) hypothesis. 
Bayesian theoretical treatment of hu- 
man information processing would have 
to take Ss' violation of the likelihood 
principle into account, a tedious if not 
impossible requirement. There are 
also methodological implications. In 
the past the practice was to assume 
that the likelihood principle held for 
Ss' revisions, and then to group the 
experimental data in such a way that 
all responses were pooled and averaged 
for objectively equal likelihood ratios 
irrespective of the magnitudes of the 
probabilities that comprised the ratios. 
Thus, Ss’ revisions for a likelihood 
ratio of 


P(D[H) _ .02 


POIH) ^ 04 7 90 


were pooled with revisions for ratios 
of 


dà = .50 and 2$ = 50, 
Formally, these ratios should all lead 
to the same amount of revision. Sub- 
jectively, the first ratio may have less 
impact than the second, and the second 
may have less impact than the third. 
The purpose of this experiment was 
to test Vlek’s hypothesis. 


METHOD 


For each of two groups of Ss there were 
two decks of 100 3X 5 in. index cards. On 
each card was written a letter from A 
through F; in one deck the letters were writ- 
ten in red ink (the red deck) and in the other 
they were in green ink (the green deck). The 
relative frequencies of the letters differed 
in each of the decks; the distributions are 
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TABLE 1 
Deck COMPOSITIONS FOR THE TWO EXPERIMENTAL GROUPS 


Letters 
A B [o] D E F 

Group 1 

Red Deck 35 .06 26 18 12 03 

Green Deck 35 "15 43 27 04 .06 

Sum of Probabilities 70 21 39 45 46 09 
Group 2 

Red Deck .05 A2 22 .06 39 16 

Green Deck .05 .30 Al 09 A3 32 

Sum of Probabilities 40 42 3. AS 452, A8. 
Likelihood Ratio 1:1 1:2.5 2:1 HEY 3:1 1:2 


These distributions con- 
stitute the sampling distributions for the 
decks. For Group 1 the large likelihood 
ratios are comprised of small probabilities 
and vice versa. For Group 2 the large likeli- 
hood ratios are comprised of large probabil- 
ities and the small ratios are composed of 
small probabilities. The ratios for each let- 
ter (datum) are the same for both groups 
and formally should lead to the same amount 
of revision about which of the two decks 
(Hi or Ha) was the source of a given datum. 

Procedure.—To give Ss some idea of what 
was meant by the probabilities of the letters 
in the decks, they were shown the shuffled 
cards one at a time, twice through each 
deck. Then they estimated the probabilities 
for each letter for first one deck and then 
for the other. Estimates were made by di- 
viding 100 pennies into six stacks, one for 
each letter. Then the number of pennies in 
each stack was entered into blanks on an 
answer sheet. After this was done the verid- 
ical relative frequencies were displayed and 
Ss were told to use them throughout the re- 
mainder of the experiment. 

The revision task was presented as a hypo- 
thetical game in which E would flip a fair 
coin to select one of the two decks, ran- 
domly draw a card from the deck, and 
announce the 
the color. The Ss were to start at .50-.50 
and to revise their subjective 
probabilities about which of the two decks 
had been selected. Next, the card was 
replaced, a second one drawn from the same 
deck, and Ss revised their previous revision. 
Then the process was repeated for a third 
time. 

The deck probabilities were displayed 
throughout the experiment and the bobbing of 


shown in Table 1. 


Ss' heads made it quite apparent that they 
were consulting the display and basing their 
revisions upon it. 

The revisions were made by arranging the 
100 pennies in two calibrated upright 
troughs; one trough was red and one was 
green. The pennies in each trough signified 
S's posterior probabilities for each of the 
two decks. After each revision each S wrote 
the number of pennies in the troughs in 
blanks on his answer sheet. After each se- 
quence of three draws the pennies were re- 
turned to .50-.50. 

The triplets of letters forming the draw 
sequences Were arranged so that no letter 
followed itself or any other letter more than 
once in any of the three positions and each 
letter occurred five times in each of the 
three positions. There were 30 different se- 
quences, each written on an index card; the 
cards were shuffled before each experimental 
session. 

Subjects.—Twenty-five. male undergraduate 
university students served as Ss, 12 in Group 
1 and 13 in Group 2, Three or four Ss 
were run in each experimental session. 
Post-experiment questions showed that some 
Ss had taken statistics courses and a few 
remembered having heard of Bayes' theorem, 
but that none knew that it was appropriate 
in this situation. 


RESULTS 


Throughout the data analysis the red 
deck was regarded as H, in Equation 
1. The veridical likelihood ratios are 
given by the distributions in Table 1. 
Because each S’s prior probabilities for 
the first datum were always .50-.50 
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and he recorded his posterior probabil- 
abilities, it is a simple matter to use 
Equation 1 to infer the subjective likeli- 
hood ratio underlying his revision. 
The posterior probabilities from the 
first revision were the prior probabil- 
ities for the second, so inference of the 
second likelihood ratio was also pos- 
sible and similarly for the third. 

The inferred likelihood ratios indi- 
cate the amount of revision that took 
place after each datum was seen regard- 
less of the accuracy or inaccuracy of 
the previous revision. Ratios greater 
than 1 indicate that the revision favored 
the red deck and ratios less than 1 
favored the green deck. Taking the 
log of the inferred likelihood ratios and 
computing the mean permits the 
amount that Ss revised after each da- 
tum to be compared with the optimal 
amount. The log transformed ratios 
that favored the green deck, of course, 
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INFERRED LOG LIKELIHOOD RATIO (ABSOLUTE) 


had negative signs. However, because 
the means of the Ss' revisions were all 
of the appropriate sign, it is permissible 
to treat all of the inferred and optimal 
log likelihood ratios as positive. 

The experimental hypothesis re- 
quires that Group 2 be more conserva- 
tive for small ratios than Group 1, be- 
cause its small ratios were composed of 
small probabilities. For large ratios, 
however, Group 1 should have been 
most conservative because its large ra- 
tios were made up of small probabilites. 
Within both groups the same kind of 
results were expected for the 2:1 ratios 
(letters C and F) that were composed 
of different probability magnitudes. 

Figure 1 shows the relation between 
the log likelihood ratios inferred from 
Ss' responses and the veridical log ra- 
tios. Complete nonconservatism would 
place both curves on the diagonal. The 
data points represent means across Ss 
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Fic. 1. 


Comparison of inferred and veridical log likelihood ratios for both experimental 


groups. Double points are for 2:1 ratios composed of either large or small probabilities. 
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ACCURACY RATIO 
<—§ CONSERVATISM 


SUM OF PROBABILITIES IN LIKELIHOOD RATIO 


Fic. 2. Conservatism as a function of the 
magnitudes of the probabilities comprising 
the likelihood ratios. 


and across sequences for the same 
datum in all three positions in the se- 
quences. The double points on each 
curve are for the two levels of prob- 
ability magnitudes for the 2:1 data. 
The differences between the two groups 
are clearly in accord with the hypoth- 
esis. 

Next is the question about the 
relation between probability magni- 
tudes and degree of conservatism. 
Because of the way in which the 
deck probabilites were selected, prob- 
ability magnitude can be indexed 
using the sums of the two prob- 
abilities that comprise the likeli- 
hood ratio associated with each datum. 
Degree of conservatism can be mea- 
sured by computing Accuracy Ratios 
(Peterson, Schneider, & Miller, 1965), 
the ratio of the mean inferred log likeli- 
hood ratio to the veridical log likeli- 
hood ratio for each datum. (Letter A 
was excluded from this analysis be- 
cause the associated optimal log likeli- 


hood ratio is .00 and the Accuracy Ra- 
tio is undefined.) An Accuracy Ratio 
of 1 indicates complete accuracy and 
no conservatism ; values less than 1 in- 
dicate various degrees of conservatism. 

In terms of these two indexes, Fig. 
2 shows the relation between conserva- 
tism and the magnitudes of the prob- 
abilities in the veridical likelihood ra- 
tios. For both experimental groups, as 
the magnitudes increase accuracy in- 
creases, i.e., conservatism decreases. 

Finally, the different compositions of 
the decks for the two groups of Ss per- 
mits comparison of differences in con- 
servatism as a function of differences 
in magnitudes, holding the veridical 
values of the ratios constant. In Fig. 
3 the differences between the two 
groups’ accuracy ratios is plotted 
against the differences in the sums of 
their respective decks’ probabilities for 
each datum. The greater the mag- 
nitudes of the probabilities composing 
the ratios, the greater the accuracy of 
revisions even though the veridical 
likelihood ratio is the same in both 
cases. For the smallest difference in 
magnitudes there was à reversal in 
accuracy but the difference was very 
small. 


DISCUSSION 


The results clearly support Vlek's 
(1965) hypothesis that the magnitudes of 
the probabilities influence revision of sub- 
jective probabilities. However, it is 
equally clear that the ratios of the prob- 
abilities have primary influence on revi- 
sion. The curves in Fig. 1 show the in- 
fluence of the ratios; the sums of the prob- 
abilities cannot account for the orderly 
increase in inferred ratios as a function 
of the veridical ratios. The double points 
on the curves are additional evidence for 
the primacy of the ratios; the amount of 
revision is in the same range for both 
points on each curve and within that 
range the amount is greater for the large 
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probabilities than for the small probabil- 
ities. In short, Ss violate the likelihood 
principle but not entirely; the influence 
of the ratios is only partially mitigated 
by small probability magnitudes. The in- 
fluence is sufficient, however, to warrant 
attempts to deal with it theoretically and 
to avoid methods of data analysis that 
confound it with other factors that pro- 
duce conservatism, 

A second important aspect of Fig. 1 
is the fact that Ss virtually never revised 
for letter A, even when it was composed 
of large probabilities, Only one $ was a 
consistent exception, and, as might be ex- 
pected, he was in Group 1. Other Ss oc- 
Casionally made minor adjustments after 
seeing A, but these tended to be more 


in the way of repairing over- or under- 
estimations from the previous datum. 
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MAGNITUDE OF UCR AS A FUNCTION OF VARIABILITY IN 


THE CS-UCS RELATIONSHIP: 


SHIRLEY C. PEEKE Ax» WILLIAM W. GRINGS 


University of Southern California 


The unconditioned GSR to an aver- 
sive stimulus, rather than having a con- 
stant and invariable magnitude, has 
been found to have a magnitude 
that decreases with repeated stimula- 
tion (simple adaptation). In addition 
to the effect of adaptation, recent stud- 
ies (Kimmel, 1966; Lykken, 1959) 
indicate that the GSR also decreases as 
a function of the immediately preceding 
stimulation. Investigations of the eye- 
blink response (Kimble .& Ost, 1961) 
as well as the GSR have shown that 
the magnitude of response to an aver- 
sive stimulus is diminished if the aver- 
sive stimulus is preceded by a CS or 
by a warning signal. These studies 
compared magnitude of UCR in two 
situations, one in which the UCS was 
preceded by a CS at a constant inter- 
stimulus interval (ISI) and one in 
which the UCS was presented alone, 

In examining the ways in which the 
two situations differ, the most obvious 


1This paper is based upon a thesis by the 
first author presented to the University of 
Southern California in partial fulfillment of 
the requirements for the MA degree. The 
re: Nas supported in part by a Grant 
(MH-03916) to the second author from the 
National Institute of Mental Health. 


factor is the simple presence and ab- 
sence of the CS. However a second 
factor, the constancy or reliability of 
the interval by which the CS precedes 
the UCS, might also be important. 
The effectiveness of the CS as an in- 
fluence on the UCR might be due in 
large part to the constancy of the ISI. 
In that case, the presence of a CS 
with an inconstant ISI would result 
in a different effect upon the UCR than 
is found with a constant ISI. 

The effect of the CS upon the UCR 
is similar to the effect of a ready signal 
upon motor-reaction time. Tt was 
found that the reaction time of a vol- 
untary motor response is decreased 
when a ready signal is used. In addi- 
tion reaction time varies with the vari- 
ability of the foreperiod (the interval 
between a ready signal and a signal to 
give the motor response) so that a 
variable foreperiod results in longer 
reaction times to the second signal than 
does a constant foreperiod (Klemmer, 
1956). Since the ISI in the paired 
CS-UCS study corresponds to the fore- 
period in the reaction time study the 
manipulation of that variable in a CS- 
UCS study should have results similar 


—— ———————— 
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to the reaction time findings. From 
the results of Klemmer it would be pre- 
dicted that in a paired CS-UCS study 
variability in the ISI would result in a 
UCR that differed from the UCR in 
a constant ISI paradigm. 

The motor reaction time to any sig- 
nal is often thought to reflect the S's 
state of readiness or preparatory set 
for that signal (Woodworth & Schlos- 
berg, 1954). This preparatory set is 
facilitated somewhat by the occurrence 
of a ready signal (as shown by the 
shortened reaction time relative to the 
no-ready-signal condition). The facil- 
itation is even greater when the fore- 
period is constant rather than variable. 
Grings (1965) has suggested that a 
similar preparatory set may be present 
after repeated CS-UCS pairings, so 
that the CS gives rise to a response of 
preparation for receipt of the UCS. 
This set would affect the response to 
the UCS by changing the magnitude of 
the UCR. 

If a preparatory set develops during 
repeated CS-UCS pairings, it should 
also develop prior to CS-UCS pairings 
if S is told verbally what the relation- 
ship between CS and UCS will be. 
The effect of this instruction-induced 
set should be apparent on the first trial 
after receipt of instructions. 

In the present study, different set- 
inducing instructions were presented 
to each of three groups of 5s informing 
them of the temporal relationships be- 
tween the CSs and the UCSs that they 
were to receive. In one group, Ss 
were given paired CS-UCS trials with 
a constant ISI. In a second group, 5s 
were given paired CS-UCS trials with 
an ISI that varied from one trial to 
the next. A third group was given un- 
paired CSs and UCSs. It was pre- 
dicted that the UCR magnitude would 
be least for the constant paradigm, re- 
flecting the greatest preparedness for 


the UCS. The UCR magnitude should ` 
be greatest for the unpaired paradigm, 
reflecting the least preparedness for 
the UCS, and the UCR should be of 
intermediate magnitude for the variable 
paradigm. It was further predicted 
that if a preparatory set were respon- 
sible for these differences in UCR mag- 
nitude, the difference should be ap- 
parent on the first trial due to the in- 
structions. 


METHOD 


Subjects—The Ss were 60 students 
from introductory psychology classes assigned 
by means of a table of random numbers to 
one of the three treatment groups. An 
approximately equal ratio of males and fe- 
males was assigned to each group. 

Apparatus—The CS was an 800-cps tone 
produced by a Hewlett-Packard audio-oscil- 
lator (Model 201B) and presented to S 
through Grason-Stadler earphones. The 
UCS was a .5-sec. de shock from a Grass 
S5 stimulator delivered to the volar surface 
of the left forearm by two silver electrodes. 
CS and UCS durations and interstimulus 
intervals were controlled by Hunter elec- 
tronic timers, Intertrial intervals were de- 
termined by a Gerbrands tape timer and were 
40, 55, or 70 sec, in duration randomly 
varied. 

The GSR was obtained as a de resistance 
change through 12 X 21-mm. silver fingertip 
electrodes coated with electrode paste and 
taped firmly to the first and third fingers of 
S’s right hand. Recording of the GSR was 
accomplished by a Darrow-type bridge of 
which the output was amplified and recorded 
by an Offner Type R Dynograph. Current 
through S was 45 pa. 

Design.—Twenty Ss were assigned to each 
of three groups which differed in CS-UCS 
relationship. Group C received paired CS- 
UCS trials with a constant 5.5-sec. ISI, 
Group V received paired CS-UCS trials with 
an ISI of .6, 2, 3, 5.5, 9, or 11 sec. (with a 
mean ISI of 5.5 sec.). Both of these groups 
received delay paradigms; that is, the CS 
had the same duration as the ISI. A third 
group, R, received unpaired CSs and UCSs 
interspersed randomly with the restriction 
that no more than two CSs or two UCSs 
occur serially. Since the duration of inter- 
shock interval might affect the magnitude of 
UCRs, the same schedule of interval dura- 
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tions were presented to each group of Ss, 
As a result, the same trial could be compared 
across the three groups without confounding 
due to different intershock intervals. 


cedure, 
min. while E calibrated the equipment. This 


he first felt the shock, Then the shock was 
increased in 2-3-y, steps until S reported the 


of responding, Following the shock series, 
S was given specific instructions for the 
block of trials he was about to receive. The 
Ss in the constant ISI group (C) were in- 
structed, “You will be given a tone which 
lasts exactly 5.5 sec, As the tone goes off 
you will be given a shock.” The variable 
ISI group (V) was instructed, “You will 

given a tone which varies in length of 
time it is on, When the tone goes off you 
will be given a shock.” The random group 
(R) was told, “You will be given a num- 
ber of tones and shocks with no Particular 
relationship to each other, Either a tone or 
shock may come at any time.” 


ReEsutts 


The magnitude of the UCR to the 
shock was defined as the size of the 
maximum resistance change following 
shock onset with a latency of 1-3 sec, 
The deflections were measured in rela- 
tion to the resistance at Onset of the 
Tesponse. The change in resistance 
measure was transformed into Square 
root of conductance change (VA t) 
units. 

UCR analysis.—In order to deter- 
mine whether or not the treatment 
groups differed initially in their respon- 
siveness to shock, the magnitude of re- 


sponse to the three adaptation shock- 
alone trials was averaged for Ss in 
each treatment group. The means did 
not differ from each other (Fig. 1). 
Therefore, the treatment groups did 
not differ initially in the magnitude of 
their response to the shock. 

Of the 10 trials presented, only 
Trials 1, 4, 7, and 10 were used in 
the analysis since these were the only 
trials for which the CS duration was 
comparable for the three groups (5.5. 
sec. duration). The UCR magnitudes 
for these four trials for the three groups 
are shown in Fig. 1. A treatment by 
trials analysis of variance yielded a 
significant treatments effect with an 
F (2, 57) = 10.73, p< 01 and a sig- 
nificant trials effect with an F (3, 
171) = 297, p < .05. The "Treatments 
X Trials interaction was not significant. 
A Scheffé post-hoc analysis of the dif- 
ference between the treatment group 
means showed that each group differed 
significantly from each other group. It 
appears that the CS-UCS relationship 
affects UCR magnitude in the manner 
predicted with the constant group (C) 
having the smallest UCRs, the random 
group (R) having the largest UCRs, 
and the variable group (V) having 


MAGNITUDE UCR 


TRIALS 


Fic. 1. Mean magnitude of UCR over 
trials for the constant interval group (C); 
the variable interval group (V), and the un- 
paired random group (R). 
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intermediate sized UCRs. While there 
was a significant trials effect, examina- 
tion of the trend for each treatment 
group reveals that the variable group 
(V) was the only one showing a sig- 
nificant decrease from Trial 1 to Trial 
10, ¢ (19) = 345, p < .01. 

Effect of instructions.—A. significant 
decrease in UCR magnitude from the 
last adaptation trial to the first paired 
trial occurred for the constant group 
(C), + (19) = 3.01, p <.01, but not 
for Groups V or R. Groups C and V 
received the same treatment on Trial 1 
(both received a paired trial with an 
ISI of 5.5 sec.) but differed in the 
type of instructions which had inter- 
vened between the last adaptation trial 
and the first paired trial. Thus, the 
difference in instructions must have 
been responsible for the greater UCR 
decrement in Group C than Group V. 
The UCR magnitude of Groups V 
and R did not differ on Trial 1 but 
differed significantly on Trial 10, 
t (38) 22.52, p<.05. This differ- 
ence is attributed to the adaptation 


occurring in Group V (possibly due to - 


increased familiarity with the range of 
ISI durations by the time Trial 10 


occurred). Group R showed no 
adaptation at all. 
Conditioned response — analysis.— 


Since variation in UCR magnitude 
occurred in groups differing in CS- 
UCS relationship, this variation might 
be attributed to differences in the pres- 
ence and degree of anticipatory condi- 
tioned responding. In order to deter- 
mine, first of all, whether or not con- 
ditioning occurred for the paired cS- 
UCS paradigms and not for the ran- 
dom paradigm, a one-way analysis of 
variance was performed for the magni- 
tude of the maximum response to the 
CS for Groups C, V, and R. A sig- 
nificant treatments effect was obtained 
with an F (2, 57) =7.493, P< 01. 


A Scheffé post-hoc analysis indicated 
that Groups C and V did not differ 
while Group R differed significantly 
from both other groups (p < .05 in 
both cases). That is, significantly 
greater responding to the CS occurred 
for the two paired paradigms than for 
the unpaired paradigm (Fig. 2). 
However, CR magnitude for the three 
groups did not differ on Trial 1, F (2, 
57) = 2.38. 

Since magnitude of CRs did not dif- 
fer for Groups C and V, anticipatory 
responding cannot be responsible for 
the difference in UCR magnitude found 
for the two groups. Both groups 
showed the highest magnitude of con- 
ditioned GSR on Trial 1 and a de- 
creasing magnitude with subsequent 
trials. The high level of responding 
on Trial 1 was most likely a result of 
instructions that the CS would be fol- 
lowed by a shock. 

It might be predicted that Group C 
would show more inhibition of delay 
(a longer latency between CS onset 
and CR onset) than Group V since 
Group C was told exactly the duration 
of the interval between CS onset and 
UCS onset and Group V was not. 
Latencies from CS onset to the begin- 
ning of the first response occurring in 
the delay interval are shown in Fig 3. 
Although Group C showed consistently 
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tone for each treatment group. 
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Fic. 3. Mean latency of response to 
tone for each treatment group. 


longer latencies than Group V, the dif- 
ference between the two groups was 
not significant when a Treatments x 
Trials analysis was made, F (1, 30) = 
3.56, p < 10. 


Discussion 


The hypothesis that UCR magnitude 
varies with the constancy of the relation- 
ship between the CS and UCS was sup- 
Ported since unpaired CSs and UCSs 
(Group R) resulted in the largest UCRs, 
paired CSs and UCSs with a constant 
ISI (Group C) resulted in the smallest 


this difference in UCR magnitude was a 
result of an instruction-induced prepara- 
tory set was Supported for the constant 


ence developed gradually which became 
Significant by the tenth trial. Perhaps 
this difference was due to Ss in the vari- 
able group having gradually become fa- 
miliar with the range of ISIs so that the 
Situation became more predictable and 
adaptation could take place to the UCS. 
It appears that the differences between 
the three groups could best be described as 
differential rate and degree of adaptation. 


In the Kimble and Ost (1961) study 
the paired CS-UCS (constant ISI) trials 
showed gradual adaptation rather than 
immediate adaptation as shown in this 
study. The difference in results appears 
to be due to the presence of set-inducing 
instructions in the present study and their 
absence in the Kimble and Ost study. 

The difference in UCR magnitude for 
the three treatment groups might be due 
to differences in CR magnitude in the de- 
lay interval. If this were the case, it 
would be expected that CR magnitudes 
would be functionally related to type of 
treatment in the same way that UCR 
magnitudes were, This does not seem to 
be the case. No difference was found in 
CR magnitudes for Groups V and C, 
whereas a significant difference was found 
in UCR magnitudes for these two groups. 
Also, the hypothesis of a difference be- 
tween CR latency in the two groups was 
tested and found to be insignificant. 
Neither CR magnitude nor latency varied 
in a way that would be expected if they 
Were to influence magnitude of UCRs. 
From this evidence, it is concluded that 
while UCR magnitude can be varied by 
use of differential CS-UCS relationships, 
no simple relationship between UCR mag- 
nitude and CR magnitude or latency has 
appeared. 

At the present time there are two major 
explanations of UCR diminution in clas- 
Sical conditioning. The explanation fa- 
vored in the present Study is termed a 
perceptual one in the sense that it assumes 
that S’s Perception of the experimental 
Situation established a set toward receipt 
of stimulation. An alternate explanation, 
favored by Baxter (1966) and Kimmel 
(1966) is that of conditioned inhibition. 
The strongest evidence to date for the 
inhibition view is the presence of UCR 
diminution with a delay paradigm and its 
absence with a trace paradigm. The 
strongest evidence for the perceptual ex- 
planation in the present study is the im- 
mediate appearance of diminution upon 
instruction for the constant group. Be- 
fore that evidence can be made maximally 
relevant for differentiating between the 
explanations, further experimentation is 
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needed with instructions manipulated as 
one of the independent variables. 
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HUMAN ORIENTING REACTION AS A FUNCTION OF ELEC- 
TRODERMAL VERSUS PLETHYSMOGRAPHIC RESPONSE 
MODES AND SINGLE VERSUS ALTERNATING 
STIMULUS SERIES? 
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Indiana University 


80 Ss were presented with 300-msec. tones and lights separated by a 
mean interval of 45 sec. and arranged in 40-trial sequences which con- 
tained both repeated single and alternating patterns as well as trials 


constituting changes from these patterns. 

volume pulse change (VPC ) were recorded. 
reliably after a stimulus change from single 
not after change from alternation, F < 1. 


GSR and digital blood- 
Both responses increased 
repetition, »<.001, but 
For Trials 1-15 the GSR 


habituated reliably and at the same rate to both repetition patterns 


P < .001, but the VPC did not habituate 


to either pattern, F <1, 


A continuous time-estimation task given to half of the Ss in order to 


distract them from the stimuli did not 


affect the above results, 


Assuming the GSR and VPC to be components of the orienting re- 
sponse (OR), these findings create difficulties for OR and related 


theories of autonomic behavior. 


Although noted by Pavlov (1927, 
p. 12), the importance of the orienting 
reaction (OR) has been emphasized 
only recently to American investigators 
by the work of Sokoloy (1960, 1963). 
Sokolov’s theory postulates a hypothet- 
ical “neuronal model” which develops 
on the basis of past stimulation received 
by the organism and which acts as a 
predictor of future stimulation. The 
theory asserts that the OR will de- 
crease or habituate to a repeated series 
of stimuli, while a change in the 
series, because it is not predicted by 
the neuronal model, will reinstate or 
dishabituate the OR. 

1 This research was supported by Contract 
Nonr 908 (15), Office of Naval Research, 
Roger W. Russell, Project Director. The 
author is indebted both to other members of 
the project for their help in data collection 
and analysis, and to I. Gormezano and F. A, 
Stare for their helpful criticisms, respectively, 
during the conception and the interpretation 
of this study. A brief version of this experi- 
ment has been reported at the 39th Annual 
Meeting of the Midwestern Psychological 
Association, May 1967, 

? Now at the University of Toronto. 
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Apart from its intrinsic value, OR 
theory poses an embarrassing problem 
for Es who consider that they are clas- 
sically conditioning autonomic - re- 
sponses with interstimulus intervals 
(ISIs) which do not exceed the latency 
of these responses. Such short ISIs 
preclude measurement of the response 
to the conditioned stimulus (CS) on 
training trials, so that CS-alone test 
trials have to be used to assess condi- 
tioning. However, because both CS 
and the unconditioned stimulus (UCS) 
initially elicit similar responses, there 
being no neutral CSs, it is difficult to 
assert that the response to the CS alone 
on test trials is due to conditioning 
rather than to nonassociative factors 
(Zimny, Stern, & Fjeld, 1966). From 
an OR viewpoint, in particular, the 
presentation of an unpaired CS test 
trial constitutes a relative change from 
the previously experienced paired CS- 
UCS stimulus pattern. The basic dif- 
ficulty is that the associative operation 
of pairing CS with UCS is also that 
which transforms CS-alone presenta- 
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tions into relatively novel trials, and 
this problem exists even for the dif- 
ferential conditioning paradigm, an 
arrangement which has been said to be 
the most unequivocal way of demon- 
strating associative autonomic learning 
(Kimmel, 1964). 

A preliminary step towards assessing 
the role of OR habituation and dishabit- 
uation in response changes which have 
been traditionally attributed to classical 
conditioning is to examine these OR 
processes in the absence not only of a 
UCS but also of any repeated pairings 
of two stimuli, pairings which con- 
stitute the characteristic pattern of 
short-ISI conditioning. ^ Response 
changes over trials consisting of sep- 
arately presented tone and light stimuli 
were therefore studied as a function of 
the following two-level factors: (a) 
whether the stimulus presented was a 
part of a repeating pattern or repre- 
sented a change from that pattern, (b) 
whether the repeated-stimulus series in- 
volved a single repetition pattern (e.g., 
tone, tone, tone,...) or an alternating 
repetition pattern (e.g. tone, light, 
tone, . . .), (c) whether the response 
mode was electrodermal or plethysmo- 
graphic, these responses being among 
"the most commonly studied compo- 
nents of the orienting reflex" (Uno & 
Grings, 1965, p. 312), and (d) whether 
a continuous time-estimation task was 
or was not given to Ss. The last factor 
was varied in order to determine 
whether the amount of OR increase to 
change could be reduced if S were dis- 
tracted from the stimuli. 


METHOD 


Design—The single (S) and alternating 
(A) levels of the repeated-pattern-type fac- 
tor were varied within Ss, a procedure 
which introduced the order of presentation 
(1st-2nd) as an additional factor. This fac- 
tor was counterbalanced by presenting the 
S and A series in two sequences to two equal 


groups of Ss. In both sequences, each con- 
taining 40 trials, the S series consisted of 
15 trials with the same stimulus (x, x, . . 
x), while the A series consisted of 25 trials 
with the stimuli being alternated (x, y, . . . 
x, y, X). The change (C) from a repeated 
(R) stimulus pattern consisted of presenting 
a stimulus which was different from (S 
series) or the same as (A series) that on 
the immediately preceding trial. Italicizing 
the two change trials in both sequences, Se- 
quence I (S first) consisted of the series, 
X X... Xy Xs. X y, X 4, while 
Sequence II (A first) consisted of the series, 
Ky Vp swe Xp Vy X, Ej. oy Ky Ms The change 
(C) from S repetition therefore occurred 
on Trials.16 and 40 in Sequences I and II, 
respectively, while the C from A repetition 
occurred on Trials 26 and 40 in Sequences I 
and II, respectively. The nature of the 
stimulus on a given trial was counterbalanced 
by specifying x as tone and y as light for 
half the Ss and reversing this specification 
for the remaining Ss. The Ss were also di- 
vided equally between those who did and 
did not receive the time-estimation task, 
but the response-mode factor was varied 
within Ss by recording both the electroder- 
mal and plethysmographic responses from all 
Ss. 

Subjects.—The Ss were 80 undergraduates 
drawn mainly from a summer-school pop- 
ulation at Indiana University who were told 
that they could earn up to $2.00 for partici- 
pating. 

Apparatus.—The S was seated in a padded 
armchair placed in an electrically shielded 
room separate from E and the main appara- 
tus. There was intercom contact between 
E and S, while a peephole also allowed E 
to observe, without S’s knowledge, whether 
S's eyes were open. Brush crystal earphones 
on S delivered a 60-db. SPL white masking 
noise which was continuous except when a 
2,000-cps, 80-db. tone was delivered. For the 
light stimulus, the illumination from a ceil- 
ing lamp was increased from 1.5 to 3.3 ftl. 
as measured with a Macbeth Tlluminometer 
directed at the wall in front of S at eye level. 

The duration of the tone and of the light 
was controlled by an electronic timer, The 
nature of the trials and the intertrial inter- 
vals were programmed through a Western 
"Union tape transmitter. At about 15° above 
eye level and 48 in. away from S were 
three colored #-in. glasses mounted to form 
an isoceles triangle with base and sides of 
23 in. and 2 in, respectively. An orange 
glass was the downward-pointing apex, with 
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glasses being the points 
joining the base of the triangle. Behind 
each glass was a 24-y, bulb, the onset and 
offset of which was under E's push-button 
control. 

The electrodermal 
sponse (GSR) was picked up by two elec- 
trodes of the i 


the green and red 


left palm and 


about 4 in. above the elbow. The GSR 
an electronic dc ampli- 
fier of the bridge type supplying a constant 
30 ua. to the electrodes, The amplifying 
unit, designed and constructed at Indiana 
University, had an automatic range-changing 
device, and a range of nine sensitivities 
which could be adjusted according to S's 
basal level and changes in resistance, The 
Output was recorded on a Stoelting servo- 
graphic recorder run continuously at a speed 
of 6 in/min, 

To record the plethysmographic response 
of change in digital blood-volume pulse, the 
tranducer was attached to the tip of S's 
left index finger. For most Ss the trans- 
ducer was Photoelectric, but for some Ss a 
Piezoelectric transducer was used. The sig- 
nal was amplified and recorded by an Offner 
type-R dynograph run continuously with a 
Paper speed of 24 mm/sec, 

(T) group receiy- 
ing time-estimation instructions were told of 
the tone and the light, and then of their task, 
which involved silently counting the seconds 
ng out the passing of each 2 min, 


the light, since these would be presented at 
irregular intervals, and were told that their 
performance would be monitored by E, who 
would take some money from .S's original 
$2.00 stake whenever the latter’s estimate of 
the passing of each 2 min. was inaccurate, 
The E undertook to flash a colored light 
two or three times during the session in 
order to indicate to S that he should speed 
up (green), slow down (red), or maintain 
(orange) his current rate of counting. The 
no-ask (NT) group received the same 
instructions as the T group except that: 
(a) the reference to time estimation was 
omitted, (b) the colored lights were men- 
tioned along with the tone and the light 
as the stimuli which S would Teceive, and 
(c) S was instructed to keep his eyes open 
on pain of forfeiting a dollar from his $2.00 
stake. 

The duration of the tone and light (from 


the ceiling) was 300 msec., while the inter- 
trial intervals were randomly varied between 
373, 45 (mean), and 523 sec. The colored 
lights were interpolated at about 10 sec. 
after the tenth and thirtieth trials for ap- 
Proximately 1 sec. The Ss for the T and 
NT groups were run successively in Pairs, 
with the performance of S in the T group 
determining the color of the colored lights 
and the total earnings for both members of 
the pair. The allocation of each pair to 
other conditions was random. 


RESULTS 


Response measures —The GSR, a 
drop in skin resistance initiated þe- 
tween 1 and 4 sec, after stimulus onset, 
was transformed into conductance 
change (A C) scores in microhms. 
The plethysmographic digital blood- 
volume pulse change (VPC), whether 
recorded through a photo- or a piezo- 
electric transducer (74 Ss and 6 Ss, 
respectively), is a decrease in pulse 
magnitude representing peripheral 
vasoconstriction. However, the mea- 
surement of the VPC response is often 
complicated by brief and nonstimulus- 
connected arythmic pulse-magnitude 
changes. The response definition 
adopted Sought to control for such brief 
changes by requiring response initia- 
tion and termination to occur 
for longer than two-pulse periods. 
Specifically, Tésponse initiation was 
looked for from 2 to 8 Sec. after stim- 
ulus onset, and defined as occurring 
at the first (base) pulse, I, for which 
either (a) I was followed by Pulses J 
and K where I > J >K, or (b) I was 
followed by Pulses J, K, and L where 
Tox T Response termination was 
taken to have occurred at Pulse T, 
Which followed I and J and which itself 
was followed by Pulses U and V such 
(a Oe The amount of change 
was the algebraic difference between 
I and T expressed as à percentage of 
I, which is a volume-pulse change mea- 
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sure expressed as a percentage of the 
base pulse. 

Change from repeated pattern.—Fig- 
ure 1 shows the mean electrodermal 
and plethysmographic performance on 
the block of repeated (R) trials imme- 
diately preceding a change (C) trial 
and on the C trials themselves as func- 
tion of the task (T-NT), pattern-type 
(S-A), and order (first-second) fac- 
tors. The counterbalanced order fac- 
tor, introduced through the variation 
of the pattern-type factor within Ss, 
makes it difficult to statistically assess 
the data in the form of a simple 
R-C x T-NT x S-A X First-Second 
factorial design, because the distribu- 
tion of Ss does not allow a split-plot 
model to be applied. Such a model is 
applicable, however, to an "interactive" 
factorial, in which the order factor is 
reclassified so that it is connected to 
the pattern-type factor. It has been 
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shown that the statistical results from 
this interactive factorial are logically 
equivalent to effects describable in the 
simple factorial terms of change, task, 
pattern type, and order (Furedy, 
1967). In these simple factorial terms, 
the electrodermal results (left panel of 
Fig. 1) yielded significant, p < .001, 
effects due to change, F (1, 76) = 
35.475, pattern type, F (1, 76) = 
25.494, and interaction between change 
and pattern type, F (1, 76) = 45.850; 
the p values for all other effects ex- 
ceeded .1. Separate analyses of the 
Change X Pattern-Type interaction 
showed that the performance on the C 
trials was superior to that on the R 
trials with the single pattern, F (1, 
76) = 59.979, p<.001, but the in- 
feriority of the C relative to the R trials 
with the alternating pattern was not 
significant, F (1, 76) = 2461, p > 1. 

For the plethysmograph (right panel 
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of Fig. 1), the effects due to change, 
pattern type, and their interaction were 
again significant, b < .001, with F (1, 
76) —14.660, 12.790, and 13.266, re- 


and to an 
Order x Change interaction, F (1, 
Finally, there 
were near-significant, p < .l, effects 
due to task, F (1, 76) = 3.248, with 
the NT group superior to the T group, 
and to a triple interaction between 
order, change, and pattern type, F (1, 
76) = 3713. For all remaining 
effects, F < 2,p>.1. Separate anal- 
yses for the significant Change x Pat- 
tern-Type interaction showed that the 
Superior performance on the C trials 
relative to R trials was significant with 
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the single pattern, F (1, 76) = 19.114, 
P < .001, but not with the alternating 
pattern, F «1. Similarly, separate 
analyses for the significant Change x 
Order interaction showed that the su- 
perior performance on the C trials 
was significant under the first-pre- 
sented condition, F (1, 76) — 15.838, 
P <.001, but not under the second- 
presented condition, F (1, 76) = 1.372, 
b2.l As the triple interaction be- 
tween order, change, and pattern type 
had not reached significance, no sepa- 
rate analyses were performed. How- 
ever, the form of this interaction seemed 
to be that the Change x Order inter- 
action was present with the single, but 
not with the alternating, pattern, 
Initial pattern repetition. —F. igure 2 
shows the mean electrodermal and ple- 
thysmographic performance over the 
first 15 trials in blocks of 3 trials as a 
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Fic. 3. The effect of initial repetition as a function of the pattern-type, task, and 


response-mode factors. 


function of task and pattern type. A 
Trials x Task X Pattern-Type analysis 
for the GSR showed only a significant 
trials effect, F (4, 304) — 28.615, 
p < .001, with a monotonic decrease of 
responding over trials; for all other 
effects, F « 1. The same analysis for 
the VPC showed no significant trials 
effect, F « 1, and all other effects also 
failed to approach significance, F < 14. 

To compare the amount of response 
decrease or habituation between the 
two response modes, a common ordi- 
nate for them was established by ex- 
pressing each score as a percentage 
of the mean score for that response 
mode over the first three trials. Fig- 
ure 3 shows the data plotted as a func- 
tion of response mode, task, and pat- 
tern type. The difference in slopes be- 
tween the electrodermal and plethysmo- 
graphic modes, as represented by the 
Response-Mode X Trials interaction, 
was significant, F (4, 304) = 10.990, 
p<.001. Task, pattern type. and all 
interaction effects involving these two 


(Scores for each response mode are expressed as a percentage of 


the mean score on Trials 1-3 for that mode.) 


factors as terms were insignificant, 
F<12. 

Task effect Since the T and NT 
groups failed to differ in any significant 
way with respect to the effects of 
change from repetition and of repetition 
itself, the question arises whether the 
task had any influence on Ss. One 
implication of such influence is that 
responses to the interpolated colored 
lights would differ between the T and 
NT groups, since these stimuli had in- 
formational significance only for the T 
group. Figure 4 shows the mean elec- 
trodermal and plethysmographic re- 
sponses to the two interpolated colored- 
light trials as a function of the task 
factor. Although the greater electro- 
dermal responding of the T relative to 
the NT group was not significant, 
F <1, the T group did give signif- 
icantly greater plethysmographic re- 
sponses than the NT group, F (1, 78) 
= 8.357, p<.0l. In addition, the 
Groups X Trials interaction for the 
plethysmograph was near significant, 
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colored-light trials as a function of the task factor, 


F (1, 78) = 3729, p <.1, with the 
difference between groups appearing 
to decrease over trials, All other 
effects in the case of both response 
modes failed to approach significance, 
ur s 


Discussion 


In the plethysmographic mode, though 
not in the electrodermal, the effect of 
change from a repeated pattern of stimula- 
tion was complicated by an interaction of 
this effect with the order factor. How- 
ever, at least for the first-presented condi- 
tion, the predominant feature of the 
change-from-repetition results for both 
response modes was the interaction be- 
tween the change and pattern-type factors, 
there being a highly reliable response in- 
crease to change from a single pattern, 
but no evidence for such an increase to 
change from an alternating pattern. It is 
noteworthy, moreover, that these latter 
null findings are based on the same size 
of sample and degree of experimental con- 


trol which produced the highly reliable 
increase to change from single repetition. 

Although the alternation pattern used 
here does not seem to have been studied 
in other experiments, the null findings 
appear surprising in the light of outcomes 
Such as those reported by Berlyne, Craw, 
Salapatek, and Lewis (1963), who found 
that changes from seemingly more com- 
plex patterns did reliably increase re- 
sponse amplitude. However, the patterns 
used by these Es were visual configura- 
tions with the components presented si- 
multaneously. The alternation pattern 
used here, though basically simple, does 
have sequentially presented components 
which require temporal integration for 
pattern registration. Such registration, 
at least as regards the autonomic re- 
sponses studied here, did not appear to 
occur, 

As for the influence of initial pattern 
repetition prior to change, the only reli- 
able effect was the difference in rate of 
response decrease over trials between the 
electrodermal and plethysmographic re- 


xf 


sponse modes. This difference, moreover, 


was that while the GSR did decrease 
monotonically and reliably, the VPC in 
the same sample of Ss showed no evi- 
dence of doing so. This failure of the 
VPC to habituate is not consistent with 
an interpretation of experiments by Unger 
(1964) and Zimny and Miller (1966), 
two studies which have been said to 
clearly demonstrate the phenomenon of 
plethysmographic habituation (Zimny & 
Miller, 1966, p. 91). However, the 
former experiment showed only that from 
20 Ss it was possible to select 12 Ss who, 
at some unspecified stage along a trial 
series of numbers presented seriatim, 
failed to respond over three successive 
trials (Unger, 1964, p. 13), and this is 
hardly compelling statistical evidence for 
habituation. Zimny and Miller (1966, 
p. 87) did obtain a highly significant 
(p <.005) monotonic response decrease 
over five successive trials with 26 (un- 
selected) Ss, but the stimuli producing 
this result consisted of 5-sec. applications 
of hot or cold surfaces to the forearm, 
with intervals of 25 sec. between applica- 
tions. As these conditions do not ensure 
that the effects from the last stimulus are 
completely dissipated before the onset of 
the next stimulus, it is at least plausible 
that sensory adaptation rather than ha- 
bituation was responsible for the observed 
reliable response decrements to successive 
stimulus onsets. Consistent with this pos- 
sibility is the failure of successively pre- 
sented tone stimuli to produce a reliable 
response decrement in the same experi- 
ment ; these stimuli were of short duration 
and were separated by at least 4 min. 
(Zimny et al., 1966, p. 85). Aside from the 
above two experiments, Uno and Grings 
(1965) varied tone intensity and found a 
marginally significant (p < 05) Trials X 
Intensity interaction which they interpret 
as suggesting that if there was any habit- 
uation, it occurred only to tones at the 
lower intensities (60 and 70 db.). With 
the possible exception of low-intensity 
tones, however, the evidence does not 
point to plethysmographic habituation to 
initial repetition as a reliably demon- 
strable phenomenon. 
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Although some features of the results 
are consonant with the OR position, 
others are not. The most relevant of the 
latter is the failure of both the electro- 
dermal and plethysmographic responses to 
increase as a function of change from 
alternation. Although alternation is not a 
particularly complex pattern, it would 
seem, in terms of Sokolov's (1960, 1963) 
theorizing, that 25 alternating trials were 
not sufücient to establish a neuronal 
model for alternating input. In addition, 
while the alternating pattern used here is 
different from the paired-stimulus .pat- 
terns of classical conditioning, the ap- 
parent failure of the present alternating 
pattern to be registered does raise the 
question whether the patterns of stimu- 
lation used in classical autonomic condi- 
tioning are sufficiently simple for neu- 
ronal models to be established, and for the 
ORs to be reinstated following pattern 
changes. 

The failure of the plethysmograph to 
decrease as a function of initial repeated 
stimulation also casts doubt on the gen- 
erality of OR theory, since habituation to 
repetition is one of the defining character- 
istics of the OR (Sokolov, 1960, p. 189). 
The present failure to habituate, it should 
be noted, was not confined to the alter- 
nating pattern, so that, as regards the 
repetition operation, the expected effects 
from the influence of the neuronal model 
failed to occur in the plethysmograph in- 
dependently of the type of repetition pat- 
tern employed. Finally, to the extent that 
one expects the orienting behavior to be 
influenced by the degree of the organism's 
attention to the stimuli presented, the 
failure of the time-estimation task to 
affect responding either to change from 
repetition or to repetition itself is not 
consistent with the OR position, It is 
possible that the task simply did not dis- 
tract Ss at all, but the results concern- 
ing responses to the colored lights (which 
contained information about performance 
only for the time-estimation group) pro- 
vided some evidence that Ss were not en- 
tirely unaffected by the task. 

In conclusion, the results obtained for 
the first-presented condition under the 
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TABLE 1 


Operation 
Response Mode Uum ms 
Repetition Change 
RECETA MB gig iR 
Electrodermal 
Single pattern decrease* | increase 
Iternating Pattern | decrease | no effect 
Plethysmographic 
Single pattern no effect | increase 


ternating pattern | no effect | no effect 


* No difference in rate of decrease between pattern 
ypes, 


various combinations of change, pattern- 
type, and response-mode factors are sum- 
marized in Table 1, The Startling fea- 
ture of the table is the lack of correlation 
between outcomes, Thus, whether a 
response system is or is not affected by 
the initia] repetition operations appears 
to be unrelated to Whether it is or is not 
affected by the change operations, The 
table also serves to warn against using a 
term like "dishabituation," which involves 
the implicit and sometimes false assump- 
tion that prior habituation to repetition 
has occurred. No unifying explanation 
has been offered for the results reported 
here, but they do Suggest that until the 
behavior of the electrodermal and plethys- 
mographic components of the OR to stim- 
ulus repetition and to change from repeti- 
tion is better understood, it is premature 
to subsume too wide a class of autonomic 


phenomena under the rubric of the orient. 
ing reflex. 
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MODALITY EFFECTS IN SHORT-TERM MEMORY: 
STORAGE OR RETRIEVAL?* 


BENNET B. MURDOCK, Jr. 


University of Toronto 


2 experiments were conducted to 
of auditory to visual presentation was due 
or in retrieval from short-term memory (STM). 


determine whether the superiority 


to differences in storage 
The 1st experi- 


ment compared recognition and recall with serial lists, under the 
presumption that retrieval difficulties would be minimized or eliminated 


with a recognition procedure. 
fidence-rating procedure (test 


which eliminated the need for sequential dependencies. 


The 2nd experiment utilized a con- 
for knowledge of list membership) 


The supe- 


riority of auditory presentation was manifest in both experiments, and 


the conclusion was suggested tha! 


ences in storage. 


In two prior studies (Murdock, 
1966b; 1967) it was shown that, with 
a probe technique in short-term mem- 
ory (STM), retention of both serial 
and paired-associate material was bet- 
ter with auditory than with visual pre- 
sentation. Even though information- 
ally speaking the material was exactly 
the same (common English words) 
and the conditions were equated as 
carefully as possible, still the differences 
were large and striking. This paper 
reports two further experiments which 
attempt to determine whether the locus 
of the effect is in storage or retrieval. 

It is generally accepted that retrieval 
involves a search through memory, and 
that recall may be unsuccessful because 
of failure to locate the item in question. 
The distinction is that of availability 
vs. accessibility (Mandler, 1967), and 
an experimental separation has been 
achieved by Tulving and Pearlstone 
(1966) by comparing cued and non- 
cued recall. Here a different approach 
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t modality effects represent differ- 


is followed in which recall and recogni- 
tion are compared. As suggested, for 
instance, by Lachman and Tuttle 
(1965), differences in storage should 
be manifest regardless of the type of 
retention test, while differences in re- 
trieval should only show up in recall. 

More specifically, both previous ex- 

riments relied exclusively upon re- 
call, and the obtained superiority of ma- 
terial presented in an auditory mode 
to material presented in a visual mode 
may have reflected differences in avail- 
ability (storage), accessibility (retrie- 
val), or both. In the first experiment 
here recognition and recall are com- 
pared, and it is assumed that recogni- 
tion eliminates the search or retrieval 
problem. That is, in the previous ex- 
periments it may have been more diffi- 
cult to locate the target item; here in 
the recognition task two list items are 
presented and S must simply decide 
whether they had been adjacent in pre- 
sentation. Thus, no search for an 
item is necessary ; instead S must de- 
cide whether the test items on the 
recognition task match the order of 
memory traces formed during list pre- 
sentation. Given the nature of the 
tests, a partial learning model of recog- 
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nition memory (McNulty, 1966) is 
Probably not applicable. An explicit 
distinction between a one-stage recog- 
nition task and a two-stage recall task 
has been proposed by Kintsch and 
Morris (1965) and by Rosenberg and 
Cohen (1966), and it is the sampling 
(or search) Process of recall that is 
by-passed by a recognition test. 

To anticipate, it will be shown that 
the modality effect is also present 
under a recognition procedure, so the 
implication is that modality effects rep- 
resent storage differences, not retrieval. 
The second study is a further test of 
the storage hypothesis, and its purpose 
Will be described in the introduction to 
Exp. II. 


ExPERIMENT I 
Method 


recall X(i--1), where i designates ordinal 
Position, X (i) the word at that ordinal posi- 
tion, and 1 Ci- 9, For recognition, a yes- 
nO procedure was employed (as in Murdock, 
1965); an A-B or A-X sequence was pre- 


to indicate a proper or an improper order. 

In recognition, the A-B or target pairs 
were X()-X(i-- 1) for Probe Position i 
(again 1=j= 9). For A-X distractor pairs 
Or "lures" the combination was X (i)- 
X(i+2) for 1=i=5; it was X(i)- 
X(i—1) for 6=i=9. The reason for 
using subsequent items for early probes but 
prior items for later Probes was that intru- 
sion matrixes of other studies clearly sug- 
gested that they would be the most seductive 
lures. (Naturally, the same pattern was 
used both under visual and under auditory 
Conditions.) A 5-point confidence judgment 
was given by S after each yes-no decision ; 
l was a "pure guess," 3 was “average,” and 
5 was “very sure” 

For visual presentation, the words were 
photographed on 16-mm. film with one word 
per frame and displayed by a Dunning 


Animatic single-frame Projector. Each word 
was shown for à sec.; following the last 
word in the list was a row of asterisks, 
The asterisk frame was followed immediately 
by the probe, which was a single word for 
the recall condition but two words for the 
recognition condition. For the latter probe 
both words were photographed on a single 
frame, one above the other. They were, of 
course, A-B or A-X to designate a proper 
or an improper sequence. For auditory pre- 
sentation, the lists were recorded and re- 
produced on an Ampex F-44 Fine Line tape 
recorder and the temporal Spacing mimicked 
that of the visual presentation as closely as 
possible. Headphones were used; the quality 
of the equipment was such that intelligibility 
seemed to be no problem. 

On each trial, then, a single serial list of 
10 words was presented to S; presentation 
was either auditory or visual, and the probe 
was either a single word (for recall) or two 
words for a yes-no (recognition) decision. 
The words were common English words; 
more specifically, they were selected at ran- 
dom from a pool of words from the Thorn- 
dike-Lorge (1944) list with G-count of 204-. 
They were all two-syllable words not more 
than eight letters long with homophones, 
contractions, archaic words, and proper 
nouns deleted. All Ss were tested individ- 
ually; each was given 72 lists with an inter- 
trial interval of 15 sec, There were 24 Ss, 
12 of each sex from the Toronto subject 
pool, 

Design—The experimental design was a 
Seven-way factorial 2x 2x 6x 2 x 2X9x2, 
with the first three variables (2x 2x 6) 
for between-Ss comparisons and the last 
four variabls (2x2x9x 2) for within- 
Ss comparisons, For the former, the 
variables were Order, Set, and Ss (in 
order of appearance, with three of each sex). 
The Order variable counterbalanced modality 
so the order (in blocks of 18 lists) was 
AVVA or VAAV. The Set variable 
counterbalanced material; all lists that were 
auditory under Set A were visual under 
Set B, and vice versa, Since Order and 
Set were orthogonal, all lists appeared 
equally often in each modality and with 
each type of retention test. 

The 2X2x9x2 within-Ss comparisons 
constituted the 72 test lists. The first vari- 
able here was Mode (auditory or visual), 
the second Replication, the third Probe 
(position), and the fourth Test (recall or 
recognition), Thus, all Ss had both Modes, 
both types of retention tests, and all nine 
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orthogonal combination. 
the variable is properly 
named under recall; however, under recogni- 
tion this variable in reality represented one 
A-B test and one A-X test. 

In addition to effecting the necessary 
counterbalancing controls and making all 
experimental effects of interest within-Ss 
comparisons, this design makes possible the 
unique assignment of every single response 
by every single S to one cell of this seven- 
way classification. Actually, there are 1,728 
total observations (24 Ss by 72 lists/S) 
which enter into the analysis. The depen- 
dent variable(s) can be accuracy (right or 
wrong) or origin of the response (as in in- 
trusion matrixes) for recall; accuracy and/or 
confidence judgment for recognition. The 
advantages of this type of classification have 
been mentioned elsewhere (Murdock, 1967) ; 
suffice it to say that we have found designs 
of this general type convenient, efficient, 
and powerful for studies of STM. 


- probe positions in 


Results 


The main results are shown in Fig. 1 
and 2, where probability of recall is 
plotted as a function of probe position 
for recall (Fig. 1) and for recognition 
(Fig. 2). The ordinates are scaled 
differently because the chance level for 
recognition is .50; the dotted lines in 
the recognition graph represent £i 
Standard Error. The recall curves 
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Fic. 1. Recall probability as a function 
of probe position (Exp. I). 
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Fic. 2. Probability of correct recognition as 
a function of probe position (Exp. T) 


(Fig. 1) agree well with those pre- 
viously reported (see Fig. 4, p. 320, 
Murdock, 1967) ; the greater irregular- 
ities in the present data are not surpris- 
ing when it is considered that only 4 
the total number of observations are 
involved, The main point however is 
in the recognition curves; at least for 
the later probe positions the auditory 
superiority is clearly manifest, and the 
implication is that modality effects are 
due to differences in storage, not re- 
trieval. 

In view of the different base lines 
for recall and recognition, separate 
ANOVAs were performed. In both 
cases, several analyses were conducted 
by collapsing in various ways over the 
counterbalancing variables, an atc sin 
transformation was used, and the ex- 
pected mean square was always used as 
the error term. (The assumption here 
is one of binomial variability, and this 
assumption was not contraindicated by 
examination of the higher-order inter- 
action terms in the various analyses.) 
The conclusions were very clear ; both 
for recognition and recall the effects 
of Mode and Probe were highly signif- 
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icant (p< 001) but the interaction 
was not. 

Perhaps the most startling aspect of 
the data is the fact that recall is better 
than recognition. Over most of the 
(probe) range recognition accuracy 
does not exceed chance (one standard 
error), yet over this same range recall 
probability is clearly greater than zero. 
The difference is greatest at Probe 
Positions 5-7, but it is evident even at 
earlier serial positions, However, the 
difficulty with interpreting this finding 
is the proper base line (chance level) 
for recall. It is at least possible that 
S can forget a sequential association 
yet guess a word from the general list 
Position of the probe and so be cor- 
Tect some unknown proportion of the 
time. Since one does not know the 
pool of items from which S might be 
guessing, it is probably unwise to 
attach too much importance to this 
particular recall-recognition compari- 
son. 

The only other factor that turned up 
in the analysis of variance was a highly 
significant (p < 001) effect of type of 
recognition test (ie, A-B vs. A-X). 
What happened was that there was 
approximately a 60-40 bias in the ratio 
of yes to no responses. This bias did 
not interact with either mode or probe. 
Inspection of the distributions of con- 
fidence judgments Shows then that as 
the retention interval decreases Ss be- 
come more accurate in assigning yes 
and no and more discriminating in their 
use of confidence judgments, but the 
pooled frequency stays constant in 
about this 3:2 ratio, 

Finally, intrusion matrixes for the 
recall gave clear-cut confirmation to 
what is already obvious in Fig. i; 
namely, the superiority of auditory to 
visual presentation, As one indication, 
the a posteriori probabilities (prob- 
ability that the probe was from Posi- 


tion i given a response from Position 
i+ 1) showed a large primacy and re- 
cency effect for auditory compared to 
visual presentation. Specifically, for 
i=1andi=9 thea posteriori prob- 
abilities were .60 and -80, respectively, 
for auditory presentation; they were 
only .17 and .41, respectively, for visual 
presentation. 


ExPERIMENT II 


The general conclusion from Exp: I 
is that modality effects are due to stor- 
age; in some way, the traces laid down 
during presentation are less persistent 
(or more vulnerable to interference) 
with visual than with auditory presen- 
tation. A factor which may limit the 
generality of this conclusion is that the 
required associations were based on 
temporal relations. That is, the defin- 
ing relationship for every association 
was temporal contiguity, and it is gen- 
erally considered that audition is a 
temporally distributed modality while 
vision is a spatially distributed modal- 
ity. The question may be raised as to 
whether the direction of the difference 
still obtains when the sequential con- 
straint is removed. 

One way to test this possibility would 
be to eliminate order information from 
the recognition test. That is, instead 
of testing for sequential associations, 
test merely for "knowledge of list 
membership." Sucha recognition pro- 
cedure requires 5 only to report yes or 
no as to whether a given probe word 
was or was not in the list just pre- 
sented. There are no sequential asso- 
ciations required here at all, and if the 
auditory presentation still yields supe- 
rior performance then the implication 
would be clearly that, in some way, the 
modality effect would reflect a real dif- 
ference in effectiveness of the two stor- 
age systems. 
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Method 


Procedure.—A. — signal-detection (TSD) 
analysis and procedure was used, based 
closely on a previous recognition-memory 
study (Murdock, 1965). Here there were 
two types of probes, targets (which had been 
present in the list), and lures (which had 
not been present in the list). All lists were 
nine words long; after the list one probe 
word was presented (target or lure) to 
which S had to respond with a yes-no 
judgment plus confidence rating. 

Within each replication each of the nine 
input positions was tested once (ie, with 
a target word X(i) for 1=i=9.) Targets 
and lures occurred in a 3:2 ratio; thus, six 
were needed for each replication. Because 
performance on recognition tests depend so 
clearly on the nature of the lures (eg. 
Dale & Baddeley, 1962) two types were used. 
In what were labeled Probes 10, 11, and 
12 the lures were words from Position 2, 
5, and 8, respectively, from the previous 
list. Probes 13-15, however, were novel ; 
they were words which previously had not 
appeared in the experiment. Thus, of the 
six lures, three had appeared in the prior 
list and three had never appeared before. 

All lists were presented in an auditory or 
a visual modality with the same equipment 
as in Exp. I. The probe word was always 
in the appropriate modality. With auditory 
presentation E called out the probe word 
at the termination of the list. With visual 
presentation E held up an index card with 
the probe word written on it. In each case 
S responded with a single number: 6 for 
sure-yes; 5 for fairly sure-yes; 4 for doubt- 
ful-yes; 3 for doubtful-no; 2 for fairly sure- 
no; and 1 for sure-no. This 6-point scale 
was written down and available to S 
throughout the course of the experiment. 

The pool of words was the same as that 
used in Exp. I, though different filmstrips 
and tapes were used. All Ss were given 90 
lists, 45 under each modality. A slower pre- 
sentation rate was used (1 word/sec) and a 
blank frame followed every word so that, 
as before, actual duration of stimulus presen- 
tation was approximately 3 sec. in each 
modality. There were 32 Ss, 16 of each sex 
from the same pool as Exp. I. In other 
essential respects the procedure followed 
that of Exp. I. 

Design.—The experimental design was 
again a seven-way factorial 2X 2 X 2X 4x 
2X3X15, with the first four variables 
(2x 2X 2x4) for between-Ss and the last 


three variables (2X 3X 15) for within-Ss 
effects. The former were for counterbalanc- 
ing; they were Mode Order (auditory to 
visual or visual to auditory), First Set 
(Set A or Set B), Probe Order (alpha 
or beta), and Ss (1-4 in order of appear- 
ance, with two of each sex). All Ss had 
the same 90 lists in the same order, with 
the first 45 in one modality and the second 
45 in the other modality. The Mode Order 
variable controlled for practice effects across 
the experiment, the First Set variable in- 
sured that the same lists occurred both under 
auditory and under visual, and the Probe 
Order variable provided two different ran- 
domizations to guard against differential 
order effects. These controls necessitated 
duplicate sets of stimulus materials, but since 
Mode Order, First Set, and Probe Order 
were orthogonal all reasonable precautions 
were observed, 

The 2X3X15 within-Ss comparisons 
constituted the 90 test lists. The first vari- 
able here was Mode (auditory or visual), 
the second variable was Replication (1-3), 
and the third variable Probe Position (1- 
15). As stated above, Probes 1-9 repre- 
sented target words (words which in fact 
had been presented in the list and to which 
the correct response was yes); Probes 10- 
15 represented target words of two types. 
Probes 10-12 came from the prior list, 
while Probes 13-15 were novel. The one 
restriction in the otherwise random Probe 
Order (alpha or beta) was that Probe 10 
could never follow Probe 2; Probe 11 could 
never follow Probe 5; or Probe 12 could 
never follow Probe 8. 

With this design there are a total of 2,880 
observations (32 Ss x 90 lists/S) which 
enter into the analysis. The dependent vari- 
able is the 6-point confidence rating made 
by S on each trial. 


Results 


In Fig. 3 is shown probability of 
correct recognition of target words as 
a function of probe position, and audi- 
tory is again superior to visual. The 
advantage of this method of presenting 
the results is that it shows how recog- 
nition accuracy varies with serial posi- 
tion; the disadvantage is that it is 
based only on target words so that— 
at least theoretically—the modality 
effect could represent criterion factors 
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Fic. 3. Probability of correct recognition 
as a function of probe position (Exp. II). 


rather than sensitivity (Donaldson & 
Murdock, 1968; Murdock, 1966a). 
(This problem is not present in Fig. 2, 
as the data is pooled over A-B and 
A-X trials. The difficulty, of course, 
is that in tests for "recognition of list 
membership" one cannot have specific 
lures for any particular probe position.) 

A more comprehensive test can be 
done by scoring each response as right 
Or wrong where correct is Confidence 
Rating 1-3 for Probe 10-15 (lures) 
but Confidence Rating 4-6 for Probe 
1-9 (targets). For the analysis of vari- 
ance the authors collapsed over Ss, con- 
sidered Mode Order, First Set, and 
Probe Order as a single octal variable, 
and used an arc sin transformation. 
The modality effect was significant, 
F (1, ©) = 4.30, b <.05, but it did 
not interact with Probe position or 
with replications (in both cases, F — 
1). The mean number of errors per 
S were 4.5 and 6.1 for auditory and 
visual, respectively. 

The most Satisfactory analysis, how- 
ever, is in terms of TSD, and the ques- 
tion of interest is whether the d' values 
differ for auditory and visual. While 
a number of comparisons are Possible, 
one reasonably representative one is 
shown in Fig. 4 which is an ROC plot 


(on double-normal coordinates) for 
Probe Positions 2-4 (targets) against 
Probe Positions 10-12 (lures). That 
is, for each the three positions were 
combined, and for each distribution 
(ie., distribution of confidence judg- 
ments for targets and for lures) there 
were 288 observations (32 Ss x 3 rep- 
lications x 3 positions). The values of 
d' (taken as the intersection with the 
negative diagonal) were 1.80 and 1.21 
for auditory and visual, respectively, 
Recently a method of maximum like- 
lihood estimation of parameters of con- 
fidence-rating ROC curves has been de- 
veloped (Ogilvie & Creelman, 1968). 
For the two curves shown in Fig. 4, 
the values of d' were each significantly 
different from zero and also differed 
significantly from each other, ¢ (1,000) 
i946, bos 001 The. values of d' 
given above are in fact the maximum 
likelihood estimates; for auditory, the 
slope of the line did not differ signif- 
icantly from unity but for visual it did 
(it was .73). Finally, the double-nor- 
mal plot was clearly a good fit; in 
neither case was the deviation from 
linearity significant (at the .05 level). 
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The above comparison pits words 
from the early part of the list (Posi- 
tions 2-4) against those presented in 
the prior list. When the same target 
words are compared against lures 
which never were presented (i.e., Probe 
13-15) the d' values were appreciably 
larger, 2.81 for auditory and 2.08 for 
visual. Again, the modality difference 
was clearly significant (.001 level). 
The larger d' values here (as well as 
mean values from the ANOVA) 
clearly suggest that there are dif- 
ferences among lures in terms of their 
past history. As a direct test, those 
which had appeared in the previous list 
were considered targets (or signal) 
while those which had not were con- 
sidered lures (or noise) ; the d" values 
were .64 (auditory) and .73 (visual) 
and both differed significantly (p < 
001) from zero. In other words, 
lures from the prior list are more likely 
to elicit false positives than are those 
that are novel; persistence of memory 
traces from List i—1 is detectable 
after presentation of List i (also see 
Melton, 1963). 


Discussion 


The results seem quite clear; recogni- 
tion of list membership for auditory 
words is better than for visual words. 
Statistically, the effect is greatest with a 
TSD analysis, but it shows up even 
in an analysis based on overall accuracy. 
Experiment I shows that sequential asso- 
ciations, whether tested by recall or 
recognition, were superior with auditory 
presentation; Exp. II showed the modal- 
ity effect to obtain even in a task which 
did not demand sequential associations. 

While Exp. II required no sequential 
associations, it would clearly be wrong to 
claim that no temporal factors at all were 
involved. Actually, it could be argued 
that this experiment was the yes-no ana- 
logue of the two-alternative forced-choice 
discrimination - of - recency study by 
Yntema and Trask (1963). Therefore, 


the temporal-spatial distinction. between 
audition and vision has not been by- 
passed; what Exp. II does show is that 
the modality effect is present even when 
adjacent associative connections are not 
required. 1 

Both experiments demonstrate a modal- 
ity difference when recognition measures 
are used, even though the type of recogni- 
tion test differed for the two experiments. 
Given that STM involves encoding, stor- 
age, and retrieval, if it is accepted that 
the types of recognition tests used here 
obviated the need for retrieval of the 
item, then the locus of the modality effect 
is narrowed down to two alternatives. In 
our opinion, the encoding hypothesis also 
seems unlikely. For one thing, Ss were 
college students, and 500 msec. is cer- 
tainly adequate time to read a word. In- 
trospectively, it seems quite clear that 
each word is seen or heard at the time 
of presentation. Perhaps most important, 
under the slower rate of Exp. II (1 
word/sec) the modality effect still held 
up, and it was most pronounced for 
words in the early part of the list 
(see Fig. 3). The results suggest, then, 
that the modality effect represents dif- 
ferences in storage, and that these differ- 
ences are manifest even when sequential 
associations are not required. 


REFERENCES 


Dae, H. C. A., & BADDELEY, A.D. Alter- 
natives in testing recognition memory. 
Nature, 1962, 196, 93-94. 

Donatpson, W., & Mumpock, B. B, Jr 
Criterion changes in continuous recognition 
memory. J. exp. Psychol., 1968, 76, 325- 
330. 

Kiwrscg, W., & Morris, C. J. Application 
of a Markov model to free recall and recog- 
nition, J. exp. Psychol., 1965, 69, 200-206. 

LACHMAN, R., & TUTTLE, A. V. Approxi- 
mations to English (AE) and short-term 
memory: Construction or storage? J. exp. 
Psychol., 1965, 70, 386-393. 

MANDLER, G. Verbal learning. In New 
directions of psychology III. New York: 
Holt, Rinehart and Winston, 1967. 

McNutty, J. A. A partial learning model 
of recognition memory. Canad. J. Psy- 
chol., 1966, 20, 302-315. 


86 BENNET B. MURDOCK, JR. 


MzrroN, A. W. Implications of short-term 
memory for a general theory of memory. 
J. verbal Learn. verbal Behav., 1963, 2, 
1-21. 

Munrpocx, B. B., Jr. Signal-detection theory 
and short-term m mory. J. exp. Psychol., 
1965, 70, 443-447. 

Munpocx, B, B., Jm. The criterion problem 
in short-term memory. J. exp. Psychol., 
1966, 72, 317-324. (a) 

Murpocx, B. B, Jr. Visual and auditory 
stores in short-term memory. Quart. J, 
exp. Psychol., 1966, 18, 206-211. (b) 

Murpocx, B. B., Jn. Auditory and visual 
Stores in short-term memory. Acta Psy- 
chologica, 1967, 27, 316-324. 

Ocnvr, J. C, & Creetman, C. D. Maxi- 
mum likelihood estimations of ROC curve 


parameters. J. math, Psychol., 1968, in 
press. 

Rosenzere, S., & Comen, B. b. Referential 
processes of speakers and listeners Psychol. 
Rev., 1966, 73, 208-231. 

THORNDIKE, E, L., & Lorcz, I. The teach- 
er's word book of 30,000 words, New 
York: Teachers College, Columbia Uni- 
versity, Bureau of Publication, 1944, 

Tutvine, E, & PEARLSTONE, Z. Availability 
versus accessibility of information in mem- 
ory for words. J. verbal Learn. verbal 
Behav., 1966, 5, 381-391, 

YNrEMA, D. B., & Trask, F. P. Recall as 
a search process. J. verbal Learn. verbal 
Behav., 1963, 2, 65-74, 


(Received May 1, 1967) 


Journal of Eaperimental Psychology 
1968, Vol. 72, No. 1, 87-93 


EFFECTS OF DELAY OF INFORMATIVE FEEDBACK, POST- 
FEEDBACK INTERVAL, AND FEEDBACK PRESENTATION 
MODE ON VERBAL PAIRED-ASSOCIATES LEARNING* 


ROBERT E. JONES, Jr. 
Portland State College 


The purpose of this experiment was to determine the main and 
interactive effects of delay and mode of informative feedback (IF), 


nature of activities required of S during delay, and length of post-IF 
interval in a verbal paired-associate task. Longer post-IF (6 sec.) 
and IF delay (6 sec.) intervals facilitated performance. However, 
specific IF delay effects varied as a function of IF mode and nature 
of activity interpolated during IF delay. Unfilled IF delay facilitated 
performance under the S-R mode but not under R-only mode. 
Performance was virtually identical with immediate IF for both 
IF modes, but the R-only mode was the more difficult with both 


filled and unfilled intervals of IF delay. 


Perhaps the first really definitive 
analysis of the time intervals involved 
in simple motor-skills tasks was made 
by Bilodeau and Bilodeau (1958). 
These investigators conducted a series 
of five experiments involving motor 
learning tasks such as knob turning, 
stick displacing and lever pulling, and 
identified the intratrial and intertrial 
time intervals involved in their proce- 
dure. These time intervals were: (a) 
the foreperiod interval (time from 
ready signal to response evocation), 
(b) informative feedback (IF) delay 
(time from response evocation to the 
presentation of IF), (c) postfeedback 
(post-IF) interval (time from IF with- 


1 This article is based upon a dissertation 
submitted to the University of Utah in partial 
fulfillment of the requirements for the PhD 
degree. The author wishes to thank Lyle 
Bourne, Jr., under whose direction this re- 
search was conducted. 

This research was supported by a pre- 
doctoral fellowship (MH 12685) from the 
National Institutes of Health. 

2 In these appendixes are listed the ob- 
tained frequency of occurrence of trigrams 
in the English language from three inde- 
pendent sources (Thorndike-Lorge, 1944; 
Pratt, 1942; and Underwood & Schulz, 
1960). The sum of these three counts was 
used as the frequency index. 
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drawal to the succeeding ready signal), 
and (d) the intertrial interval (total 
time from one response to the next). 

Recent reviews (Bilodeau & Bilo- 
deau, 1961) of the literature have re- 
vealed contradictory evidence concern- 
ing the effects of these timing variables 
in a variety of human learning tasks. 
Short unfilled intervals of IF delay ap- 
parently do not adversely affect human 
performance on tasks requiring sim- 
ple, discrete responses (eg., Bilodeau 
& Ryan, 1960; Bourne & Bunderson, 
1963; Noble & Alcock, 1958). How- 
ever, there is some discrepancy between 
studies using motor learning tasks and 
those using tasks of a more intellectual 
nature (e.g., concept identification and 
verbal learning) concerning the effect 
of the total intertrial interval. Studies 
using motor learning tasks (e.g., Bilo- 
deau & Bilodeau, 1958; Denny, Allard, 
Hall, & Rokeach, 1960) report that 
intertrial interval is the significant vari- 
able. However, Bourne and Bunder- 
son (1963), using concept-identifica- 
tion tasks, found that two components 
of the intertrial interval (IF delay and 
post-IF intervals) had differential 
effects. More specifically IF delay had 
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no effect, while the post-IF interval 

had a facilitative effect on performance, 

Jones and Bourne (1964) using a ver- 

bal paired-associates (PA) task found 

that increases in both IF delay and 

post-IF intervals benefited performance 

and that these effects were essentially 

additive. 

A possible factor contributing to the 
discrepancies noted above may be task 
differences in the nature of responses 
required of Ss, For example, concept 
identification and PA tasks rely heavily 
upon symbolic responses (language) 
which may mediate the IF delay inter- 
val, whereas the results of one study 
Suggest that verbal responses may be 
of minimal value in mediating IF de- 
lay on very simple motor skill tasks 
(Boulter, 1964). Although IF delay 
has had differential effects in PA and 
concept-identification ^ tasks— facilitat- 
ing performance (Battig & Brackett, 
1961; Jones & Bourne, 1964) and no 
effect (Bourne & Bunderson, 1963) 
on the two classes of tasks, respectively, 
—there remains a distinct possibility 
that unfilled IF delay intervals may 
facilitate performance on other than 
motor-skills tasks by providing S op- 
portunity to rehearse verbal responses, 
Where the responses are simple and 
little or no response learning is re- 
quired (as in concept-identification 
tasks) presumably rehearsal is of little 
utility and no effect is noted from IF 
delay. Where responses are relatively 
complex and response learning is re- 
quired (as in PA tasks with poorly 
integrated response items) presumably 
rehearsal in IF delay aids response in- 
tegration and facilitates performance, 
It follows, therefore, that activity inter- 
polated during IF delay would be ex- 
pected to counteract the facilitative 
effect of IF delay in PA tasks, 

Another basis for the discrepancy in 
results noted above may lie in task 
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differences in the mode of IF presenta- 
tion. In the PA task IF is typically 
accomplished by the contiguous presen- 
tation of both stimulus and response 
(the S-R mode), while in many other 
laboratory tasks only the correct re- 
sponse (the R-only mode) is given. It 
may be argued that the contiguous 
presentation of an S-R pair obviates 
the effect of IF delay. Indeed, the pre- 
sentation of an S-R pair can be de- 
scribed as a learning trial and presenta- 
tion of the stimulus alone as a test for 
prior learning. 

The main Purposes of this experi- 
ment are: (a) to compare filled and 
unfilled IF delay intervals and (b) 
to compare S-R and R-only modes in 
a verbal PA task with difficult re- 
sponses. In addition, the study also 
examines another intratrial time inter- 
val, the post-IF interval, 


METHOD 


The experimental design was an incom- 
plete 2* factorial incorporating : (a) two 
lengths of post-IF interval (0 and 6 sec.), 
(b) two lengths of IF delay interval (0 
and 6 sec.), (c) two modes of IF presenta- 
tion (S-R pair and R-only), and (d) two 
types of interpolated activity (counting and 
no counting during IF delay). The incom- 
plete nature of the design results from the 
impossibility of varying interpolated activ- 
ity during 0-sec. IF delay conditions. Thus, 
of 16 potential treatment conditions, only 12 
were used. 

Subjects. —The Ss were students enrolled 
in introductory psychology classes who re- 
ceived class credit for participation in the 
experiment. Ten Ss were randomly assigned 
to each of the 12 experimental conditions. 

Materials —The learning materials con- 
sisted of 16 (eight pairs) low frequency non- 
sense trigrams selected from the tables pro- 
vided by Underwood and Schulz? (1960). 
The trigrams were selected on the following 
criteria: (a) total letter frequency of occur- 
rence was between 10 and 20, (b) all tri- 
grams which were meaningful words or com- 
mon abbreviations were excluded, and (c) all 
trigrams in which any letter occurred twice 
were excluded. In order to minimize intra- 
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list similarity, the final list was constructed 
so that no letter appeared twice in the same 
ordinal position in any of the trigrams. 
That is, all trigrams began with different 
letters, and no letter appeared as the second 
or third letter in more than one trigram. 

In addition, all trigrams which in E’s 
opinion were easily pronounceable were used 
as stimuli in order to equate response dif- 
ficulty as much as was feasible. 

Procedure —The PA list was typed on 
3-in. wide paper tape and presented by a 
Gerbrands memory drum at a 3-sec. rate. 
The stimulus item appeared on the left side 
of the tape and was followed by the S-R 
pair, or the R member only, depending on 
the IF mode, Under the R-only mode, the 
R was placed on the right side of the tape 
to differentiate clearly among stimuli and 
responses. 

In order to prevent serial memorization 
of responses the list was presented in four 
different orders which were constructed such 
that no pair appeared twice in any quarter 
of the list. A series of Xs separated each 
repetition of the list, resulting in a 3-sec. 
intertrial interval. 

To produce a 6-sec. post-IF interval, two 
blank spaces were interpolated in the tape 
between IF and the succeeding stimulus. 
Informative feedback delay was accomplished 
in a similar fashion except for those condi- 
tions involving counting (C) during IF de- 
lay. For the no counting (NC) inter- 
polated groups two blank spaces occurred 
between each stimulus and IF presentation. 
The Ss were required to sit quietly during 
the IF delay interval. That is, overt stimu- 
lus rehearsal and/or response anticipation 
were not permitted during IF delay. For 
the C groups a number (randomly selected 
between 20 and 100) immediately followed 
each stimulus item and in turn was followed 
by one blank space. The Ss were required 
to say each number as it appeared and then 
count backwards from that number until IF 
was presented. 

Upon arriving for the experiment Ss were 
randomly assigned to an experimental con- 
dition and instructed in the anticipation 
method of PA learning. Those Ss serving 
in IF delay conditions were er in- 
structed that they must respond only during 
the interval during which the stimulus was 
in view in the aperature of the memory 
drum. Any response given during the IF 
delay interval was recorded but not counted 
even if correct and Ss were reinstructed as 
to the proper time to respond. Three Ss 


were dropped from the experiment for re- 
peated overt responding during the IF delay 
interval. 

Learning was carried to a criterion of one 
perfect recitation of the entire list. 


RESULTS 


Total trials and errors (omissions 
plus intralist and extralist intrusions) 
were submitted to analysis of variance 
based on a 3 X 2 X 2 factorial model, 
with the O-sec., and the C and NC 
conditions of 6-sec. IF delay compris- 
ing the three-level variable. The vari- 
ance attributable to this factor was sub- 
sequently partitioned on individual de- 
grees of freedom, which permitted in- 
dependent tests for the significance of 
IF delay and of interpolated activity. 
Similar breakdowns of the variances 
due to the two first order interactions 
involving IF delay (ie. IF delay by 
IF presentation mode and IF delay by 
post-IF interval) were performed. 

The results of analysis of variance of 
both dependent variables were virtually 
identical and indicated that all four 
main effects, as well as the IF delay 
by IF presentation mode interaction, 
were significant sources of variance in 
the present study. Therefore only the 
data for trials to criterion are shown 
in Table 1. 

Mean trials to criterion for all 12 
groups are shown in Table 2. The 
effect of increased post-IF interval was 
to reduce trials to criterion across all 
experimental conditions. This effect 
is consistent with the findings of 
Bourne and Bunderson (1963) and 
with the results of previous investiga- 
tions of PA learning which have ma- 
nipulated intratrial time intervals 
(Hovland, 1939; Jones & Bourne, 
1964). 

The effects of IF delay in the present 
experiment were greatly influenced by 
both the IF presentation mode and the 
nature of interpolated activity during 
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TABLE 1 
ANALYSIS OF VARIANCE OF MEAN TRIALS 
TO CRITERION 
Source MS df F 
Post IF (P) 626 1 |17.249* 
Ire delay (0) 448,5 2 |12:36€ 
0 sec. vs. 6 sec (D) 218.5 1 | 6.02* 
Interpolated (I) activity | 678.5 1 |18.69*9* 
IF Mode (M) 219 1 6.03* 
D'xP 161.5 2 | 1:69 
DXP 33 1 | 9 
XP 90.3 1 | 249 
D'xM 103 2 | 2:84 
DXM 189,03 1 | 5.21* 
IXM 17.11 1| 47 
PxM 1 1] 33 
D'XPxM 56 2 | 154 
Ss within groups 363 | 108 


Total 119 
* «05 


P «.05, 
** P cl. 


IF delay. With immediate IF, per- 
formance was virtually identical under 
both S-R and R-only presentation 
modes, However, the S-R mode was 
considerably easier for Ss in conditions 
of IF delay, 

The unfilled interval of IF delay 
within the S-R Presentation mode fa- 
cilitated performance, a result which 
is consistent with some earlier experi- 
ments (eg. Jones & Bourne, 1964), 
This effect was particularly noticeable 
under conditions of O-sec. post-IF 
interval. The very slight improvement 
from IF delay with 6-sec. post-IF 
interval may be an indication that 
asymptote on improvement from in- 
creasing the distribution interval was 


closely approached ( Underwood, 
1961). 
TABLE 2 
MEAN TRIALS To CRITERION 
IF Delay 
IF Mode | Post IF a= 
o 6NC 6c 
S-R 0 sec, 20.9 10.1 17.7 
6 sec. 15.7 8. 14.5 
R-Only | 0 sec. 17.7 16.2 24.9 
6 sec. 14.5 14.1 15:1 


Similar but insignificant effects were 
Obtained from the unfilled interval of 
IF delay under the R-only IF mode. 
That is, IF delay apparently facili- 
tated performance with O-sec. post-IF 
interval, but had virtually no effect for 
the groups with the 6-sec. post-IF 
interval. 

Counting during IF delay increased 
mean trials to criterion. The means 
for the 6-sec, IF delay groups with 
interpolated activity are very nearly 
equal to comparable means obtained 
under immediate IF. In comparing 
performance within the R-only mode 
under immediate and delayed IF, it is 
apparent that performance deteriorated 
somewhat with counting, particularly 
under the O-sec. post-IF condition. 

Surprisingly, effects of interpolated 
activity and IF presentation mode did 
not interact in the present experiment. 
Under both C and NC conditions of 
F delay, performance was superior 
with IF presentation in the form of 
S-R pairs, and the introduction of 
backward counting during IF delay in- 
creased task difficulty in a fairly con- 
Stant manner for both TF modes. 

The means presented in Table 2 
Were averaged across the two post-IF 
conditions in order to test more pre- 
cisely the IF Delay x IF Presentation 
Mode interaction. It should be noted 
that the obtained F,F (1, 108) = 5.21, 
? < .05, underestimated the degree of 
interaction because in testing the De- 
lay X Mode term on a single degree of 
freedom, C and NC conditions of IF 
delay were averaged. 

The differences among the six means 
were tested for reliability with Dun- 
can’s multiple-range test (Table 3). 
The results indicated that only one 
mean (S-R mode, delayed IF with NC 
activity) differed significantly from any 
others. This result is interpreted as 
indicating that delayed IF significantly 
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TABLE 3 
Duncan’s MuLTIPLE-RANGE TEST OF DIFFERENCES BETWEEN 6 TREATMENT 
MEANS PRESENTED IN TABLE 2 

Sh 

Group M 9 is TA is $5 Io Signitcant 
Ranges 
-R) 6 NC 9.4 5.4* 6.7* 8.1* 8.9* 01.3* Ra = 5.389 
(R) 6 NC 14.8 1.3 2.7 3.5 4.9 Rs = 5.669 
S-R) 6 C 16.1 1.4 2.2 3.6 R, = 5.854 
(R) 0 NC 17.5 0.8 2.2 Rs = 5.987 
-R) 0 NC 18.3 1.4 R$ = 6.092 
(1) (2) (3) (4) (5) (6) 


Note—Any treatment means not underscored are significantly different, 


* p <.05. 


benefited performance under the S-R 
mode but not under the R-only mode. 
No significance was indicated for the 
difference between the means of S- 
and R-only modes with the unfilled IF 
delay interval. Because it was ex- 
pected that performance would differ, 
the two means were tested by t. The 
results indicated that performance Was 
better under the S-R mode, ¢ (38) = 
3.85, p < 01. 


Discussion 


On the basis of previous studies of 
post-IF and IF delay intratrial intervals 
(e.g, Bilodeau & Bilodeau, 1958; Jones 
& Bourne, 1964), it might be expected 
that increasing the length of both inter- 
vals would reliably improve performance. 
In the present study lengthening the post- 
IF interval (always unfilled) produced a 
general facilitative effect which is con- 
sistent with findings in a variety of tasks 
(e.g, Bourne & Bunderson, 1963; Wein- 
berg, Guy, & Tupper, 1964). Although 
there are other possible explanations, it 
seems most reasonable to consider the un- 
filled post-IF interval analogous to a 
study period, and thus response integra- 
tion and/or associative hookup should be 
facilitated by the increased opportunity 
for rehearsal resulting from its extension. 

The overall effect from delayed IF was 
significant with the longer interval gener- 
ally facilitating performance, However, 


unlike post-IF effects, in the present 
study IF delay must be qualified for its 
interaction with other variables. The 
significant interaction between IF mode 
and IF delay was interpreted as indi- 
cating that short unfilled intervals of IF 
delay facilitate performance, in compari- 
son to O-sec. delay, under the S-R mode 
but have little or no effect under the R- 
only mode. The results obtained under 
the S-R mode with no interpolated activ- 
ity replicated in all essentials the results 
of a previous study (Jones & Bourne, 
1964). The facilitative effects of short 
unfilled intervals of IF delay apparently 
combined additively with the effect of 
lengthening the post-IF interval. Per- 
formance under the R-only mode was 
consistent with the results of a number of 
previous studies utilizing tasks other than 
PA learning (Bilodeau & Bilodeau, 1958; 
Bourne & Bunderson, 1963; Noble & 
‘Alcock, 1958) in that there were no ap- 
parent differences in performance under 
immediate and delayed IF conditions. 
The differences in performance attribut- 
able to IF mode in the present study indi- 
cate that facilitative effects from delayed 
IF observed in previous studies of verbal 
learning (Battig & Brackett, 1961; Jones 
& Bourne, 1964) were a function of the 
IF presentation mode employed (i.e., S-R 
pairs). 

In the present experiment backward 
counting during IF delay consistently in- 
creased task difficulty compared to NC 
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conditions, 
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Similarly, Peterson and 


Peterson (1959) and Murdock (1963), 
investigating short term retention, found 


that backward 


counting over a 6-sec, in- 


terval reduced the Proportion of single 
trigrams correctly recalled by approxi- 


mately 60%. 
it might be 


From these and other data, 
expected that backward 


counting would differentially affect per- 
formance under the two IF mode condi- 
tions. For example, any operation which 
Promotes forgetting of the stimulus 


should inhibit 


performance seriously un- 


der the R-only mode, Such a trend was 


evident in the 


present data (see Table 2) 


in that, relative to immediate IF, per- 


formance was 
with interpolat 


inferior under delayed IF 
ed activity, It may be re- 


called that in testing the comparisons just 


noted, the da 


ta were averaged across 


post-IF conditions; the resulting con- 
founding of post-IF effects may account 


for the failure 
liable differenc 
The lack of 


to obtain a statistically re- 
e. 
interaction between these 


two variables may indicate that the prin- 


cipal benefit fr 


om unfilled intervals of IF 


delay with the S-R mode results from 


the fact that d 


luring IF delay S has time 


to rehearse materials other than the im- 
mediate stimulus, That is, because IF 
satisfies the conditions for associative 


learning, S m 


ay rehearse the preceding 


pair, or any other pair, during the IF 


delay interval, 


The effects of IF delay on performance 
under the R-only mode may be explained 


by rehearsal al 


that early in Practice $ must retain the 


Stimulus durin 


g IF delay, by repetition 


or rehearsal, until the R member of the 
pair is presented. Later in Practice, after 


On the basis 


of the present data, there- 


fore, it appears that the consistently facil- 
itative effects of all unfilled intratrial in- 


tervals in the 


PA task can be explained 
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by the opportunity such intervals provide 
S for rehearsal.s 

The present data support and extend 
Bourne and Bunderson’s (1963) previous 
findings concerning the relative effects of 
post-IF and IF delay intervals in that, un- 
der the R-only mode, IF delay had no 
reliable effect and lengthening the post- 
IF interval facilitated performance. Fur. 
ther, these results, in conjunction with 
other recent data, demonstrate remarkably 
similar effects on performance of a 
variety of tasks from manipulating short 
unfilled intratrial time intervals; For 
example, Weinberg, Guy, and Tupper 
(1964) obtained facilitative effects on 
simple motor skill performance by length- 
ening the post-IF interval, Finally, the 


?An alternative explanation, first sug- 
gested by Bilodeau and Bilodeau (1958) 
may be found in data on distributed practice 
(DP) effects on human performance, (It 
should be noted that IF delay and post- 
IF interval have often been lumped and in- 
vestigated under the rubric of distributed 
practice). It is well documented that DP 
interacts with task difficulty such that the 
more difficult task benefits disproportionately 
from DP (Underwood, 1961). It might be 
expected, therefore, that when an extremely 
simple response is required (e.g., stick pull- 
ing or button Pushing) little or no beneficial 
effect or even deleterious effects might re- 
sult from DP. With more difficult tasks, 
lengthening IF dela ; and hence DP, might 


present PA list consisted of poorly inte- 
grated response items, It is possible that 
lengthening intratrial intervals might provide 
for the "spontaneous recovery" of wholly or 
partially extinguished prior letter sequence 
habits, thus producing the repeated extinc- 
tion-spontaneous recovery-extinction cycle 
which Underwood hypothesizes results in 
greater net extinction of competing habits, 
However, because of unfilled post-IF inter- 
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present experiment demonstrates that IF 
presentation mode is a critical variable in 
determining the effects of IF delay on PA 
learning. At present there are no data 
available concerning the generality of IF 
mode effects on tasks other than verbal 
learning. However, it is of interest to 
note that Hockman and Lipsitt (1961), 
who obtained deleterious effects from de- 
layed reward on acquisition of a discrimi- 
nation learning task, and Brackbill and 
Kappy (1962) who found no effects on 
discrimination learning from delayed re- 
ward, used IF presentation modes which 
correspond very closely to those used in 
the present study, i.e., R-only and S-R, 
respectively. There were other methodo- 
logical differences between the two studies, 
but it is interesting to speculate to what 
extent the discrepant results might be ac- 
counted for by the differences in IF 
modes employed; the problem seems 
worthy of empirical investigation. 
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SOME EVIDENCE FOR A COMPARATOR IN 
ADAPTATION TO OPTICAL TILT: 


SHELDON M. EBENHOLTZ 
University of Wisconsin, Madison 


In 3 experiments, after varying 


periods of adaptation to a rotated 


visual field, Ss were immediately exposed to a changed optical tilt that 


ference between the LA and prism tilt. Performance in other transfer 


Situations as well as that descri 


bed above was consonant with the } 
| 


assumption that S Compares present with Previous optical tilt and 


that adaptation is a linear function of this angular difference, 


shown 
adaptation to optical tilt to be a nega- 
tively accelerated function of exposure 
(Ebenholtz, 1966; Mikaelian & 
Held, 1964). Inan attempt to account 
for the shape of this function (Eben- 
holtz, 1966), the Problem of the effec- 
tive optical stimulus in adptation arises, 
The present study explores two pos- 
sible solutions, each of Which could in 
principle account for the negatively 
accelerated time rate or adptation, 
The dependence hypothesis requires 
that the tilt to which S is exposed be 
defined relative 
state of adaptation of the eye. Thus for 
a given S the effective optical stimulus 
may be determined by the angular dif- 
ference between the level of adaptation 


2 In defining the stimulus for effective opti- 
cal tilt the registration of head position is 
implicitly assumed, 


Proportion of this diminishing angle, | 
the negatively accelerated time function 
may be derived. 

In contrast to the above, the inde- 
pendence hypothesis assumes that the 
effective optical stimulus is independent 
of the momentary state of adaptation. 
According to this view, tilt may be de- 
fined in terms of the angular flow of 
the retinal pattern contingent upon a 
lateral head movement. The effective 
optical stimulus js assumed to be the 
angular difference between two succes- 
sive flow patterns as, €.g., between the 
Preadaptation direction and that pro- 
duced by viewing through tilting 
prisms, or between respective directions 
of successive exposures to two different 
prism tilts. The Process of analyzing 
differences in Prism tilt is assumed to 
be the function ofa comparator (Held, 
1961) whose output largely reflects 
recent stimulation and which therefore 
becomes less efficient as the time inter- 
val between present and previous tilt 
increases. The negatively accelerated 
time-rate function is presumed to re- 
flect the diminishing sensitivity of the 
comparator over time, Failure of the 
comparator to detect an angular dif- 
ference would Presumably signal the 
completion of the adaptation process, 
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A test of the foregoing hypotheses 
may be made in a transfer paradigm 
where after reaching a given level of 
adaptation S is exposed to a prism tilt 
that matches his LA. Under this con- 
dition, from the point of view of the 
dependence hypothesis, effective optical 
tilt equals zero. Consequently, S's 
level of adaptation should remain con- 
stant during the transfer period. On 
the other hand, since a difference in 
prism tilt obtains between the initial 
and transfer periods then, reasoning 
from the independence hypothesis, the 
stimulus condition for further adapta- 
tion exists. Assuming adaptation to 
be a linear function of optical tilt 
(Ebenholtz, 1966; Morant & Beller, 
1965), the change in adaptation should 
be proportional to the difference angle 
between the two prism tilts and either 
increase or decrease depending upon 
the relative direction of the initial and 
transfer prism tilts, respectively. 


PROCEDURE 
Apparatus 


A tilited visual field was produced by 
having S view through a cylinder contain- 
ing two Dove prisms mounted in tandem. 
The cylinder was fixed over one eye by 
means of a head piece, while the other eye 
was occluded. Any desired degree of tilt 
between 0 (upright) and 45°, clockwise 
(CW) or counterclockwise (CCW) could 
be obtained by rotating one prism relative to 
the other. The field of view was restricted 
approximately to 30° in all directions. 

The measurement of adaptation was ac- 
complished in a completely darkened room 
by having S set an electroluminescent panel to 
the apparent vertical in the frontal plane. The 
panel produced a line of light 4 in. wide and 
12 in. long, and turned about an axis at its 
base. During testing S viewed the line mo- 
nocularly with head fixed and without prisms 
at a distance of 5 ft. The S's eye level was in 
the same horizontal plane as the pivot point 
of the luminous panel. Four measures of the 
upright were taken at each session with the 
starting position of the rod alternating be- 
tween 25° CW and 25° CCW of true ver- 


tical. The LA was determined by the dif- 
ference between the means of the pre-and 
postadaptation measurements. In all experi- 
ments S’s right eye only was exposed during 
adaptation and in all groups half of the Ss 
received CW tilts, the remaining Ss receiv- 
ing CCW tilts. Activity during the ex- 
posure period was limited to walking through 
familiar hallways and up and down staircases. 


Design 


Experiment I.—Sixteen female undergrad- 
uate Ss were assigned to one of two groups. 
The experimental (E) group was exposed 
to an optical rotation of 30° for 30 min. 
with tests at 15-min. intervals. Imme- 
diately following the last test session S's 
prism tilt was adjusted so as to approximate 
his LA. A second ł-hr. adaptation period 
followed with measures again taken at 15- 
min. intervals. The control (C) group was 
exposed to an optical rotation equivalent to 
the mean shift in tilt received by Group E, 
ie, the mean difference in optical tilt be- 
tween initial and transfer tasks. Since the 
average change in tilt for Group E was not 
known until all Ss of that group had com- 
pleted their tasks, Group C was run after 
Group E. 

It may be noted that the dependence hypo- 
thesis requires that no change in LA occur 
for Group E in the transfer task relative to 
the level attained immediately prior to the 
change in optical tilt. On the other hand 
the independence hypothesis permits the in- 
ference that the downward shift in optical 
tilt should produce a drop in LA for Group 
E and, further, that this change in LA 
should be equivalent to the LA achieved by 
Group C. 

Experiment II.—Two groups, each com- 
posed of 8 female Ss, were distributed on an 
alternating basis among two conditions. 
In Part 1 all 16 Ss were exposed to an 
optical tilt of 18° for a 2-hr. period. Mea- 
surements of adaptation were taken at 10- 
min. intervals throughout the experiment. 
Part 2 consisted of a 2-hr. transfer task in 
which, for Group D, optical tilt was adjusted 
downward by a decrement equal to the dif- 
ference between the LA and previous optical 
tilt, ie. the new optical tilt was given by 
the formula 18— (18— LA). This pro- 
cedure served to equate S's optical tilt with 
his previously attained LA at the beginning 
of the transfer period. For Group U optical 
tilt was adjusted upward during transfer 
according to the formula 18+ (18— LA). 
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Thus Groups U and D provided cases in 
which prism tilt during transfer deviated 
equally but in opposite directions relative 
to the prism tilt of the initial period. 

Predictions were similar to those of Exp. 
I, viz., the dependence hypothesis required no 
change in LA during transfer for Group D, 
but a large shift for Group U. The inde- 
pendence hypothesis required that the LA 
change in proportion to and in the direction 
of changes in prism tilt, i.e., that the changes 
in LA be in opposite directions but be equal 
in magnitude for the two groups. . 

Experiment III—Eight male and eigh 
female Ss were evenly distributed into two 
groups with tests carried out every 10 min. 
throughout the experiment. Group 1 was 
exposed to an optical tilt of 30? for 3 hr. 
This was followed by a 1-hr. transfer period 
in which optical tilt was altered to match 
the LA reached at the end of the initial 30- 
min. period. The matching technique used 
in the previous experiments introduced a 
slight error based upon the fact that optical 
tilt was measured in degrees of arc relative 
to the gravitational vertical whereas LA 
referred to a deviation from the mean pre- 
adaptation level. Since the pretest rarely 
yielded values equal to 0, but showed mean 
deviations within 3°, the matched optical 
tilt deviated somewhat from the LA. In 
Exp. III optical tilt was adjusted to re- 
flect S's preadaptation score, and thereby to 
provide a more precise attempt at matching 
LA with optical tilt. 

The Ss forming Group 2 were first ex- 
posed to a prism tilt of 0° for 3 hr. For 
the 1-hr, transfer period, a given S in Group 
2 was exposed to an optical tilt equivalent to 
that of the immediately preceding Group-1 
S, adjusted for the former’s preadaptation 
measure. In this fashion, Ss of the two 
groups were matched during transfer in 
terms of the deviation between the prism 
tilt and their respective preadaptation test 
scores, 
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As in the previous instances the dependence 
hypothesis requires no change in LA during 
transfer for Group 1, Based upon. the 


‘assumption that adaptation is linearly related . 


to differences in optical tilt, the independence 
hypothesis permits. several inferences con- 
cerning performance during the transfer 
period, First, since the magnitude of- optical 


tilt during transfer is less than that of the’ 


initial period, Group 1 should yield a nega- 


tively decelerated function of time for the | 


first 3 hr. Second, at the end of the first 
3 hr, Groups 1 and 2 should be equal in 
LA. This conclusion derives most explicitly 
from the linear functions describing the in- 
crement in adaptation (AA) in each group, 
where AA —K(TzT:). In this expression 
K represents the slope (ie, the proportion 
of adaptation), which is assumed to be con- 
stant for a given exposure interval, with the 
angular difference in optical tilt (T) be- 
tween two successive exposure periods in- 
dicated by T:-T;. Table 1 represents the 
values of T and AA for the two groups 


as a result of the initial 3-hr. exposure period 


as well as for the first } hr. of transfer. 
During transfer, Ta takes on the value given 
by LA as required by the paradigm presented 
above. The final level of adaptation reached 
by Group 1 after 3 hr. of transfer is given 
by the sum of the individual 3-hr. increments 
and equals K X LA. As Table 1 indicates 
Group 2 is expected to reach the same level 
after a comparable exposure interval. 

It was assumed that performance during 
the second 3 hr. of transfer would be com- 
parable for the two groups. 


RESULTS 


Experiment I.—The LA reached by 
Group E after 4 hr. of exposure to an 
optical tilt of 30° averaged 8.3°. This 
was significantly different from zero, 


TABLE 1 


DIFFERENCES IN OPTICAL Tikt 


(T) AND INCREMENTS IN ADAPTATION (A A) FOR THE INITIAL 
AND TRANSFER PERIODS OF GROUPS 1 AND 2, RES 


IPECTIVELY, OF Exp. III 


Group 1 Group 2 
TET: AA TCT: AA 
Initial $ hr. — , 30-0 K X 30 0 
First $ hr. of transfer LA-30 K X LA-K X 30 LAO K X LA 
ZAA SR K XLA } KXLA 
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LEVEL OF ADAPTATION (DEGREES) 
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Fic. 1. Magnitude of adaptation as a function of exposure time on original and 


transfer tasks, respectively, with prism tilt (T) as shown. 


t (7) = 5.55, p < 01. During transfer 
Ss of Group E were exposed to an 
optical tilt equivalent to their pre- 
viously attained LA. In setting the 
prism apparatus to the nearest degree, 
errors due to rounding yielded an aver- 
age T of 81°. This procedure pro- 
duced decrements in LA of 44^ and 
5.4? at the end of 15 min., and j hr. 
of transfer, respectively. Both decre- 
ments were significantly different from 
zero, t (7) — 480 and 5.96, respec- 
tively, with p < .01 in each case. 

For Group E the mean difference 
in optical tilt between initial and trans- 
fer tasks approximated 22°. On the 
assumption that this difference con- 
stituted the effective optical stimulus 
during transfer, it was expected that 
exposure of Group C to a T of 22° 
would produce increments in LA com- 
parable to the decreases in LA yielded 


by Group E during transfer. After 15 
and 30 min. Group C yielded 3.8° and 
5.3? of adaptation, respectively. In 
no case were these values significantly 
different from the corresponding trans- 
fer decrements of Group E; for 15 and 
30 min. t (14) =.56 and 096, p > 
.05, respectively. 

Experiment II.—The LA reached 
during initial and transfer tasks for 
Groups U and D, respectively, are rep- 
resented in Fig. 1. Analysis of vari- 
ance showed no significant differences 
between the groups during the first 
task, F (1, 14) < 1. Exposure time 
yielded a significant effect, F (11, 154) 
= 6.087, p < .01, with LA increasing 
between the first and twelfth test inter- 
val The Groups X Test-Interval in- 
teraction was not significant, F (11, 
154) < 1. 

During transfer the prism tilt for 
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Ss of Group U averaged 31.0? while 
that of Group D Ss averaged 4.8°, 
These values represent very nearly 
equal shifts in opposite directions with 
respect to the prism tilt of 18° em- 
ployed in the initial task. For Group 
D the tilt of 4.8° approximated the 
LA of 5.5 registered at the final test of 
the first task. As Fig. 1 indicates, 
after the first 10-min. transfer period, 
Group D yielded a decrement of 3,1° 
whereas Group U showed an increment 
of 2.2°. Both values were significantly 
different from zero; for Group U ¢ (7) 
= 2.76, p< .05 and t (7) = 843, p 
< Olin the case of Group D. In order 
to determine whether the magnitude of 
the changes in LA were different for 
the two groups, the transfer decrement 
at each test period and for each S in 
Group D was computed, This value 
was then added to the respective S’s 


LEVEL OF ADAPTATION (DEGREES) 


oo 


LA as measured at the test immediately 
preceding the transfer period. These 
transformed scores enabled a direct 
comparison to be made with the values 
of LA achieved by Group U. Although 
Group D consistently yielded slightly 
higher scores, analysis of variance re- 
vealed no significant differences in the 
magnitude of the change in LA, be- 
tween the two groups, F (1, 14) = 
2.39, p > .05. Neither exposure time, 
F (11, 154) « 1, nor the interaction of 
Groups X Time, F (11, 154) — 1.03, 
$ > .05, was significant. 

Experiment llL—During the first 
3-hr. interval, Group 1 reached an LA 
of 4.8? as a result of exposure to an 
optical tilt of 30°. This value was sig- 
nificantly different from zero, ¢ (7) = 
5.75, p < .01. Group 2 showed no sig- 
nificant adaptation effect, t (7) = 1.56, 
p> .05, the average LA being .33° 
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Fic. 2. Magnitude of adaptation as a function of exposure time on initial and 


transfer tasks, respectively, 


with prism tilt (T) as shown. 
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after 30 min. of exposure to a zero de- 
gree prism tilt. 

The average LA as a function of ex- 
posure time for Groups 1 and 2, re- 
spectively, is represented in Fig. 2. 
During transfer the prism tilt for Ss 
of both groups averaged 4.6°, a value 
quite close to the LA of 4.8? acheived 
by Ss of Group 1 after their initial ex- 
posure period. Analysis of variance of 
the transfer data showed no significant 
differences between the groups, F (1, 
14) = 1.56, p > .05. Neither exposure 
time nor the Groups X Time interac- 
tion was significant, F (5, 70) = .90 
and 1.02, respectively. 


DISCUSSION 


The three experiments were consistent 
in indicating a drop in LA under condi- 
tions in which the magnitude of optical 
tilt was set initially equal to the LA. 
This finding is similar to that reported 
by Kohler (1946) for the curvature after- 
effect induced by right-angled prisms. 
In the present context these results per- 
mit the rejection of the dependence hypo- 
thesis whereby the effective optical stimu- 
lus was defined in terms of the angle 
between the instantaneous LA and the 
given prism tilt. It also follows that 
explanations of the negatively accelerated 
time rate of adaptation in terms of a 
steadily diminishing separation between 
LA and prism tilt may be considered to 
be unlikely. On the other hand, the 
direction and magnitude of adaptation 
during the three transfer tasks was con- 
sonant with inferences drawn from the 
assumption that the effective stimulus for 
adaptation is the angular difference be- 
tween the prism tilts of the initial and 
transfer periods, respectively. Some sup- 
port is thus gained for a model of the 
adaptation process which has provision 
for the storage of optical tilt and the 
comparison of such information over suc- 
cessive exposure intervals (Held, 1961; 
Rock, 1966). 

The present data yielded two results 


which suggest some limitation to the 
comparator model presently used to infer 
changes in LA under transfer conditions. 
In Exp. II the transfer data yielded no 
significant differences in the decrement 
in LA exhibited by Group D relative 
to the increment in LA shown by Group 
U. Nevertheless, as Fig. 2 indicates, 
changes in LA in the upward direction 
were consistently lower than the down- 
ward shifts of Group D. Should this 
asymmetry prove to be reliable some 
change in the assumption that adaptation 
is a linear function of differences in opti- 
cal tilt would be required. The second dis- 
crepant result occurred in Exp. III. Groups 
1 and 2 were expected to reach the same 
LA after 3 hr. of transfer (See Table 1). 
Prior to that point Group 1 should have 
yielded higher values of LA than Group 
2, for it was assumed that the two groups 
were adapting in opposite directions and 
would therefore yield growth curves with 
slopes of opposed sign. The failure to 
obtain a significant interaction effect of 
Groups X Time probably reflects the 
relatively small sample size, as well as 
the slope of the adaptation curve at the 
time of the test. Thus it is probable 
that, judging from the shape of the func- 
tions in Fig. 2, a test aíter 5 min. of 
transfer would have shown a reliable dif- 
ference between the two groups. 
Although the assumption is tenable 
that adptation proceeds in response to à 
comparison that is made between prism 
tilts, it may be argued that a more par- 
simonious view exists. Namely, that S 
responds to each successive prism tilt 
independently of any previous tilt. Con- 
sonant with this possibility is the assump- 
tion that the vertical meridians of the 
eyes, which serve functionally to separate 
the hemiretinas, act as a base line against 
which to measure effective optical tilt. 
It would appear that the negatively de- 
celerated transfer functions represented 
in Fig. 1 and Fig. 2 argue against this 
view, for in this case negatively accel- 
erated functions should be the rule, Nor 
is it apparent what explanation of the 
negatively accelerated growth curves 
might be entailed by this position. 
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Nevertheless additional evaluation of this 


alternative is required. 
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EFFECTS OF PRIOR SERIAL LEARNING OF SOLUTION 
WORDS UPON ANAGRAM PROBLEM SOLVING: 


IL. A SERIAL POSITION EFFECT ' 


GARY A. DAVIS Ax» MARY E. MANSKE 


Wisconsin Research and Development 
Center for Cognitive Learning" 


20 Ss learned a serial list of 10 10-letter words. The 10 words were 
subsequently presented to Ss in the form of anagram problems. Ana- 


gram solution times were closely 
curve which appeared in the seri 


In any anagram problem-solving 
task, S is required to rearrange scram- 
bled problem letters into a meaningful 
solution word. Thus several writers 
have reasonably assumed that S will 
successively test and reject, implicitly or 
explicitly, various letter combinations 
until a solution word is found which 
contains exactly the given scrambled 
letters (Davis, 1966; Mayzner & Tres- 
selt, 1958; Ronning, 1965). It has 
been found that this hierarchy of letter 
combinations, or response tendencies, 
is related to characteristics of the Eng- 
lish language such as frequency of 
occurrence of the solution word and 
frequency of occurrence of bigram and 
trigram letter combinations (e.g., Mayz- 
ner & Tresselt, 1958; Ronning, 1965). 

In an earlier unpublished experi- 
ment, the present authors sought to 
exert more direct control over the hier- 
archy of potential solution words, or 
letter combinations, which S is assumed 
to test and reject. The strategy sim- 
ply was to have Ss learn a serial list of 
words which subsequently would be 
used as anagram problems. Since Ss 
were informed that the solution to 


1 This research was performed pursuant to 
contract OE 5-10-154 with the United States 
Office of Education, Department of Health, 
Education, and Welfare, under the provisions 
of the Cooperative Research program. 


related to the classic serial-position 
al learning task. 


each six-letter anagram was a single 
word from the lO-word list, it was 
assumed that S would systematically 
recall the “hierarchy” of words in their 
learned serial order until a word was 
found which would exactly match the 
anagram letters. It was thus predicted 
that anagram problem-solving time 
would be directly related to the ordinal 
position of the solution word in the 
serial list. 

There was another possible predic- 
tion, however. In view of the very re- 
liable U-shaped serial-position effect, 
in which the first and last items in a 
serial list are the first learned and the 
items just past the middle of the list 
are the most difficult to learn, it was 
conceivable that the anagram solution 
times could be related to the differential 
response strength reflected in the serial- 
position curve. And this interpreta- 
tion, in fact, was clearly more accurate 
_the anagram solution times plotted 
over the ordinal position of the solu- 
tion word in the serial list showed the 
same inverted U function as did the 
serial-list measure of errors to criterion. 

There were methodological difficul- 
ties in that experiment, however, which 
reduced the authors’ confidence in this 
finding. Namely, only two different se- 
quences of the 10 words were used, 
each S was tested on only 5 of the 10 
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possible anagram problems, and the 6- 
letter anagrams were, for the most part, 
too easy when the available population 
of solution words was so clearly limited 
and specified. 

The present experiment was a bet- 
ter designed and counterbalanced 
attempt to determine the effects of prior 
learning of solution words in a serial 
list upon later anagram problem-solv- 
ing performance. Based upon the ear- 
lier experiment, it was predicted that 
anagram solution times would be 
closely related to the differential re- 
sponse strength of solution words re- 
flected in the classic serial-position 
curve, 


METHOD 


Subjects—The Ss were 20 undergraduate 
students, juniors and seniors, in introductory 
educational psychology courses (6 male, 14 
female). 

Materials and procedure. —The serial list 
of solution words consisted of 10 10-letter 
words (ammunition, comparison, equivalent, 
hemisphere, philosophy, phonograph, legiti- 
mate, silverware, tournament, vocabulary), 
The words, presented via slide projector, 
were learned in a standard anticipation 
fashion, at a 3-sec. rate, with 10 sec. be- 
tween repetitions of the list. In order to 
counterbalance any word effects (upon later 
anagram solution times), a 10 X 10 Latin 
square was devised with two Ss learning 
each sequence. A fairly rigid criterion of 
five errorless trials, not necessarily con- 
secutive, was used to insure overlearning. 

After S met the serial-learning criterion, 
he solved 10 anagram problems. The S was 
informed that the solution to each anagram 
was a word on the serial list. Each anagram 
was typed in capital letters on a 3 x 5 white 
card. Anagram letter orders, which were 
randomly determined, were ONATNMIMIU, 
SPMOOICRNA, VUETNLOEAI, ESIM- 
PREHHE, HOHIPOYSLP, ROOGPAH- 
HPN, GTELTIIAME, EAISRRVWEL, 
NETRTUMONA, and ABROLVYUAC. 
Presentation order of the 10 anagram prob- 
lems was determined by another 10x 10 
Latin square with ordinal position of the 
solution word in the serial list as the Latin- 


ized (treatment) variable. The Ss did not 
use a paper and pencil. Solution times were 
recorded by a stopwatch, which was started 
with the presentation of the anagram and 
stopped when S correctly stated the solution 
word. Every S successfully learned the 
serial list and successfully solved every ana- 
gram problem. 


REsuLTS 


Anagram solution times and errors 
to criterion on the serial list, both plot- 
ted as a function of the ordinal position 
of the solution word on the serial list, 
are shown in Fig. 1. An analysis of 
variance on anagram problem-solving 
times showed no main effects (problem 
Sequences, ordinal position of anagram 
problem, ordinal position of solution 
word in serial list) to be significant, 
However, the quadratic component of 
the ordinal position of the solution 
word in the serial list did reach sig- 
nificance, F (1, 162) = 4.25, p <.05.? 


33 


30 


27 


w 
E H 
21 2 
Š mra 
L4 18 
3 2 
LEN p 8 
i H 
$ oae È 
EH x 
Zz 9 a 
"Anagram Time 
6 w-—~Serial List Errors 
3 
o 
1 2 3 4 5 6 7 8 9 10 
ORDINAL POSITION ON SERIAL ust 
Fic. 1. Mean anagram solution times (in 


seconds) and mean errors to criterion on 
the serial list as a function of the ordinal 
Position of the solution word on the serial 
list. (Each point represents scores from all 
20 Ss.) 


? While an F significant at only .05 could 
a one-in-twenty chance occurrence, the 
fact that the same inverted U function was 
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The linear trend was not significant, 
F (1, 162) =.88. The Pearson cor- 
relation between mean anagram solu- 
tion times and mean errors to criterion 
on the serial list, for each ordinal posi- 
tion, was r (8) = .781, p < 01. 

An analysis of variance on the more 
stable errors to criterion measure on 
the serial-learning task showed ordinal 
position to be significant, F (9, 162) = 
16.20, p < .001, and both the linear and 
quardratic components were significant, 
Fs (1, 162) = 37.28 and 100.93, re- 
spectively, p < .001 for both, Neither 
word differences, the Latinized vari- 
able, nor sequences differed significantly. 


Discussion 


It is quite clear that the anagram prob- 
lem-solving times shown in Fig. 1 reflect 
the same “serial-position curve” as does 
the performance on the serial list itself: 
Words learned better in the serial list 
were also solved faster as anagrams. 
Since 10 different sequences were used 
in the serial lists, with each solution 
word appearing equally often in each 
ordinal position, the U-shaped problem- 
solving function cannot be attributed to 
any word effects such as word frequency, 
bigram or trigram frequency, or any other 
word differences. Anagram presentation 
order was also carefully counterbalanced. 

Apparently, Ss did not solve the ana- 
gram problems by deliberately recalling 
the list in the serial order learned, a con- 
clusion supported by the nonmonotonic 
shape of the anagram curve in Fig. 1 
and by reports from Ss The problem 


found in the unpublished earlier experiment 
considerably raises one’s confidence in the 
findings. 

3 Tn order to clarify the subject’s approach 
to the anagram task, several subjects simply 
were asked how they solved the problems— 
did they actively and systematically r 
the earlier-learned serial list, matching 
word with the anagram letters, as the 
authors had previously supposed? Without 
fail, the subjects reported that they did not 


which naturally arises is to explain pre- 
cisely how Ss did in fact solve the ana- 
grams, ie. to explain why the anagram 
curve shows the serial-position effect. 
First, it is conceivable that the anagram 
letter cues elicited potential solution 
words directly and that the relative ease 
with which the anagram elicited the cor- 
rect solution word was related to the de- 
gree of response strength, availability, or 
integration of that solution word (cf. Jen- 
sen & Rohwer, 1965). A second possibil- 
ity is that, quite independent of the spe- 
cific anagram letter cues, Ss could have 
recalled the words in the order learned, 
rather than in the order of the serial list 
itself. As a third possibility, the pres- 
ent results would also be consistent with 
the finding in verbal learning experiments 
that when items are presented in a con- 
sistent serial order, but S is allowed to 
reproduce them in any order, he will often 
recall the last items first rather than be- 
ginning with the first item and recalling 
the list in sequence (Deese & Kaufman, 
1957). While all of these interpretations 
would be consistent with recent explana- 
tions of serial learning, introspective re- 
ports from Ss would seem to support the 
first hypothesis. 

Regarding human problem solving in 
general, the present experiment repre- 
sents an unsuccessful attempt to exert 
direct control over the hierarchy of re- 
sponse alternatives which S is assumed to 
sequentially test and reject. The data 
are, however, consistent with traditional 
notions of problem solving in which the 
strength of various responses and com- 
binations of responses are considered to 
be related to the amount or strength of 
prior learning (cf. Davis, 1966). Finally, 
it should be noted that these results are 
not inconsistent with experiments demon- 
strating critical effects of word frequency 
or bigram or trigram frequency upon ana- 
gram problem solving, since these factors, 
rather than the serial-position effect, nor- 


bother to systematically recall the list unless 
they could not solve the problem “other- 
wise. As one sübject put it, the solution 
words usually “just popped in." 
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EFFECT OF SECONDARY REINFORCEMENT SCHEDULES 
ON PERFORMANCE OF PROBLEM-SOLVING TASKS 


ANNE DOHERTY AND RICHARD A. WUNDERLICH 
Catholic University 


The effect of varying the rate of secondary reinforcement (S*) pre- 
sented during extinction from that presented during the initial phase 
was studied using problem-solving tasks. 90 adolescent male Ss were 
assigned to 6 groups, matched for age, I.Q., and arithmetic achieve- 
ment. For the 3 experimental groups, which received in the first 
phase both a candy reward as primary reinforcement (P7) and colored 
tokens as S”, the presentation of S* was increased, maintained, or elim- 
inated during extinction. For control groups, which initially received 
either P" unpaired with the tokens or the token reinforcement alone, 
tokens were increased, maintained at the earlier level, or introduced for 
the first time during extinction. Results indicated that groups pre- 
sented with an increase in S" in extinction continued to perform the 
tasks, completing a greater number of problems than groups whose ex- 
tinction rate of S" was either decreased or maintained at the initial 


level. 


It has been assumed that stimulus 
characteristics may acquire secondary 
reinforcement potential by virtue of 
being paired with primary reinforce- 
ment (Hull, 1943; Salzman, 1949). It 
has also been proposed that the char- 
acteristics of a stimulus retain rein- 
forcement potential because they lessen 
the discriminable differences between 
conditions in which primary reinforce- 
ment (P') is present and those in 
which it is absent. According to this 
hypothesis, although the Pr is removed 
during extinction, the difference be- 
tween this period and the conditioning 
phase is not made exceptionally notable 
and extinction is thus prolonged (Bit- 
terman, Feddersen, & Tyler, 1953; 
Mowrer & Jones, 1945). 

A recent approach to the study of 
secondary reinforcement (S") has in- 
vestigated the effects of varying the 
rates of S" during extinction. In 1962, 
Myers and Myers, using preschool 
children, demonstrated that changing 
a fixed ratio of S" during acquisition of 
an operant response to continuous S 
during extinction results in increased 


resistance to extinction in comparison 
to that evidenced when no change in 
S" is introduced. Increasing a fixed 
ratio of reinforcement during extinc- 
tion over that presented during acquisi- 
tion also appears to retard extinction 
(Myers & Myers, 1963). 

The present study extended earlier 
research on the nature of S" by investi- 
gating interval, rather than ratio, rein- 
forcement schedules; and by replacing 
operant bar-pressing tasks with a prob- 
lem-solving situation. In addition, the 
experiment employed older S's to deter- 
mine how varying schedules of S" 
would sustain their performance on 
problem-solving tasks. 


METHOD 


Subjects —A total of 90 male Ss, seventh 
and eighth graders from two parochial 
schools of comparable socioeconomic level, 
were used. Fifteen Ss were assigned to 
each of six groups equated for age and 
Otis IQ scores on the Stanford Arithmetic 
Achievement test were at the eighth-grade 
level in all groups. 

Procedure—Simple math problems, in- 
volving numerical progressions solvable by 
addition, subtraction, multiplication, or di- 
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vision were presented to Ss on small sheets 
of paper, each containing one problem. 
Three numerals of a Series were given as 
cues and Ss were required to provide the 
fourth in the series. The boys were in- 
structed that they could do as many, or as 
few, of the problems as they wished; and 
that no one, including their teachers, would 
be told how many they completed or left 
incomplete. They were also told that they 
would receive a reward from time to time, 
but the rewards were not exposed before- 
hand. The Ss usually placed their rewards 
Or tokens aside as they continued the task. 
Because all Ss were required to remain for 
a half hour in the experimental room, 
reading material was provided for those 
who did not choose to solve problems the 
entire time. 

During the first phase which lasted 15 
min, all Ss were reinforced on variable 
interval (VI) schedules with Pr (penny 
Tootsie Rolls), colored cardboard tokens, or 
a combination of both types of reward. 
Groups E-1, E-2, and E-3 were designated 
as experimental groups Scheduled to receive 
both Pr and Sr during this time. The mean 
for the VI schedule of PF was 2 min.; that 
for S", 60 sec. Control Groups C-1 and C-2 
received only tokens during the first phase 
on the same VI schedule as Groups E-1 
and E-2, while C-3 received Pr only, on the 
same VI schedule as the experimental groups. 

An attempt to extinguish performance on 
the tasks was introduced during the last 
15 min. of the experimental period. In this 
extinction phase, no group received Pr and 
the rate of token reinforcement was: (a) 
increased, (5) eliminated, (c) introduced for 
the first time, or (d) remained the same 
as that presented earlier. In Groups E-1 
and C-1, token reinforcement was increased 
from 60 sec. to a VI schedule with a mean 
of 20 sec. Groups E-2 and C-2 continued 
on the 60-sec. schedule of the initial phase. 
Group E-3 was extinguished without any 
secondary reward, while C-3 began to receive 
tokens for the first time during extinction. 
This groups VI schedule with a mean of 

Sec. was the same as Groups E-1 and C-1. 
The design of the experiment is presented 
in Table 1. 

During both phases of the experiment, 
automatic programming was used to record 
responses and to signal reinforcement, 
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There were no significant differences 
in the number of problems attempted 


TABLE 1 


SCHEDULES (IN SEC.) AND MEAN NUMBER 
OF EXTINCTION RESPONSES 


Initial Phase Extinction Phase 
Mean No. 
Candy | Tokens | Tokens of 
Responses 
E-1 120 60 20 52.8 
E-2 120 60 60 39.0 
E-3 120 60 AXL 17.2 
C-1 =? 60 20 30.2 
C-2 = 60 60 19.8 
C-3 120 m3 20 30.6 


by the groups in the initial phase of 
the experiment. The mean number of 
responses (problems completed) of 
each group during extinction is shown 
in Table 1. An analysis of variance 
of extinction responses yielded a sig- 
nificant F beyond the .01 level, F (5, 
84) = 14.54, Comparisons among ap- 
propriate experimental and control 
groups (E-1 vs. C-1, E-2 vs. C-2, and 
E-3 vs, C-3) by ¢ tests showed signif- 
icant differences, in each case beyond 
the .01 level. (The respective #’s are: 
3.62, 3.91, 4.91, df = 28). 

Additional comparisons of extinction 
Tesponses of the experimental groups 
showed E-1 significantly different from 
E-2, t (28) = 2.55 p < -025, and statis- 
tically significant differences between 
E-1 and E-3 ¢ (28) = 8.03 ? < .0005, 
as well as between E-2 and E-3, t (28) 
= 5.35 p < .0005. 


Discussion 


The results of this experiment demon- 
strate a positive S* effect. Groups which 
received S* during extinction completed 
a significantly greater number of prob- 
lems during that period than groups not 
receiving it. Many Ss who did not 
receive tokens, in fact, ceased to perform 
early in the extinction period. Further- 
more, in the groups receiving S% no in- 
crease in errors (which were negligible 
throughout the experiment for all groups) 
Was apparent despite their higher level 
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of performance. That S" contributes 
significantly to this effect beyond any 
reinforcement value of the token itself 
can be observed by comparing C-2, a 
group which received only tokens during 
both phases of the experiment, with E-2 
in which the tokens had been paired with 
Pr, The latter group showed significantly 
greater resistance to extinction than the 
former. This is not to say that the tokens 
had no reinforcement value. On the con- 
trary, in one group (C-1) even though 
the tokens were never associated with Pr, 
an increase of the rate of token delivery 
in extinction over that of the initial phase 
resulted in a total number of extinction 
responses comparable to that of E-2 in 
which the tokens had been paired with 
Bt 

In agreement with Myers and Myers 
(1962), this experiment demonstrated 
that extinction may be prolonged by in- 
creasing the rate of St during the ex- 
tinction process. In comparing the three 
experimental groups, the group that had 
a decrease in Sr during extinction made 
significantly fewer mean extinction re- 
sponses than a group which had no 
change in S", and both groups were sur- 
passed by a group that received an in- 
crease in St during extinction. This find- 
ing is in opposition to the original dis- 
crimination hypothesis by Bitterman et al. 
(1953), but lends support to the modifica- 
tion of this hypothesis presented by Myers 
and Myers (1963). Their modified hypo- 
thesis proposes that it is not simply the 
discriminable differences between acquis- 
tion and extinction, but the direction of 
these differences which affects the extinc- 
tion process. They indicate that incre- 
ments in S" will yield greater resistance 
to extinction than no change, and a de- 
crement in St will produce more rapid 
extinction than no change in S" from 
acquisition to extinction. Melching’s 
(1954) data with rats does not support 
the Myers and Myers’ (1962) modifica- 
tion. He found no differences in groups 
of rats which received either continuous 
St or no St in extinction. His explana- 
tion was that the change enabled the 
organism to distinguish extinction con- 


ditions from those of acquisition and the 
direction of the change (increment or de- 
crement) was not important. 

In the present experiment a change in 
the VI schedule from a mean of 60 sec. 
to one of 20 sec. approximates a change 
to continuous S* reinforcement on a FR 
schedule, as in Myers and Myers (1962) 
in that the average time Ss took to re- 
spond was 20 sec. Thus, the amount of 
increase from the S" schedule of 60 sec. 
in Group E-1 was similar to the amount 
of decrease in E-3 where the VI acquisi- 
tion schedule of 60 sec. was reduced to 
zero in extinction. In contrast to Mel- 
ching's (1954) data, a reduction in S" 
led to rapid extinction and the difference 
between the two groups was highly sig- 
nificant. The simple discrimination hypo- 
thesis was not upheld and the direction 
of the discriminable differences became 
the important factor in the effects of S 
on resistance to extinction. The interval 
reinforcement schedule used in the cur- 
rent experiment thus produced results 
which paralleled the ratio reinforcement 
schedule used by Myers and Myers 
(1962) and the effects noted in the per- 
formance of problem-solving tasks of this 
experiment suggest that these effects may 
be generalized beyond a simple bar-press- 
ing response. 

One final similarity between this ex- 
periment and that of Myers and Myers 
(1962) may be noted. In both experi- 
ments, a group whose rate of St under- 
went no change during extinction and a 
group which received a novel stimulus 
(tokens or buzzer) for the first time in 
extinction, without previous pairing of 
these stimuli with Pr, showed no signif- 
icant differences in their extinction per- 
formance. One possible explanation for 
this finding may be in considering the 
novel stimuli as providing a change in 
environmental stimulation. Marx, Hen- 
derson, and Roberts (1955) suggest a 
stimulus change or novelty factor in re- 
inforcement as an explanation of their 
results in a study of an operant response 
in rats. Deese (1958) also emphasizes 
the ability of new stimuli to affect re- 
sponse strength. More recently, Schultz 
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(1965) has provided the concept of 
"Sensoristasis," a drive for sensory varia- 
tion, affirming that even higher organisms 
will exhibit evidence of learning in situa- 
tions where the only known reinforcement 
is a change in stimuli, Schultz further 
states that these increases in the variabil- 
ity of stimuli may have reinforcing prop- 
erties for the learning of instrumental re- 
sponses. It would appear that this vari- 
able warrants further investigation with 
human Ss, 
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A group of 40 bilinguals learned 2 paired-associate lists, an English 
list and a Spanish list, and then relearned the 1st list. The transfer 
paradigm employed was A-B, A'-B'r where the items of the 2nd 
list are translations of the 1st but with the translated stimuli being re- 
paired with different translated responses. In contrast to expectations 
from previous research the retroactive inhibition (RI) obtained was 
related neither to the dominant language of S nor to the order of the 
lists learned. The RI obtained was considered in light of the hypo- 
thesis that nonreinforced evocation of the Ist-list responses during 


2nd-list learning is related to RI. 


Two recent studies have investigated 
transfer effects among bilinguals. In 
one (Young & Webber, in press), posi- 
tive and negative transfer paradigms 
were modified for use with bilinguals, 
and transfer was measured from Eng- 
lish to Spanish and vice versa using 
paired-associate learning tasks. Trans- 
fer was found in the predicted direction 
in the positive transfer paradigms while 
negative transfer was found in the 
Spanish-English but not in the English- 
Spanish paradigm. The Ss used in this 
study tended to be English dominant 
and an explanation of the results was 
developed such that negative transfer 
would be expected only when transfer 
Occurred from a list learned in the 
weaker or subordinate language to a 
list learned in the dominant language. 
The present experiment tests the gen- 
erality of this explanation by measuring 
retroactive inhibition (RI) using lang- 
uage dominance as a variable. Thus 
the English dominant bilingual should 
show RI from Spanish to English but 
not vice versa while the Spanish dom- 
inant bilingual should show the op- 
posite results. 


1The present research was performed 
under National Science Foundation Grant 
GB-3629 to David T. Hakes and the senior 
author. 


In the second study (Young & Sae- 
gert, 1966) a similar set of transfer 
paradigms was used with Spanish- 
English bilingual high-school students. 
In this case a serial learning task was 
employed and Ss tended to be balanced 
bilinguals, i.e., they rated themselves as 
English dominant but had spoken 
Spanish for a longer time. The results 
of this experiment were similar to those 
found previously in that positive trans- 
fer was found in the appropriate para- 
digms while the results of the negative 
transfer paradigms again presented 
some difficulties in interpretation. 
That is, consistent negative transfer 
occurred in the various negative trans- 
fer groups but the amount of transfer 
was not large enough to be statistically 
reliable. 

There is a general question as to the 
possibility of interlingual interference 
from associations formed in two differ- 
ing languages. The negative transfer 
obtained in the Young and Webber (in 
press) study was not consistent and 
the negative transfer obtained in the 
Young and Saegert (1966) study was 
consistent but unreliable. Hence there 
is no assurance that the transfer para- 
digms commonly used in verbal learn- 
ing studies will produce similar transfer 


109 


110 


effects when modified for use among 
bilinguals. For example, the para- 
digms used in the Young and Saegert 
study yield different transfer effects 
when pairs of adjective synonyms are 
used with English Speakers (Irion, 
1946). 

Another study suggests that cross- 
language interference would be mini- 
mal Postman, Keppel, and Stark 
(1965) investigated RI and found that 
the use of highly discriminable response 
classes reduced RI. In this study let- 
ters were used as responses in one list 
and adjectives were used as responses 
in the other. This procedure resulted 
in less RI than when adjectives or let- 
ters were used as responses in both 
lists, Studies of interlingual transfer 
would of necessity employ highly dis- 
criminable sets of responses (and stim- 
uli) with the apparent result that rela- 
tively small amounts of RT would be 
obtained, 

The present study was designed to 
determine if RI will occur following 
the learning of two lists in two different 
languages. The transfer paradigm em- 
ployed here is a modification of the 
A-B, A-Br paradigm in which the same 
items are used as stimuli and responses 
in both lists. Fach response is, how- 
ever, re-paired with a different stimulus 
from the first to the second list. This 
paradigm has been modified by using 
an A-B, A'-B/r paradigm in which the 
Second list consists of the translated 
first-list items but where the pairing of 
the translated second-list responses is 
inconsistent with the first-list pairings, 
Retention of the first list is measured 
after the learning of the second and 
RI is measured by comparison to an 
A-B, C'-D' control paradigm in which 
List 2 is composed of items unrelated 
to and in a different language from 
those of List 1. 
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Design—Each of the 40 Ss learned two 
paired-associate lists and then relearned the 
first list. Twenty Ss were English domi- 
nant while the other 20 Ss were Spanish 
dominant. Within each group, half of the 
Ss learned an English list and then a 
Spanish list while the second half learned 
a Spanish list and then an English list, 
Within each language order a common first 
list consisting of two sublists was learned, 
One of the two first-list sublists served as 
the Experimental list and formed with a 
sublist in the second list an A-B, A'-B'r 
paradigm of negative transfer, The second 
sublist in the first list served as the Control 
list and formed an A-B, C'-D' paradigm of 
zero transfer with the second sublist in the 
second list. Within each language order 
two different second lists were employed in 
the experiment to allow counterbalancing of 
the first-list Experimental and Control sub- 
lists, i.e., for half of the Ss-one sublist served 
as the Experimental list while for the second 
half it served as the Control list, 

Thus the experiment can be summarized 
as a 2' factorial design with three between- 
(language order, language dominance, and 
second list learned) and one within- S$ 
(first-list sublist) variables CN — 5 per cell). 

Procedure—Each of the 16-pair lists was 
composed of common nouns of low intralist 
similarity and was presented on a Lafayette 
memory drum. The Spanish first list was a 
direct translation of the English first list 
and the Spanish second list was a translation 
of the English second list. The lists were 
learned by the anticipation method at a 
2:2-sec. rate with a 4-sec. intertrial interval ; 
four orders of list presentation were used, 
The learning criterion for both learning and 
relearning was one errorless recitation. Ap- 
proximately 1 min. elapsed between the learn- 
ing of the first and second list in order to 
give additional instructions and to change 
tapes. A constant 22 min, elapsed between 
the end of first-list learning and the begin- 
ning of first-list relearning. The 22-min. 
interval would allow approximately 20 trials 
on List 2. Five Ss exceeded 20 trials in 
learning the second list but these Ss were 
not dropped from the experiment. The inter- 
val, if any, between the end of List-2 learn- 
ing and the start of List-l relearning was 
filled by a symbol cancellation task. 

The Ss were unpaid University of Texas 
volunteers. Of these, 15 were graduate 


— 


> 


RETROACTIVE INHIBITION IN BILINGUALS 111 


students in Romance Languages and Latin 
American Studies and the remainder were 
undergraduates with a variety of majors. 
The Ss were assigned randomly to subgroups 
within language dominance groups. Four Ss 
were dropped for failure to learn or to fol- 
low instructions and each was replaced by 
by the next S to appear in the laboratory. 

Each S was asked to rate his ability in 
English and Spanish on a 1-7 scale and to 
indicate the language in which he preferred 
to think. The Ss also indicated the num- 
ber of years Spanish and English had been 
spoken. Finally, Ss were asked to indicate 
on a 1-7 scale whether their languages had 
been learned in the same context or in two 
different contexts. 


RESULTS 


Characteristics of the samples.—The 
Ss were randomly assigned within 
dominance groups but were assigned 
to the dominance groups on the basis of 
their indicated preference for thinking 
in English or Spanish. However, as a 
check on the validity of this rating, the 
ratings of speaking ability in the two 
languages were analyzed. The mean 
ratings for the English Dominant group 
for English and Spanish ability were 
6.40 and 4.90, respectively, while for 
the Spanish Dominant group the rat- 
ings of English and Spanish ability 
were 4.35 and 6.70, respectively. The 
interaction between Language Domi- 
nance and Language Ability was found 
to equal F (1, 32) = 6145, p < .001, 
supporting the original basis for assign- 


ing Ss to groups. No other Fs were 
significant in the analysis. 

The ratings of years each language 
was spoken were analyzed and it was 
found that the English Dominant Ss 
had spoken English for 20 yr. and 
Spanish for 13.75 while the Spanish 
Dominant Ss had spoken English for 
9.50 years and Spanish for 20.30 yr. 
This Language Dominance X Language 
interaction, with F (1, 32) — 42.69, 
p < .001, was the only significant dif- 
ference found in the analysis of the 
years a language had been spoken. 

The two dominance groups differed 
considerably in the manner in which 
they had learned their two languages, 
The mean ratings (1 = both languages 
learned together, 7 = separately) for 
the context in which the languages 
were learned were equal to 3.80 for the 
English Dominant group and 5.80 for 
the Spanish Dominant group. This 
difference is significant, with F (1, 
32) = 6.80, p < .05. Thus with 15 of 
the 20 Spanish Dominant Ss rating 
themselves 7 (both languages learned 
separately) this group would appear to 
consist mostly of coordinate bilinguals 
(cf. Ervin & Osgood, 1965) while the 
English Dominant sample (with 5 of 
the 20 rating themselves 7) would be 
much more like compound bilinguals. 

First-list learning —Trials to learn 
List 1 were analyzed and no significant 


TABLE 1 
TRIALS TO LEARN THE SuBLISTS OF List 1 


Dominant Language 


Spanish English 
First-List 
TN E c E c 
M SD M SD M SD SD 
Spani. f 14.40 8.84 8.70 3.59 9.30 4.81 
English 10 $94 9.60 5.16 9.10 4.01 8.70 3.69 
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differences were found. The means of 
trials to learn the sublists of List 1 are 
presented in Table 1. The means for 
the Spanish Experimental and Control 
sublists for the Spanish Dominant 
&roup tend to be larger than the other 
groups. This difference is reflected in 
the near significance of the Dominance 
variable, with F (132) = 3.24, p < 
.10. The single significant difference 
found in the analysis was between the 
sublists: The easier of the two English 
sublists was, when translated, the more 
difficult of the two Spanish sublists, 
F (1, 32) = 10.02, p<.01. How- 
ever, since the List-1 sublists were 
counterbalanced, the Critical compari- 
sons are between Experimental and 
Control sublists and the interactions of 
this variable with other variables, The 
largest difference obtained from these 
comparisons was F = 1,28 which does 
not approach significance, Clearly, 
there are no differences between the 
Experimental and Control conditions 
within each group and with the possible 
exception of the Spanish Dominant 
group, the various groups themselves 
learned at equivalent rates, 
Second-list learning. —Trials to learn 
List 2 were analyzed and as in the 
analysis of trials to learn List 1 no sig- 
nificant differences were found. Each 
Second list also contained two eight-pair 
sublists one being an Experimental list 


and the other being a Control. In the 
analysis of trials to learn these sub- 
lists a single difference was found. In 
the English sublist the Experimental 
sublist was more difficult to learn than 
the Control sublist but when the two 
lists were translated into Spanish the 
opposite results were obtained, F (Gl; 
32) = 8.96, p<.01. In contrast to 
List-1 sublists, each List-2 sublist 
served either as an Experimental or as 
a Control sublist but not as both and 
the design of the experiment thus does 
not allow evaluation of List-2 transfer 
effects. However, the results again ap- 
pear to indicate that a list does not 
necessarily maintain its relative diffi- 
culty when translated. In addition the 
obtained difference might indicate that 
the Experimental sublists were more 
difficult to learn in English than in 
Spanish and as such would confound 
the comparison of retention between 
English and Spanish groups. For this 
reason the learning of the Experimental 
sublists of List 2 were analyzed alone. 
No difference approached significance 
in this analysis. Hence, in List 2 the 
Experimental sublists were of equi- 
valent difficulty and the total number 
of trials did not differ from one con- 
dition to another. 

Recall and relearning.—The number 
of items recalled on the first relearning 
trial and the trials to relearn List 1 


TABLE 2 
ITEMS RECALLED ON First RELEARNING TRIAL 


Dominant Language 


Spanish 
Language 


English 


Recalled 


Spanish 4.10 1.45 6.80 1.18 5.40 1.76 7.30 1.18 
English 4.90 2.23 6.80 1.87 5.30 2.10 7.10 1.11 
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TABLE 3 
TRIALS TO RELEARN First List 


| Dominant Language 


| Spanish | English 
Language | | 
Recalled 
E | Cc | E | © 
M SD M | | M | SD | M | SD 
Spanish 4.20 1.32 2.20 1.20 4.00 1.60 1.60 1.11 
English 3.90 1.26 1.60 .54 3.10 1.00 1.80 1.00 


were analyzed. The mean numbers of 
items recalled on the first trial are pre- 
sented in Table 2. The mean numbers 
of items recalled correctely on the first 


relearning trial for the Experimental . 


and Control sublists were 4.93 and 7, 
respectively. This was the only signif- 
icant difference found in the analysis, 
with F (1, 32) =44.16, p< .001. 
Differential recall effects were not 
found as a function of language domi- 
nance or language recalled. 

The analysis of trials to relearn the 
sublists of List 1 show essentially the 
same results as did the analysis of the 
first relearning trial. These relearning 
data are presented in Table 3 and again 
it can be seen that a significant dif- 
ference occurred between the relearning 
of the Experimental and Control sub- 
lists, with F (1, 32) = 73.69, p < .001. 
The mean numbers of trials to relearn 
the Experimental and Control lists 
were 3.80 and 1.80, respectively. 

It will be recalled that five Ss ex- 
ceeded the 20 trial limit imposed on 
List-2 learning and that their data were 
not dropped from the experiment. As 
a check, the data of those Ss learning 
within the 20 trial limit were analyzed 
separately and differences as large as 
originally obtained were found. The 
reanalyses of the data yielded substan- 
tially the same results as the original 
analyses and it can be concluded that 


the present results were not obtained 
because of a variable recall interval. 

Errors.—No intrusions from the im- 
mediately preceding list occurred dur- 
ing List-2 learning or during recall and 
relearning. Within a list all overt 
errors were in the same language as 
that of the list. A number of errors 
occurred which were translations of re- 
sponses previously associated to the 
translated stimuli of the prior list. 
Of these mediated errors, 32 occur- 
red during List-2 learning and 22 
during the relearning of List 1. How- 
ever, none of several tests indicated that 
the frequency of these errors was 
greater than would be expected by 
chance. 


Discussion 


The results of the present experiment 
are unequivocal. Forgetting in one lan- 
guage occurs as a function of associations 
formed in another language. In addition 
language dominance does not appear to 
be related to the amount of forgetting. 
Young and Webber (in press) have sug- 
gested that associations formed in 
the dominant language do not interfere 
with associations formed subsequently in 
the second language. In this respect the 
results of the present study do not sup- 
port the previous findings. 

At the end of List-1 learning no dif- 
ference existed between the Experimental 
and Control sublists. Yet following the 
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learning of List 2, retention differences 
of considerable magnitude appeared. 
Clearly, this difference was a function of 
learning the Experimental sublists in List 
2. Since a mixed-list design was em- 
ployed and each S served as his own con- 
trol, the practice control (the C'-D' sub- 
list of List 2) was not absolutely neces- 
Sary. The Control sublists of List 2 were 
not related to List 1 and they had no 
function other than to make Lists land2 
equally long. Thus differences in dif- 
ficulty in the Control sublists in List 2 
are relatively unimportant as long as the 
Experimental sublists are equivalent. 

An interference interpretation of the 
results of the present experiment would 
identify the source of interference as 
mediated ‘associations in a four-stage 
chain. That is, A-B is learned in the 
laboratory and A-A’ and B-B’ are the 
associations between translated items 
Which S brings into the experiment. 
When A-B, and then A’-B’r are learned, 
the A’-B’ mediated association interferes 
with the learning of A’-B’r and the 
mediation chain is A’-A-B-B’ where B’r 
rather than B’ is the correct List-2 
response. 

In incorrectly evoking the B^ response, 
the A-B association is unlearned or ex- 
tinguished (it is assumed that A-A’ 
associations are relatively impervious to 
unlearning in the laboratory, cf, Under- 
wood & Ekstrand, 1966) and later mea- 
surement reveals forgetting of List. 
associations. Unlearning and not un- 
availability of responses appears to be 
related to the observed retention loss. 
That is, the List-1 responses were present 
during List-2 learning in translated form 
and it does not seem likely that the for- 
getting found in the present study re- 
flects loss of appropriate responses, 
Rather, as in the McGovern (1964) 
study when the A-B, A-Br paradigm 
was employed, the loss represents an un- 
learning of A-B associations. Some sup- 
port for this assumption is found in the 
RI found in the trials to relearn analysis. 
During relearning all the B responses 
Were presented on every trial and should 
then have been readily available. 


ROBERT K. YOUNG AND M. ISABELLE NAVAR 


Not a single intrusion from List 1 was 
found during List-2 learning and intru- 
sions have been used as an index of the 
number of overt nonreinforced elicita- 
tions of List-1 responses during List-2 
learning (Keppel & Rauch, 1966). In 
another study using response classes 
which were highly discriminable between 
lists (Postman et al., 1965) no interlist 
intrusions were found. This was accom- 
panied by a decrease in the amount of RI 
observed when compared to a control 
using the same response class in both 
the first and second lists. It would ap- 
pear then that an interference theory of 
forgetting which identifies the source of 
unlearning of List-1 associations with the 
nonreinforced evocation of List-1 re- 
sponses during List-2 learning (Melton, 
1961) would predict a relatively small 
amount of RI in the present experiment. 
However, the amount of RI obtained is 
large and highly significant. 

It should be noted that two different 
transfer paradigms are under considera- 
tion. Postman et al, used an A-B, A-C 
paradigm while that used in the present 
experiment was an A-B, A'-B'r para- 
digm. In the present case the absence 
of List-1 intrusions during List-2 learn- 
ing does not necessarily imply that elicita- 
tion of List-1 associations did not occur. 
Elicitation of the erroneous A’-B’ associ- 
ation mediated by the A'-A-B-B' chain 
could result in the unlearning of the 
A-B associations of List 1 without the 
occurrence of List-1 intrusions. Hence 
the large amount of RI observed in the 
present experiment is not necessarily un- 
favorable to Melton’s hypothesis. 

An additional source of interference can 
be identified. Response competition dur- 
ing recall and relearning from the 
A-A’-B’r-Br mediation chain also may in 
part account for the retention loss, where 
A’-B’r was learned in List 2. 

It has been assumed in the previous dis- 
cussion that a bilingual S can translate 
from A to A’ and vice versa. Yet 
fluency in both languages does not neces- 
sarily imply the ability to translate from 
one language to another, As an alterna- 
tive, it could be argued that equivalent 
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words in different languages have the 
same or similar cortical representation 
(or trace). If associations are formed 
between traces then an analysis could 
be made which assumes mediated inter- 
ference via the associations between 
traces. In this case no assumption about 
S’s ability to translate from A to A’ 
would be necessary and yet an analysis 
similar to the one proposed above could 
be employed. 

In any event, the results of the present 
experiment indicate that RI was approxi- 
mately the same for English and Spanish 
dominant speakers regardless of the lang- 
uage of the list recalled. Furthermore, 
it has been demonstrated that the learn- 
ing of associations in one language is 
related to the forgetting of associations in 
a second language and on this basis alone 
it can be concluded that the two languages 
of a bilingual are interdependent and not 
independent. 
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This study tests the hypothesis that type of generalization (synonyms 
vs. homophones) is a function of set induced by instructions (semantic, 
phonic, or ambiguous) and of type of US (shock vs. buzzer) used in 


the original conditioning procedu: 


re. The GSR of 96 Ss was condi- 


tioned to a word (CS) under the appropriate experimental conditions 


was clearly demonstrated, but there was no effect of type of US. 
These results are related to previous studies and to the concept of 


The literature on semantic general- 
ization suggests that the type of uncon- 
ditioned stimulus (US) utilized in the 
original conditioning procedure is cri- 
tical to the type of generalization 
found: semantic or phonic. A test of 
this inference is made in the present 
study in the context of the concept of 
efficiency (Wishner, 1955; 1965), 
which emphasizes the value of analysis 
of behavior in terms of its appropriate- 
ness to task requirements. In the ex- 
periment reported here, instructions to 
S (attention directed to synonyms or 
homophones) were combined with type 
of US (shock or buzzer) as independ- 
ent variables for the purpose of assess- 
ing their effects, singly and jointly, on 
semantic generalization. 


1 This research was supported by the 
United States Public Health Service Grant 
M-08071 from the National Institute of 
Mental Health to the first author. Partial 
support was also received from Grant 
M-04211 to the second author. The study 
was done while the first author was at the 
Pennsylvania State University. The authors 
are indebted to F. W. Irwin for critical 
comments, 


A number of studies have examined 
the relationship of set to the generaliza- 
tion of verbal conditioning. Branca 
(1957) found that semantic generaliza- 
tion occurred only if S verbalized the 
expectation of a painful shock in con- 
nection with the generalization stimuli. 
In another study, expectancy of shock 
was manipulated by instructions and a 
close relationship was found between 
verbalized expectancy and changes in 
heart rate (Chatterjee & Eriksen, 
1962). 

Eisen (1954) reported the only 
study in which set, involving four cate- 
gories of generalization stimuli, was 
explicitly manipulated by instructions, 
He failed to obtain evidence either of 
generalization or of the influence of set. 
Two other studies, although not ma- 
nipulating set, used instructions to in- 
duce Ss to attend to the meaning of 
words. In one of these, Wylie (1940) 
obtained greater generalization to 
homophones than to synonyms. Peas- 
trel (1964), on the other hand, found 
greater generalization to synonyms 
than to homophones with normal Ss. 
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In all, then, cognitive factors appear 
to play an important role in generaliza- 
tion, but it is not clear that experi- 
mentally induced sets necessarily deter- 
mine type of generalization. 

Another problem which has received 
considerable attention concerns the 
relative importance of semantic and 
phonic factors in the generalization of 
verbal conditioning. If the studies 
bearing on this issue are classified ac- 
cording to the US used during condi- 
tioning, an interesting observation 
emerges: All the studies of normal 
adults reporting greater generalization 
to semantic than to phonic stimuli have 
used a relatively nonnoxious stimulus, 
such as a buzzer, as a US (Peastrel, 
1964; Razran, 1939; 1949; Riess, 
1940; 1946), whereas those studies 
reporting less semantic than phonic 
generalization or no semantic general- 
ization have used electric shock as a 
US (Eisen, 1954; Wylie, 1940). 
Luria and Vinogradova (1959) also 
reported phonic generalization only 
with shock in retarded children, 
although semantic generalization still 
predominated. Furthermore, there are 
a number of studies concerned only 
with the semantic category which have 
failed to demonstrate semantic general- 
ization or which have produced other- 
wise discrepant results. Although not 
all studies in which shock was the US 
have produced such discrepant results 
(cf. Feather, 1965), shock was the US 
in all of the studies with discrepant 
results (Branca, 1957; Chatterjee & 
Eriksen, 1962; Keller, 1943; Staats, 
Staats, & Crawford, 1962). It seems 
possible, then, that shock may have 
interfered with semantic generalization 
in these studies. 

The concept of psychological effi- 
ciency (Wishner, 1955) seemed to pro- 
vide a means of reconciling these ap- 
parently discrepant results. This ap- 


proach focuses on the appropriateness 
of behavior to task requirements. Pre- 
sumably all conditions which detract 
from S's ability to assume a task-cent- 
tered attitude impair his ability to be- 
have efficiently with respect to the task. 
Tf electric shock is assumed to be highly 
noxious, then its use as a US may have 
a distracting effect which conflicts with 
task-centeredness. Thus, the contra- 
dictory results reported above may be 
a function of the type of US utilized. 

In this study the procedure was to 
condition the GSR to a word, with 
either a buzzer or electric shock as US, 
and then test for generalization to a 
synonym and homophone of that word. 
The instructions were varied for three 
groups. One was given a semantic 
set: Ss were told to attend to similar- 
ities in meaning. The second group 
received a phonic set: Ss were told to 
attend to similarities in sound. The 
third group, used to evaluate the char- 
acter of spontaneous generalization, 
was given an ambiguous set: Ss were 
not oriented specifically either toward 
synonyms or homophones. 

Tt was hypothesized that in the ab- 
sence of disruptive stress, i.e., with the 
buzzer as US, normal adult Ss would 
behave in accordance with instructions. 
Hereinafter this will be referred to as 
the set hypothesis. In effect it pre- 
dicts an interaction between instruc- 
tions and category of generalization 
stimulus. 

On the other hand, it was anticipated 
that the stress resulting from the use 
of electric shock would interfere with 
Ss’ ability to attend to the task, thereby 
reducing the congruence of generaliza- 
tion with the instructions. This will 
be referred to as the stress hypothesis, 
and it predicts a second order inter- 
action between set, category of general- 
ization stimulus, and type of US, such 
that generalization will accord more 
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with instructions when a buzzer is the 
US than when shock is the US. 

The task requirements of the am- 
biguous set are not clearly defined. 
This condition could serve as a control 
for the semantic and phonic sets, or it 
might reveal a general tendency for 
Ss to choose one or the other mode of 
generalization. It should be kept in 
mind that ambiguous instructions do 
not necessarily result in the adoption 
of ambiguous sets by Ss. They may 
result in varying self-instructions by 
Ss, some adopting one and some 
another, but definite, set. Viewed asa 
group, the results may seem as ambigu- 
ous as if each S had adopted an 
ambiguous or no set. 


METHOD 


Subjects —The Ss were 96 male under- 
graduates from an introductory course in 
psychology. Their participation was a course 
requirement, All Ss, including alternates, 
were recruited and randomly assigned to the 
various experimental conditions prior to the 
experiment. The Ss were not informed that 
the experiment might entail electric shock 
until immediately prior to their actual in- 
volvement, 

Verbal stimuli—The stimuli were com- 
mon nouns, essentially the same as those of 
a previous study (Peastrel, 1964). "These 
were presented visually by means of a pro- 
jector and appeared white on a dark back- 
ground, The screen was about 7 ft. in front 
of S, and the height of the projected words 
was 3 in. 

The three sets of words used are presented 
in Table 1. It will be noted that the gen- 
eralization list included all of the words of 
the conditioning list, and in addition, the 
synonyms and homophones of TALE and 
CENT. The other words are presumably 
unrelated. 

All Ss were Presented with all of the 
words in Table 1. TALE was the conditioned 
stimulus (CS) for one half of the Ss and 
CENT was the CS for the other half. The 
generalization stimuli were the synonym (S) 
and homophone (H) of the CS. The syno- 
nym (s) and homophone (h) of the cent 
or TALE not receiving reinforcement are con- 
trols for novelty and for Specific reactions to 


TABLE 1 
SriMULUS Worps* 
Buffer Conditioning | Generalization 
List List List 

FLOOR TALE TALE 

TANK CENT STORY 

RANCH BENCH TAIL 

DOOR LAMP CENT 

HOUR MILE PENNY 

CHALK FIELD SCENT 

VINE BENCH 
LAMP 
MILE 
FIELD 


^ The word with the lowest frequency of usage in the 
table is CHALK. This word appears in printed matter 
13 times per million words (Thorndike & Lorge, 1944), 


synonyms and homophones unrelated to con- 
ditioning. 

Procedure.—The Ss were run individually. 
They were first given a very brief and vague 
description of the experiment. Those who 
were to be conditioned with shock were so 
informed at this time and given the op- 
portunity to leave if they did not wish to 
participate. Two Ss chose not to participate. 
The shock electrodes were attached to the 
right leg about 23 in. above the ankle. The 
electrodes, #4 in. in diameter (dimes), were 
applied with electrode jelly on the sides of 
the shin about 3 in, apart. The shock level 
was adjusted to as high a level as S would 
tolerate, which ranged from approximately 
2-8 ma. 

The GSR electrodes were both applied 
to the palm of the right hand. The elec- 
trodes were of the zinc-zinc sulfate paste 
type and were held in position by Plexiglas 
holders. 

All Ss were instructed to read the words 
aloud, to pay attention to the relationship of 
the buzzer (or shock) to the words, and to 
try to anticipate on what words the buzzer 
(or shock) would occur, Then, depending 
on prior assignment to an experimental con- 
dition, S was given one of three instruc- 
tional sets: a semantic set, a phonic set, or 
an ambiguous set. The semantic set was 
induced by instructing S to attend to and 
think of the meaning of the words—to look 
for words with similar meanings. Under 
the phonic set, S was asked to attend to the 
sound of the words—to look for words that 
sounded alike or rhymed. In the ambiguous 
Set, no reference was made to meaning or 
sound. 
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Headphones were then placed on all Ss. 
These were used to present a loud raucous 
buzzer to the buzzer Ss. In addition, along 
with the masking noise of a blower, the 
headphones served to muffle extraneous 
sounds. As a safeguard against rumors of 
shock influencing the buzzer Ss, all buzzer 
Ss were run before shock Ss. 

The buffer list was presented twice, fol- 
lowed without interruption by seven presenta- 
tions of the conditioning list. All presenta- 
tions of the CS were reinforced. The order 
of presentation of words in the buffer and 
conditioning lists was random within blocks. 
The generalization list was presented once, 
the CS not being reinforced on this occa- 
sion, The order of the generalization and 
control words was counterbalanced across 
Ss. There were four orders: HshS, sHSh, 
hSHs, and ShsH. It will be recalled that the 
s and h are included in the generalization list 
as controls particularly for novelty. TALE 
and cENT always preceded the test stimuli. 
Except for these restraints, the order of 
presentation of the generalization list was 
random. Following the test of generaliza- 
tion, one more conditioning block was pre- 
sented without reinforcing the CS. 

The length of presentation of the verbal 
stimuli was 6 sec. and the interstimulus inter- 
val, 3 sec. On reinforced presentations of 
the CS, the latency of US onset was 4 sec. 
and its duration 1 sec. A decrease in resist- 
ance was considered a response if its onset 
was from 1.25 to 7.50 sec. after the onset 
of the word. When multiple responses to a 
stimulus occurred, only the largest response 
was used in the analysis. Because of the 
extended CS-US interval, CRs could be ob- 
served on reinforced CSs. A response to the 
CS was considered a CR if it occurred before 
the unconditioned response and if its magni- 
tude was larger than four of the responses 
to the other five words in its block of con- 
ditioning words. The criterion of condition- 
ing was a CR on three or more consecutive 
presentations of the CS during conditioning. 

Before leaving, S was asked not to dis- 
cuss the experiment with other students. 

Apparatus—The GSR apparatus was a 
Model 22A Fels Dermohmeter. This in- 
strument could be operated on one of five 
sensitivities: 100, 200, 400, 1,000, and 2,000 
ohms per scale unit change (2 mm.). Only 
the first four were used in this study. The 
sensitivity was set for each S so as to permit 
the maximum discrimination of responses 
within the limits of the recording chart. 


TABLE 2 


DISTRIBUTION oF Ss ACCORDING To NuM- 
BER OF CRs ELICITED ON SEVEN 
PRESENTATIONS OF CS": 


Type of US 
Number of CRs 
Buzzer Shock 
45 7 4 
6 12 7 
7 29 37 


a First and ninth pasaon S 
bx® (2) = 3.104, .30 > p > 


The recorder was an Esterline-Angus 
Graphic Ammeter, Model AW. This pro- 
vided a permanent record of the GSR and 
of the occurrence of stimuli. 

The verbal stimuli were presented by an 
automatically timed filmstrip projector. 

A Foringer No. 1154AC shock apparatus 
provided a constant amperage ac shock. 

The buzzer was generated by a vibrator 
in series with a set of headphones. Intensity 
of the buzzer could be adjusted up to about 
70 db. by varying a resistor in series with 
the headphones and vibrator. 


RESULTS 


A number of comparisons bear upon 
the differential effect of the type of 
US on conditioning and responsivity. 
The Ss were presented with nine CSs 
in the course of the procedure, the last 
two without reinforcement. Table 2 
indicates the distribution of the number 
of CRs elicited on the second through 
eighth presentations of the CS. Al- 
though the eighth presentation of the 
CS was not reinforced, it is included 
here because it could not have been dis- 
criminated by S from the previous con- 
ditioning trials until after its termina- 
tion. Although the shock condition 
produces a greater number of CRs, 
this trend is not significant, x? (2) = 
3.10, p > .20. On the last presentation 
of the CS, a CR was produced by 40 
shock Ss, as compared to 32 buzzer S's. 


2This test is based on the 48 shock and 
47 buzzer subjects. The response of one 
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Again the difference is not significant, 
X (1) 2297, 10 > p > .05. 

The magnitude of the CR to the 
eighth CS is of interest since it occur- 
red immediately prior to the generaliza- 
tion and control words. Here the 
mean magnitude of response, 10 + 
log (AC + 10-7), for the buzzer Ss 
was 3.666 and for the shock Ss, 3.841, 
Mann-Whitney U Test, z= 3421, 
p «.001. Although this difference 
may reflect stronger conditioning with 
Shock, it might also have been the 
result of increased sensitization to the 
experimental situation resulting from 
differences in instruction and specif- 
ically, the anticipation of shock. To 
evaluate this possibility, the median 
magnitude of response to the last seven 
buffer words was calculated for each S 
The mean median responses were 3,323 
and 3.434 for the buzzer and shock 
groups, respectively, Mann-Whitney 
U Test, 2=2,042, 05> p> .01. 
Thus the increased responsivity of the 
shock group, contributing perhaps to 
the difference observed to the eighth 
CS, probably anteceded conditioning. 

The mean magnitudes of response to 
the generalization and control words 
are shown in Fig. 1. It can be seen 


O uS- BUZZER — Ex woRos 
9 US-SsHocK | —----— CONTROL WORO 
BN G—p, 
I Pd 
Hz 
I2 C 
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Pp e-a 
i 
: Semantic Ambiguous 
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Eia 
B D dnce 


Fic. 1. The mean magnitude of response 
to the experimental and control synonyms 
(S) and homophones (H) on the general- 
ization trial. 


buzzer subject to this last CS was lost be- 
cause of bodily movement. 


TABLE 3 


SUMMARY OF ANALYSIS OF VARIANCE OF THE 
GENERALIZATION WORDS 


Source df | MS F 
Between Ss 
Instructions (I) 2 |.461 | 1.95 
Type of US (US) 1.109 
Order (O) 3 |.462| 1.95 
IXUS 2]|.134 
IXO 6 | .086 
US x O 3 | .034 
IxUS xO 6 |.374 | 1.58 
Error (between) 72|.237 
Within .Ss 
Synonym-Homophone 
Categories (S-H) 1 |.449 | 6.24* 
-H Xx I 2 |.682 | 9.47*** 
S-H X US 1 | .098 | 1.36 
S-H XO 3 | .394 | 5.47** 
S-H X I x US 2.024 
$-HXIXO 6 | .099 | 1.38 
S-H X US xO 3 |.046 
S-H XI X US x O 6 | 027 
Error (within) 72 | 072 
*p <.05. 
** b c .01 
95 <.001 


that both the buzzer and shock Ss 
responded according to instructions; 
i.e., those instructed to attend to mean- 
ing generalized more to the synonym 
and those instructed to attend to simi- 
larities in sound generalized more to 
the homophones. The ambiguous set 
produced intermediate results favor- 
ing the homophone, particularly under 
the shock condition. The results of 
an analysis of variance of the means 
of the generalization words appear in 
Table 3. The significant interaction 
between instructions and synonym- 
homophone category supports the set 
hypothesis.* 

The second hypothesis, the stress 
hypothesis, is not supported by the 
data. Figure 1 shows that the results 
with the shock closely parallel those 


? The mean basal skin resistance of the 
buzzer and shock subjects at the time of 
presentation of the first generalization or 
control word was 54,600 and 56,900 ohms, 
respectively, (Mann-Whitney U Test, z= 
-198, p> .50). 
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with the buzzer and the triple inter- 
action between instructions, type of 
US, and synonym-homophone category 
in Table 3 does not approach signif- 
icance. Furthermore, no effect involv- 
ing type of US is significant. There 
appears to be little justification, then, 
for distinguishing the shock and buzzer 
groups. 

The set hypothesis was stated spe- 
cifically for the semantic and phonic 
instructions, the ambiguous condition 
being included for exploratory pur- 
poses. The interaction between in- 
structions and synonym-homophone 
category was therefore evaluated with 
the ambiguous set excluded and found 
to be significant, F (1, 56) = 1771, 
b «.001. In addition, this same inter- 
action was significant for the semantic 
and ambiguous instructions alone, 
F (1, 56) = 6.64, .05 > p > .01, and 
for the phonic and ambiguous instruc- 
tions alone F (1, 56) 26.10, .05 > 
p> 0l. 

As to the specific nature of the effect 
of instructions on generalization, there 
appears to be an element of asymmetry 
with regard to synonyms and homo- 
phones. Figure 1 reveals that it was 
generalization to synonyms that 
changed over the three instructional 
sets, whereas the magnitude of response 
to homophones remained relatively 
constant. Statistical analysis bears 
this out: the differences in mean re- 
sponse to the synonym as instructions 
are varied is significant, F (2, 84) = 
6.33, p < .01, but not the differences 
in mean response to the homophone. 
In order to reduce the possibility that 
these results were specific to the log 
conductance change transformation, 
homologus analyses were made for re- 
sistance change, conductance change, 
and \/conductance change, with essen- 
tially identical results. 

Table 3 also shows a significant 
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interaction of  synonym-homophone 
category by order. This order effect 
consists of decreasing magnitude of re- 
sponse as the S and H appear later in 
order) The question may arise 
whether the interpretation of the signif- 
icant S-H X I interaction should be 
modified by the significant S-H x O 
interaction. The absence of a signif- 
icant S-H X IX O interaction indi- 
cates that the S-H X I interaction is 
unaffected by order. By similar rea- 
soning, it can be seen that the non- 
significant S-H x I X US interaction, 
the test of the second hypothesis, can- 
not be attributed to order effects, as is 
shown by the absence of a significant 
quadruple interaction. 

The means for the control words 
were subjected to an analysis of vari- 
ance identical to the one for the gen- 
eralization words as in Table 3. The 
results are not tabulated since only the 
interaction between order and the 
synonym-homophone category was sig- 
nificant, 

The analysis thus far has focused on 
differences in the mean responses to the 
generalization words but gives no evi- 
dence that generalization to these words 
occurred. Generalization to synonyms 
and homophones was assessed by 
means of two analyses of variance: 
S-s X USXIXO and H-h x US x 
I x O, respectively. In each case, the 
mean magnitude of response to the 
experimental word (S and H, respec- 
tively), was significantly greater than 
the response to the corresponding con- 
trol word (s and h, respectively), F (1, 
72) for S — s = 48.02, p< .001; F 
(1, 72) for H — h = 8838 p < .001. 
In both cases, significant order effects 
were found. Analyses of the simple 
effects revealed significant differences 
in favor of the experimental word for 


4 Detailed statistical analyses of the order 
effects are available from the first author. 
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TABLE 4 
DISTRIBUTION OF Ss ACCORDING TO LARGEST GENERALIZATION RESPONSE* 


A. Generalization Words 


Instructions 


S»H|H»S| = o 
US—Buzzer 
Semantic Set 12 4 0 0 
Phonic Set 4 11 1 0 
Ambiguous Set 7 8 0 1 
US—Shock 
Semantic Set 13 2 1 0 
Phonic Set 4 12 0 0 
Ambiguous 6 10 0 0 


B. Control Words 
N s»h|h»s = o N 
16 6 7 0 3 16 
16 6 8 0 2 16 
16 5 8 2 1 16 
16 5 8 0 3 16 
16 7 6 0 3 16 
16 9 5 2 0 16 


* The O column indicates the numbers of Ss who failed to respond to either the synonym or homophone. 


three of the four orders. The failure 
to obtain a significant difference be- 
tween the experimental and control 
words occurred in each case when the 
control word was the first word in the 
test list order: sHSh for the synonyms 
and hSHs for the homophones. 

With respect to the main hypotheses 
tested above by parametric techniques, 
alternate nonparametric analyses were 
performed. This involved classifying 
Ss according to whether the larger gen- 
eralization response was elicited by the 
synonym or homophone. The result- 
ing distributions for the generalization 
and control words are given in Table 4. 
Here again it is apparent (Table 4A) 
that instructional set is an important 
determinant of generalization. Under 
both the buzzer and the shock condi- 
tions, synonym generalization predom- 
inated with the semantic set, and homo- 
phone generalization predominated 
with the phonic set. Results with the 
ambiguous set were intermediate. It 
is also clear from Table 4A that type 
of US made little difference. 

Statistical evaluation of these data 
was accomplished by recasting Table 
4A into 2 X 2 tables. The three Ss re- 
sponding equally to the synonym and 
homophone, or responding to neither, 
were excluded. Also the distinction 
between type of US was discontinued 


and the two sections of Table 4A were 
combined. 

The chi-square for the interaction be- 
tween instructions and synonym-homo- 
phone generalization category for the 
semantic and phonic groups alone is 
highly significant, x? (1) = 16.03, 
p <.001. This interaction is also sig- 
nificant for just the semantic and am- 
biguous groups, x? (1) = 823, p < .01, 
but not for the phonic and ambiguous 
groups, x? (1) = 1.15, p>.20. The 
order effects involved in these analyses 
were examined, and it is clear that 
order made the test of significance in 
the crucial difference between the se- 
mantic and phonic sets conservative. 

A similar analysis of the control 
words in Table 4B revealed no signif- 
icant interactions.* 


Discussion 


The results of this study support the 
hypothesis that set is an important deter- 
minant of generalization and are con- 
sistent with other studies in this area 
(Branca, 1957; Chatterjee & Eriksen, 
1962; Razran, 1949). Regarding Eisen’s 
(1954) failure to demonstrate generaliza- 
tion or the influence of set, it may be 
speculated that the task requirements of 
his experimental situation were con- 
founded. The instructions pertaining to 
generalization were explicit (at least with 
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SET AND STRESS IN SEMANTIC GENERALIZATION 


one group), but they followed a condi- 
tioning procedure in which Ss had been 
systematically misled. 

The supposition that the stressfulness 
of shock detracts from the ability to re- 
spond according to instructions in this 
task appears to be unsupported. 

It is necessary to differentiate between 
two separate problems: the effects of dif- 
fusion of energy on performance (in 
this case, type of generalization) and the 
result, with respect to degree of diffusion, 
of any particular stimulus (in this case, 
shock). The former implicates the heart 
of the theoretical elaboration of the con- 
cept of efficiency: diffuse states should 
lead to decreases in efficiency of per- 
formance. The latter is a purely empiri- 
cal question in this framework, to be de- 
termined by trial and error. It is clear 
that shock did not result in a significant 
decrease in efficiency of generalization in 
this experiment. The critical question is 
whether it created a diffuse, non-task- 
centered, state. Unfortunately, adequate 
indexes of such a state are not available 
in this study. The maintenance of quali- 
tatively organized activity by Ss gen- 
erally would seem to imply that a nega- 
tive answer to this question is at least 
tenable. Thus, the hope that the type of 
US utilized alone would result in suf- 
ficiently different types of task orienta- 
tion to account for differences in pat- 
terns of generalization may not have been 
fulfilled. 

The reliable induction of a diffuse state 
sufficient to disrupt simple performances, 
particularly as reflected in autonomic re- 
sponses, does not appear to be a simple 
task. It is not likely to result from sub- 
jection to shock or threat of shock. It 
is even possible that threats of pain of 
this sort within the confines of a presum- 
ably ethical experimental laboratory serve 
to aid in the mobilization of one's re- 
sources so as to enhance attention to 
instructions and subsequent performance 
if the task requirements are not too 
subtle. Instructions emphasizing pejora- 
tive, if vague, social judgments as penal- 
ties for poor performance, with no feed- 
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back as to the state of performance, seem 
to have had more profound effects (cf. 
Wishner, 1962; 1965). Similarly, na- 
tural states of diffusion, as reflected in 
states of psychopathology, have been 
shown to affect patterns of generalization 
(Peastrel, 1964). 

A final comment is in order on the ob- 
servation that set seems to have had its 
greatest effect on generalization to syno- 
nyms, generalization to homophones re- 
maining relatively constant across sets. 
There are obvious physical similarities 
between homophones—in sound and spell- 
ing. Generalization to synonyms would 
appear to require cognitive processing be- 
yond physical characteristics. The 
strength of responses to homophones may 
therefore be conceived as analogous to a 
base rate for amount of primary stimulus 
generalization in this type of task. From 
this point of view, the semantic set may 
have required some inhibition of the 
natural response to the homophone as 
well as enhancement of the response to 
the synonym of the CS. If so, the data 
indicate tendencies for the buzzer group 
to be more successful than the shock 
group in such a dual task. As can be 
seen in Fig. 1, the shock group general- 
ized more to the homophone than to the 
synonym under the ambiguous set and 
tended to be less successful in inhibiting 
responses to the homophone under the 
semantic set. Both groups showed the 
same pattern of generalization under the 
phonic set. Thus, whatever differential 
trends did appear in the data seem to indi- 
cate some diffusing effects of shock. 
Stronger effects may appear if the types 
of generalization stimuli are equalized for 
base rates in generalization tendencies. 
Probably, this should be done at a higher 
cognitive level than that represented by 
generalization to homophones. Perhaps an 
appropriate variation would involve re- 
quiring generalization to different types of 
nonhomophonic associates of the CS. On 
the whole, however, exploration of surer 
means of induction of diffuse states would 
appear to be a primary task for future re- 
search on the effect of such states on type 
of generalization. 
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PERCEPTUAL LEARNING 


IN THE DISCRIMINATION 


PROCESSES OF CHILDREN: 


AN ANALYSIS OF FIVE VARIABLES IN 
PERCEPTUAL PRETRAINING * 


THOMAS J. TIGHE anp LOUISE S. TIGHE 
Dartmouth College 


In 2 factorially designed experiments 408 lst-grade children learned 
a 2-dimensional concept task and a discrimination reversal following 
pretraining treatments designed to determine the roles of 5 basic condi- 
tions affecting S's exposure to the dimensions of the transfer tasks. 
Within a nonassociative, nonreinforced pretraining procedure, Experi- 
ment I varied manner of stimulus presentation (simultaneous vs. 
successive), type of response (same-different judgments vs. obser- 
vation), and number of stimulus values per dimension (2 vs. 4), Re- 
versal learning was significantly facilitated when Ss were required to 
render judgments to pretraining stimuli varying in 4 values per dimen- 


sion, regardless of presentation procedure. Other pretraining treat- 
ments had no effect on reversal learning. Experiment II varied amount 
of pretraining (4, 8, or 12 experiences per stimulus) and presence 
vs. absence of reinforcement (information about correctness of judg- 
ment) within the pretraining conditions found to yield significant 
transfer by Experiment I. Reversal learning was significantly and 


equally facilitated following 8 and 


12 experiences per stimulus, but not 


following 4. Transfer did not vary with presence vs. absence of rein- 
forcement. Pretaining did not affect performance in the initial task 


in either experiment. 


Gibson and Gibson (1955) have 
argued that the critical change in dis- 
crimination learning is perceptual 
learning, defined as an increase in sen- 
sitivity to existing but initially unde- 
tected or poorly detected variables of 
stimulation. The process whereby per- 
ceptual learning occurs is assumed to 
be one of differentiation, i.e., of coming 
to see more of what is present in 
stimulation, and distinguishing from 
the association learning which often 
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generous cooperation of Gordon Tate, Super- 
intendant of the Lebanon, New Hampshire 
school system, and John A. Freitas, Super- 
intendant of the Hartford, Vermont, school 
system, and for the assistance and thought- 
fulness of the principals and teachers of these 
school systems. 


occurs concurrently in the discrimina- 
tion experiment (e.g. the association 
of verbal labels to discriminated stim- 
uli). The role of perceptual learning 
in discrimination has been studied by 
means of the “predifferentiation experi- 
ment” in which transfer in discrimina- 
tion is measured following some form 
of pretraining with the task stimuli. 
The pretraining procedures most rele- 
vant to the assmuptions of differentia- 
tion theory are those which are primar- 
ily perceptual in nature, i.e. training 
which is nonassociative and non- 
reinforced and which requires only 
some form of perceptual response (e.g. 
same-different judgments) to the to-be- 
discriminated stimuli. It is assumed 
that such pretraining provides an op- 
portunity for further differentiation 
of the task stimuli, and that facilitation 
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of discrimination following such treat- 
ment is due to perceptual learning. 
Although it is now possible to specify 
pretraining conditions of a perceptual 
nature which will facilitate discrimina- 
tion performance in adults (Tighe & 
Tighe, 1966), relatively little work has 
been done on the effects of perceptual 
pretraining in children. However, a 
recent study (Tighe, 1965) has pro- 
duced evidence which suggests that per- 
ceptual learning plays a critical role in 
the discrimination processes of the 
young child. Specifically, it was 
demonstrated that young children 
would solve discrimination shift prob- 
lems in a manner which had hitherto 
been characteristic only of more mature 
Ss if the children were given a brief 
period of pretraining designed to em- 
phasize the independence and dimen- 
Sional nature of the task stimuli. The 
present experiments pursue the study 
of perceptual pretraining in relation to 
the performance of children in a simple 
two-dimensional concept task and in a 
discrimination reversal. Employing a 
factorial design, Exp. I investigates 
three conditions of pretraining—man- 
ner of stimulus presentation, type of 
response, and number of stimulus 
values per dimension, These transfer 
tasks and pretraining conditions were 
selected as being particularly suited to 
the study of the perceptual learning and 
developmental processes assumed under 
the differentiation theory of discrim- 
ination (Tighe & Tighe, 1966). 


EXPERIMENT I 
Method 


Subjects—The Ss were 264 first-grade 
children, average age 6 yr. 7 mo., from the 
Lebanon, New Hampshire school district. 
Each $ was run individually in a small room 
adjacent to his classroom. 

Pretraining brocedure and apparatus —The 
stimuli presented in Pretraining were selected 
from 16 1§-in. diameter wood cylinders which 


varied in height (Ht) and brightness (Br). 
Brightness was varied by covering the cylin- 
ders with black (B) or with white (W) 
enamel paint or with one of the following 
enamel mixtures—dark gray (DG—1 part W 
to 4 parts B) or light gray (LG—1 part B 
to 124 parts W). The Ht values were 
44 in, 51 in, 6j in, and 74 in. The 16 
stimuli represented the Possible combinations 
of these Ht and Br values, In order to aid 
the standardization of Presentation conditions, 
each stimulus was placed in a 3-in. deep X 
li-in. diameter hole cut in a 3i-in. wide x 
l-in. high plywood base or "stand" which 
was placed about 15 in. from S who was 
seated opposite E at a small table. 

Within a 2x2x2 factorial design, sub- 
groups of 24 Ss each received pretraining 
with these stimulus objects under combina- 
tions of the following conditions : (a) simul- 
taneous presentation vs. Successive presenta- 
tion of the stimuli, (b) an observation 
condition in which S$ simply viewed each 
stimulus vs. a specific response condition in 
which S' was required to make a same-differ- 
ent judgment to each stimulus, and (c) rep- 
resentation of each dimension by four differ- 
ent stimulus values vs. representation by only 
two stimulus values. In Successive presenta- 
tion the stimuli were placed into the plywood 
stand one at a time, and between presenta- 
tions all stimuli were hidden from S's view 
behind a nearby masonite screen. The total 
Presentation was divided into eight subsets 
Or series as described below, In simultane- 
Ous presentation, the stimuli which consti- 
tuted a series were presented at the same 
time in a line perpendicular to S's line of 
vision and E pointed to each stimulus S' was 
to judge or observe, The masonite screen 
Was used to cover the stimuli while E 
arranged each presentation. 

When Ss were required to judge each 
stimulus, the first stimulus presented or 
Pointed out within a series constituted a 
standard (St) stimulus. The S was told, 
"Now let's pretend that this object is yours. 
Look at it carefully. I am going to show 
you [point out] some other objects. You 
tell me which of the other objects that you 
will see are exactly like yours.” Under 
successive Presentation, as each succeeding 
comparison (Co) stimulus was placed in 
the stand E said, "Is this object exactly 
like yours?" and the stimulus remained until 
S had rendered a judgment of “same” or 
"different" Under Simultaneous presenta- 
tion, E continued to point to each stimulus 
S was to judge until S$ had made his re- 
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sponse. When an St was presented again 
to begin a new series, the first sentence of 
the instructions was changed to, “Here is 
your object again.” When a new St was 
introduced the first sentence of the instruc- 
tions was, *Now let's pretend that this ob- 
ject and only this one is yours" In the 
observation condition Ss were exposed to the 
identical stimuli and presentation sequences, 
but were instructed only to look at each 
object. As E presented or pointed out the 
first stimulus he said, “Look carefully at this 
object. I am going to show you [point 
out] some other objects, and I want you to 
look at each of them.” Then as E presented 
or pointed to each of the remaining stimuli 
which S experienced, E said, “Look at this 
object" The Ss viewed each stimulus but 
made no other specific response. 

The following stimuli were used as the 
standards for Ss who were required to judge 
the pretraining stimuli: a B cylinder 54 in. 
high, a B cylinder 6% in. high, a W cylinder 
51 in. high, and a W cylinder 6% in. high. 
The Ss who rendered judgments underwent 
two series of Co stimulus presentations with 
each of the four Sts. However, the length 
of the series depended upon whether four 
stimulus values or two stimulus values were 
used to represent each stimulus dimension. 
In training conditions utilizing four different 
stimulus values per dimension each St was 
accompanied by a series of 16 Co stimuli 
which represented the possible combinations 
of the four Ht and four Br values previously 
described. When only two stimulus values 
were used per dimension, E presented only 
B or W and 5ł-in. or 6ł-in. high cylinders. 
Under this condition each St was accom- 
panied by a series of four Co stimuli which 
represented the possible combinations of these 
two Ht and two Br values. In addition, in 
both the two-value and four-value training 
conditions, the number of stimuli which were 
identical to the St varied from one to three 
within any presentation series. This was 
done in order to better maintain S’s interest 
throughout his judgment series. The Ss 
who simply observed the pretraining stimuli 
were exposed to the same stimuli and pre- 
sentation sequences just described, but for 
these Ss the stimuli had no standard or com- 
parison function. 

It should be pointed out that the general 
features of pretraining were such that all 
pretrained Ss, regardless of specific training 
condition, received a brief but intensive ex- 
perience with the fact that the objects may 
remain constant in an attribute (which may 
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assume other values) while undergoing 
variations along another dimension, and they 
had an opportunity to discriminate the ob- 
jects on the basis of one distinguishing fea- 
ture independently of the other. 

The possible 24 orders of the four Sts were 
randomly distributed among all Ss within 
the restriction that for half of the Ss the 
last St presented would be identical to one 
of the positive stimuli in the later discrimina- 
tion and for the other half the last St would 
be identical to one of the negative stimuli. 
None of the judgments or observations in 
any pretraining condition were corrected or 
reinforced and all were based solely on vi- 
sion. The number of experiences with each 
“comparison” and “standard” stimulus was 
the same in all conditions of the experiment 
—eight experiences per stimulus; and the 
total time of stimulus exposure was equated 
across the manner of presentation and type 
of response variations. To achieve the 
latter control, E adjusted the rate at which 
he pointed to the stimuli under the simultane- 
ous presentation and observation conditions 
so as to match the rate of stimulus exposure 
under the successive presentation and judg- 
ment conditions, respectively. The Ss trained 
with four stimulus values per dimension re- 
quired approximately 18 min. to complete 
pretraining while Ss trained with two stimuli 
per dimension required about 7 min. 

In order to make clear to S the meaning of 
the instructions he was to receive in pre- 
training, Ss in the judgment conditions were 
first given about 4 min. practice in making 
corrected same-different judgments on cookie 
cutters varying in shape and hue. The Ss 
in the observation conditions were given 
equal exposure to these stimuli, but did not 
make differential response to them. 

Control task.—Forty-eight additional Ss 
(Group C-1) were given tasks unrelated to 
the subsequent discrimination tasks and de- 
signed to control for nonspecific transfer 
effects. The tasks were modifications of the 
Picture Completion and Picture Arrangement 
subtests of the Wechsler Intelligence Scale 
for Children. The modifications consisted 
of (a) presenting all 20 pictures in the com- 
pletion test to each S and (b) requiring each 
S to tell a story about each picture in the 
arrangement test. None of the S's responses 
in either task were corrected or reinforced. 
Half of the Ss (C-la) were trained for a 
period of 7 min. and half (C-Ib) were trained 
for 18 min., corresponding to the two periods 
of pretraining with the size and brightness 
stimuli. 
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equal that of the Corresponding four-stimulus 
value per dimension condition. 
Discrimination tasks—The transfer tasks 


two problems: 
crimination, 
crimination, 
the Sts used and were pre- 
sented on a device in the 
following two pairs : the tall (T) white (W) 
cylinder vs. the short (S) black (B) 
cylinder, and the TB cylinder ys, the SW 
cylinder, The $ made his choice by picking 
each trial and, if 
under it. The $ 


Was presented imme- 
to the same criterion 
without change of instructions, 


was allowed to choose a 
which included 
charms, bubble gum, whistles, Hershey bars, 
raisins, jack sets, Tootsie rolls, Pencils, pen- 
cil sharpeners, M & Ms, and flutes, 


Results 


Although both trials and errors were 
recorded only the former measure will 
be reported here since analyses of 
errors provided statements identical to 
those which result from analyses of 
trials. 

Initial discrimination Factorial 
analysis of variance of trials to criterion 
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TABLE 1 


MEAN NuMBER oF TRIALS TO CRITERION IN 
INITIAL DISCRIMINATION AND REVERSAL 


Tättial 
ini "m is- | Re- 
Training Condition crimi- | versal 
nation 
Judgment Successive-Four Values 27.96 | 5.17 
Judgment Simultaneous-Four Values 20.21 | 5.63 
Observation Successive-Four Values 28.29 | 18.96 
Judgment Successive-Two Values 28.88 | 20.21 
Observation Successive-Two Values 23.42 |22.63 
Observation Simultaneous-Two Values | 30.50 | 23.42 
Judgment Simultaneous-Two Values 26.63 |23.79 
Control Ib 33.25 | 25.41 
Observation Simultaneous-Four Values | 24.96 | 26:00 
Control 1a 22.96 | 26.91 
Control 2 24.17 | 33.75 


under the three pretraining factors 
yielded F values which were well 
within the range of chance expectation, 
Table 1 includes the mean trials to cri- 
terion in the eight major treatment 
groups and the three control conditions, 
One-way analysis of variance for these 


11 means yielded F < 1. 
were no main or 


Thus, there 
interactive effects of 


the three pretraining variables, and pre- 
trained and nonpretrained Ss did not 


differ in speed 
discrimination. 


of learning the initial 


Reversal task.—Speed of reversal 
learning varied significantly with type 


of pretraining. Factorial 


analysis of 


V X + .5 transformed trials to criterion 
(see Table 2) produced highly signif- 
icant main and interactive effects for 
both number of stimulus values per di- 


TABLE 2 


ANALYSIS or VARIANCE OF TRANSFORMED 
TO CRITERION IN REVERSAL 


Source | df E 
| 

Type of Response (R) 1 | 16.50* 
Number of Stimul.is 

Values (S) 1/|14.21* 
Manner of Presentation ( P) 1| 1.54 
RXS 1 | 15.08* 
RXP 1 
SXP 1 
RXS XP- 1| 1.65 
Within Groups (MS) 184 | (3.548) 

*p «.001. 
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mension and type of pretraining re- 
sponse. The nature of these effects is 
seen clearly in Table 1 which presents 
the mean trials to criterion for each 
experimental and control treatment 
ranked in order of speed of reversal 
learning. Low reversal scores are 
found only under the first two train- 
ing conditions listed, both of which re- 
quired Ss to make judgments of stimuli 
varying in four values per dimension. 
Reversal scores under the remaining 
conditions are much higher and differ 
relatively little from one another. Em- 
ploying Duncan's multiple comparison 
technique, groups ranked No. 1 and 2 
in Table 1 do not differ, p > .10, but 
both differ significantly from each of 
the remaining experimental treatments 
and Cond. C-la, C-lb (groups ranked 
No. 3-10), p < .005 for each compari- 
son. There are no significant differ- 
ences among groups ranked No. 3-10, 
22.10. Thus, reversal learning was 
facilitated only when training combined 
the conditions of judgment and four 
stimulus values per dimension. 

The Ss in Group C-2, who were run 
to assess the contribution of amount of 
practice to comparisons of two vs. four 
stimulus values per dimension, were 
found to reverse more slowly than Ss 
trained under the judgment-succes- 
sive-two-value procedure, £ (46) = 
2.68, p < .02. The Ss in C-2 received 
identical training except that the num- 
ber of trials was increased to equal that 
which obtained when four values ap- 
peared on each dimension. This com- 
parison indicates that the facilitating 
effect of four values as compared to 
two values per dimension cannot be 
attributed to the greater amount of 
practice in the former condition. 

To summarize the major findings, 
then, pretraining did not affect original 
learning, but Ss who were required to 
make  same-different judgments to 


stimulus objects varying in four values 
per dimension were significantly facil- 
itated in learning a reversal with these 
objects, and this was true regardless of 
the presentation procedure employed. 
Other pretraining treatments did not 
yield significant transfer effects. 


EXPERIMENT II 


This experiment studied the effect of 
variations in amount of pretraining and 
the effect of presence vs. absence of 
reinforcement, defined as information 
about correctness of judgment. These 
variables were manipulated within the 
pretraining conditions which were 
found to yield significant facilitation in 
Exp. I, ie., Ss were required to make 
same-different judgments to the pre- 
training stimuli and four stimulus 
values appeared on each dimension dur- 
ing pretraining. In view of the failure 
to find a differential effect of simultane- 
ous vs. successive stimulus presentation 
in Exp. I, the latter procedure was 
utilized as being administratively more 
advantageous. 


Method 


Subjects—The Ss were 144 first-grade 
children, average age 6 yr. 6 mo., from the 
White River Junction, Vermont, school dis- 
trict. 

Procedure.—The apparatus and stimuli 
were those used in the pretraining and trans- 
fer tasks of Exp. I. The general training 
procedure, instructions, and stimuli employed 
were identical to those used in the “judg- 
ment-successive — four-values-per-dimension” 
condition of Exp. I. In a 3 X 2 factorial de- 
sign, Ss received either 4, 8, or 12 experi- 
ences per comparison stimulus in pretraining 
and were either informed or were not in- 
formed of the correctness of their judgments. 
The Ss who received 4 experiences per stim- 
ulus in pretraining underwent one series of 
judgments with each of the four standard 
(St) stimuli, a series being defined as the 
presentation of the St and the 16 comparison 
stimuli plus two interspersed reappearances 
of the St to maintain S's interest. The Ss 
who received 8 experiences per stimulus 


130 


underwent two series of judgments with 
each St, and Ss receiving 12 experiences per 
stimulus underwent three series of judgments 
with each St. The intermediate amount of 
training was employed in Exp. I, and thus 
the three amounts of pretraining include de- 
grees of training equal to, less, and greater 
than an amount previously observed to yield 
significant transfer in discrimination. The 
Ss who were informed as to the correctness 
of their judgments were told "Right" or 
"Wrong" by E immediately after each judg- 
ment, but received no other information. 
The remaining Ss did not receive any form 
of feedback following their pretraining judg- 
ments, ie, these Ss were trained in the 
manner of Ss in Exp. I. 

At the end of pretraining all Ss underwent 
the transfer tasks according to the procedure 
described in Exp. I. 

Two control groups were run. Group C-1 
(16 Ss) was trained for 18 min, in the same 
manner as Ss in Group C-l, Exp. I. 
Eighteen minutes was selected as the length 
of training since this represents the average 
duration of training received by experimental 
Ss and since no difference was found be- 
tween the effect of 7-min. and 18-min. periods 
of similar training in Exp. I. Group C-2 
(32 Ss) received pretraining identical in 
procedure to that undergone by the experi- 
mental Ss but with dimensions other than 
those of size and brightness. The C-2 Ss 
judged objects which approximated the size 
of those used with the experimental Ss, but 
which differed in shape (spheres, pyramids, 
cones, and rectangular blocks) and in hue 
(red, yellow, blue, and green). One half 
of these Ss (C-2a) received correction for 
their judgments, while the other half did not 
(C-2b). All C-2 Ss received eight experi- 
ences per stimulus during training, ie, 
underwent two series of judgments with each 
of four Sts. This figure was selected since 
this amount of training had yielded signif- 
"M transfer among pretrained Ss of Exp. 


Results 


As in Exp. I, only the trials to cri- 
terion will be discussed since analyses 
of errors yielded the same conclusions. 

Initial discrimination —A 3 x 2 fac- 
torial analysis of trials to criterion 
under the two pretraining factors 
yielded F values less than 1. Table 3 
includes the mean trials to criterion in 


THOMAS J. TIGHE AND LOUISE S. TIGHE 


TABLE 3 


MEAN NUMBER oF TRIALS TO CRITERION IN 
INITIAL DISCRIMINATION AND REVERSAL 


Initial 
Training Condition Discrim-| Reversal 

ination 
Noninformed-8 experiences | 24.06 | 3.81 
Informed-8 experiences 22.88 | 6.06 
Noninformed-12 experiences | 18.63 | 6.25 
Informed-12 experiences 22.06 | 7.00 
Informed-4 experiences 25.81 | 19.94 
Control-2b 13.81 | 20.94 
Noninformed-4'experiences | 25.38 | 22.94 
Control-2a 24.25 | 25.13 
Control-1 19.69 | 26.45 

| 


the six experimental groups and three 
control treatment groups. One-way 
analysis of variance for these nine 
means yielded an F of 1. Thus, there 
were no main or interactive effects of 
the two pretraining variables, and pre- 
trained and nonpretrained Ss did not 
differ in speed of learning the initial 
discrimination, 

Reversal task.—Speed of reversal 
learning varied significantly with dif- 
ferences in amount of pretraining but 
not with presence vs. absence of in- 
formation following judgments (see 
factorial analysis of VX +.5 trans- 
formed trials to criterion in Table 4). 
Table 3 presents the mean number of 
trials to criterion in reversal for each 
of the training conditions ranked in 
order of speed of learning. Compari- 
sons among the six experimental treat- 
ments and Group C-1 were accom- 
plished by Duncan's range test. There 
were no significant differences among 


TABLE 4 


ANALYSIS OF VARIANCE OF TRANSFORMED 
TRIALS TO CRITERION IN REVERSAL 


Source df F 
Information Condition (1) 1 
Amount of Training (T) 2 19.41* 
xT 2 - 
Within Groups (MS) 90 (2,27) 


*p <.001. 
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groups ranked No. 1-4 in Table 3, 
p > .10, nor among groups ranked No. 
5, 7, and 9, p>.10. But each of the 
first four ranked groups differs from 
each of the latter three groups, f < 
.005 for all comparisons. 

The Ss in Group C-2a reversed more 
slowly than Ss in the comparable ex- 
perimental treatment (Informed-8 ex- 
periences), t (30) = 3.36, p < .005; 
and similarly Group C-2b learned more 
slowly than the comparable experi- 
mental group (Noninformed-eight ex- 
periences), ¢ (30) = 4.05, p « 001. 
Neither C-2a nor C-2b differed in re- 
versal performance as compared to C-1, 
t (30) — .12 and .55. 

These results, then, sharply define 
the relation between amount of percep- 
tual pretraining and facilitation of re- 
versallearning. Perceptual pretraining 
with only 4 experiences per stimulus 
had no effect on later learning, while 
pretraining with 8 or 12 experiences 
significantly and equally facilitated 
reversallearning. Informing S of the 
veridicality of his judgments had no 
effect on learning at any level of pre- 
training. Finaly, C-2 Ss, who re- 
ceived all of the pretraining experience 
of experimental Ss except for percep- 
tual experience with the dimensions of 
the transfer task, were not facilitated 
in their performance as were the ex- 
perimental Ss or as compared to C-1 
Ss. 


DISCUSSION 


Although the impetus to these experi- 
ments was provided by differentiation 
theory, the data are also directly relevant 
to theories dealing with reversal learn- 
ing—notably, selective attention theory 
(Sutherland, 1959; Zeaman & House, 
1963) and representational mediation 
theory (Kendler & Kendler, 1962). 
Basic to the former viewpoint is the 
assumption that discrimination learning 
is accompanied by the acquisition of di- 
mension-specific mediating responses of 
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an observing response nature. The more 
fully such attentional processes are de- 
veloped to the relevant dimension within 
a discrimination the more likely it is 
that S will continue to respond to the 
relevant cues after a reversal shift and 
consequently the faster will be the rever- 
sal learning. Kendler and  Kendler 
(1962) assume that rapid reversal occurs 
when organisms attach mediating re- 
sponses of a representational nature to the 
relevant dimension, as opposed to dis- 
criminating on the basis of the absolute 
properties of the stimulus objects. The 
representational sequence established in a 
discrimination remains appropriate to the 
demands of a reversal shift and thus in- 
creases the likelihood that response will 
be made to attributes of the relevant fea- 
ture. The differentiation analysis of re- 
versal learning (Tighe & Tighe, 1966) 
relates ease of reversal in two-dimen- 
sional tasks to the organism's ability to 
isolate and independently utilize the in- 
variant properties (i.e, dimensions) of 
the task stimuli. The Ss who have not 
achieved this level of perceptual learning 
with respect to the stimuli of a given task 
are assumed to respond to the stimuli as 
undifferentiated wholes. Only when S 
has related the relevant dimension, rather 
than the specific objects, to reward is 
there a basis for positive transfer to a 
subsequent reversal shift in the form of 
attention to the relevant feature-reward 
relation. 

Although the three foregoing view- 
points stress distinctly different processes 
in the development of discrimination, it is 
notable that all agree that the process 
involved acts to promote dimensional con- 
trol of discriminative response, and all 
relate ease of reversal learning to state 
of dimensional control. In this light, the 
operations of the present experiments 
may be viewed as providing information 
concerning the conditions which favor 
such control of response and, in the case 
of certain variables, as providing evidence 
bearing upon the nature of the mechanism 
involved. Thus, in regard to the effect of 
type of pretraining response which was 
observed in Exp. I it appears that, at this 
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age level at least, Observation alone is 
not a sufficient condition for the develop- 
ment of dimensional control. What is 
necessary (although not sufficient for 
these transfer tasks, as indicated by the 
significant interaction) is that the chil- 
dren make comparative perceptual judg- 
ments of the stimuli as they undergo 
variation in their dimensional attributes. 
This finding is not too surprising given 
the distractable nature of the young child, 
but it should be contrasted with several 
Studies at the adult level (studies which 
Prompted the inclusion of this variable) 
which have found significant transfer 
from pretraining employing general ob- 
Servation of the task stimuli (Pfafflin, 
1960; Rasmussen & Archer, 1961). This 
age difference may reflect a tendency for 
adults, but not children, to seek out di- 
mensions of difference when faced with 
a set of novel stimuli. The suggestion 
is that the Pretraining response employed 
with young children must be such as to 
force S to deal with the distinguishing 
features of the task stimuli if the develop- 
ment of dimensional control, however 
mediated, is to proceed. 

In regard to the second pretraining 
condition of Exp. I which was necessary 
(although not sufficient) for facilitation 
of learning, presence of four stimulus 
values per dimension, there is no clear 
basis in representational mediation theory 
to account for the effect of this condition, 
However, differentiation theory suggests 
an interpretation. The hypothesis, which 
guided the inclusion of this variable, is 
that degree of Perceived continuity among 
stimulus values may be a critical factor 

` in the detection of dimensions of this type. 
That is, when four stimuli, as opposed to 
two stimuli, specified each dimension, the 
children may have been more apt to 
Perceive the relations between them and 
to respond to them as continuous vari- 
ations of a single attribute rather than 
to their absolute Properties. Another 
possibility which is consistent with both 
selective attention theory and differentia- 
tion theory, is that the smaller distance 
between stimulus values in the four-value 
condition evoked greater perceptual 
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search and thus made more likely the 
isolation of the task dimensions. What- 
ever the mechanism may prove to be, this 
finding has important implications for 
understanding the development of dis- 
crimination processes. In this connection 
it should be noted that the ability of 
young children to transpose, which pre- 
Sumes response to dimensions rather 
than to absolute properties, has been 
found to be increased by "double dis- 
crimination" training in which Ss initi- 
ally learn a discrimination between two 
different pairs of stimuli lying on the 
same continuum (e.g, Johnson & Zara, 
1960; Sherman & Strunk, 1964), Al- 
though this observation has been inter- 
preted as a learning set phenomenon, the 
present findings suggest that a. perceptual 
learning process, of the type suggested 
here, may be involved. It is also tempt- 
ing to speculate in this vein concerning 
the findings with the "middle-size" prob- 
lem which tests transposition after $ has 
learned to discriminate three values on 
the same continuum, rather than the usual 
two values. Although absolute theory 
predicts that transposition is very unlikely 
in this problem, relational response has 
nevertheless generally been found even 
in "far tests" with children and animals 
(Hebert & Krantz, 1965). 

The failure to find an effect of the re- 
maining pretraining variable in Exp. I, 
manner of stimulus presentation, is not 
surprising from the viewpoint of repre- 
sentational mediation theory. But simul- 
taneous presentation can be hypothesized 
to favor the occurrence of the processes 
hypothesized by selective attention. and 
differentiation theories since dimensional 
variation is continuously to view in this 
condition while in successive presenta- 
tion it is present only in memory. But 
from the Viewpoint of differentiation 
theory it should be borne in mind that 
in requiring the child to compare stimuli 
Over time, successive presentation more 
closely approximated the conditions of 
the transfer tasks which, for detection of 
the dimension-reward relation, requires 
the integration of information over trials. 
Thus, the failure to fnd an effect of 
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manner of stimulus presentation may be 
due to offsetting advantages of each pre- 
sentation procedure, at least at this age 
level. 

In interpreting the effect of amount 
of pretraining observed in Exp. II, it is 
useful to contrast the present results with 
the outcome of predifferentiation experi- 
ments with adults. Arnoult’s (1957) re- 
view of experiments employing verbal 
pretraining with adults found that posi- 
tive transfer from such training may be 
expected after a minimum of 4 to 8 pre- 
training experiences per stimulus and 
reaches a maximum after 8 to 12 experi- 
ences per stimulus. For example, Can- 
tor's (1955) study found that 72 experi- 
ences per stimulus in pretraining pro- 
duced no greater positive transfer than 
only 12 experiences per stimulus. The 
similarity of these findings with those 
obtained with the present perceptual pre- 
training procedure, where positive trans- 
fer appeared with 8 experiences per 
stimulus and showed no further increase 
with 12 experiences per stimulus, points 
to a common process in these studies and 
suggests that the growth function of this 
process is sudden and discontinuous 
rather than gradual. 

The fact that the effectiveness of pre- 
training did not vary with the presence 
vs. absence of information following judg- 
ment is difficult to reconcile with the 
general view that some form of dif- 
ferential feedback following differential 
response (i.e., reinforcement) is essential 
for ‘learning. It is axiomatic in con- 
temporary experimental psychology that 
instrumental learning is powerfully aided 
by reinforcement, and Gibson’s (1953) 
review leaves little doubt that discrimina- 
tive responses similarly benefit from rein- 
forcement. However, it is not so clear 
that reinforcement will necessarily facili- 
tate the kind of change being considered 
here. The contemporary bias to assign 
reinforcement techniques the most im- 
portant role in discrimination learning 
may be appropriate when the primary in- 
terest is in the rapid establishment of 
simple indexing responses to differences 
in stimuli; but, when the interest is in 
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the promotion of dimensional control, a 
Procedure which emphasizes the inde- 
pendence and dimensional nature of the 
stimulus properties (as does the present 
perceptual pretraining procedure) may be 
of far greater importance than the pres- 
ence or absence of reinforcement. Par- 
ticularly from the differentiation view- 
point, which postulates that reinforcement 
in the sense of external confirmation is 
not necessary to the development of dis- 
crimination (Tighe & Tighe, 1966), it is 
not surprising that significant facilitation 
of learning was observed in the absence 
of reinforcement since, according to this 
theory, the critical conditions for trans- 
fer already obtain. 

Finally, comment should be made con- 
cerning the failure of pretraining to affect 
speed of learning the initial discrimina- 
tion. This finding is particularly embar- 
rassing to selective attention theory which 
posits both that facilitation of reversal 
reflects strengthened attentional responses 
and that attentional responses are prere- 
quisite to the development of differential 
choice. Kendler and Kendler (1959) 
have also hypothesized a positive relation 
between the presence of appropriate rep- 
resentational processes and speed of both 
reversal and original learning. From 
the differentiation viewpoint one would 
expect that the increased sensitivity which 
is assumed to underlie facilitation of 
reversal would also benefit discrimination 
in the initial task. However, it should be 
noted that the initial discrimination, un- 
like reversal, requires considerable learn- 
ing in addition to the detection of the rele- 
vant dimension. That is, it is in the 
initial discrimination that Ss must learn 
the “rules of the game,” and it is pos- 
sible that pretraining did not constitute 
an appreciable advantage in the face of 
such nonspecific learning. Under this 
reasoning, pretraining would be expected 
to have a measurable facilitating effect 
upon an initial discrimination of this type 
only when the stimulus-reward learning 
demands of the task were greater than the 
nonspecific learning requirements. One 
operation which would achieve this end 
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would be an increase in the number of PFAFFLIN, S, M. 


task dimensions. 
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5 experiments studied effects of initial reinforcer magnitude on subse- 
quent behavior, emphasizing its influence on reacquisition following 
adjustment to a 2nd reinforcer magnitude. It was concluded that simi- 
larity between reinforcer magnitudes in acquisition and reacquisition 
is an important determinant of reacquisition behavior. Successive con- 
trast effects, the influence of reinforcer magnitude on extinction 
performance, and proactive interference of reinforcer magnitude were 


also investigated. 


There are at least three potential 
effects of initial reinforcer magnitude 
on subsequent behavior. They are, in 
addition to, and perhaps independent 
of, the effects on instrumental behavior 
for the initial reinforcer magnitude at 
the time of its availability and ap- 
parently imply that some representa- 
tional response of the initial reinforcer 
magnitude is retained. They include 
the following: 

1. Initial and subsequent reinforcer 
magnitude may interact to determine 
performance. Under certain circum- 
stances, performance for a given re- 
inforcer magnitude may be inversely 
related to the reinforcer magnitude 
previously received. This relationship 
has been obtained with a subsequent 
reinforcer magnitude greater than zero 
and after a sufficient number of 
acquisition trials, in terms of per- 
formance for zero reinforcer magni- 
tude, i.e., extinction (cf. Amsel, 1962). 
Both phenomena are well known; and 
while they are not as predictable as one 
might prefer (see, for example, Hill & 
Wallace, 1967; Ison & Rosen, 1965), 
they have had considerable influence on 
theory construction (Amsel, 1962; 
Birch, 1961). 
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2. Retention of a subsequently ex- 
perienced reinforcer magnitude may be 
impaired because of proactive interfer- 
ence from the first (Collier, Knarr, & 
Marx, 1961; Spear, Hill, & O'Sullivan, 
1965). 

3. Even after performance has ad- 
justed to the level expected for the 
second reinforcer magnitude, subse- 
quent adjustment to a third magnitude 
may be influenced by the magnitude 
of the initial reinforcer (Vogel, Mi- 
kulka, & Spear, 1966). 

The first experiment reported here 
investigated these effects as a function 
of the circumstances under which initial 
reinforcer magnitude is experienced. 
One group (Placed) received the initial 
reinforcer magnitude in the goal box 
of a straight runway after being placed 
there by the experimenter; no instru- 
mental behavior was required. An- 
other group (Run) was given conven- 
tional runway trials—S traversed the 
runway in order to attain and consume 
the reinforcer. In terms of Effect 1 
above, it was expected that initial re- 
inforcer magnitude would interact with 
subsequent reinforcer magnitude to the 
greatest extent in those Ss which ex- 
perienced only consummatory behavior 
in association with the first reinforcer 
magnitude. First, one of the clearest 
and most widely cited instances of 
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such an interaction was obtained under 
Similar circumstances—by Collier and 
Marx.(1959) measuring bar-pressing 
rate for a common concentration of su- 
crose solution after Ss had received dif- 
ferential percentages of sucrose concen- 
tration during magazine training. 
Since magazine training entailed only 
consummatory experience, it is anal- 
ogous to the Placed groups in the pres- 
ent experiment. A second reason for 
the present prediction concerns a con- 
struct which might be loosely termed 
"response inertia" (cf. Collier et al., 
1961)—a tendency to respond at the 
Preshift rate following a shift in re- 
inforcer magnitude. Response inertia 
might be expected on several bases, 
eg., Ss that have instrumental ex- 
perience for initial reinforcer magni- 
tude may acquire, in addition to the 
association between the particular re- 
inforcer magnitude and the experi- 
mental situation, a particular running 
rate which may be emitted when S 
returns to the situation (cf. Logan, 
1960). Such a factor would oppose a 
tendency toward a contrast effect under 
conventional (Run-group) circum- 
stances, but would not be present in 
the Placed groups. 

Effect 2 above—proactive interfer- 
ence in retention of the subsequent 
reinforcer magnitude—was not explic- 
itly tested in the present design, but 
it was expected that its influence would 
be more evident in Ss given the initial 
instrumental experience. Retention of 
the second reinforcer magnitude may 
be impaired proactively to the extent 
that a different reinforcer magnitude 
was previously experienced within the 
same stimulus complex. Thus, follow- 
ing a 24-hr. interval between Postshift 
days, Ss shifted in reinforcer magni- 
tude may be less likely to behave ac- 
cording to the Postshift magnitude 


than Ss which had received omly the. 


Postshift magnitude throughout train- 
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ing. Since the stimulus complex (in- 
cluding, for example, proprioceptive 
feedback from running) would be quite 
different for Preshift and Postshift re- 
inforcer magnitude in the  Placed 
groups, there was reason to expect less 
interference of this type in the Placed 
than in the Run groups. 

Effect 3 above was not expected; 
experiments have found no influence 
of Preshift reinforcer magnitude on ex- 
tinction behavior (e.g. Metzger, Cot- 
ton, & Lewis, 1957; Zeaman, 1949). 
However, if residual effects of initial 
reinforcer magnitude did appear in 
terms of resistance to extinction after 
a second reinforcer magnitude, it was 
expected that they would be more pro- 
nounced for those Ss given initial 
instrumental experience with the first 
reinforcer magnitude. Since extinction 
included “instrumental” experience, 
accompanying performance was ex- 
pected to be more influenced by Pre- 
shift reinforcer magnitude experienced 
under similar circumstances (Run 
groups) than in the Placed groups. 


EXPERIMENT I 
Method 


Subjects—The Ss were 128 naive female 
albino rats of the Sprague-Dawley strain 
(purchased from the Sprague-Dawley Co., 
Madison, Wisconsin), approximately 65 days 
old at the start of the deprivation schedule. 

Apparatus—The runway and response 
measures have been described by Vogel. et al. 
(1966, Exp. I). In addition, Total speed— 
the reciprocal of the sum of Start, Running, 
and Goal times—was analyzed. 

Maintenance and prehandling—For seven 
days prior to the first day of runway train- 
ing, S was prehandled daily, which con- 
sisted of allowing S to explore a wooden 
drawer located on a table top for 3 min. 
during which she was picked up and re- 
placed five times by E. The feeding sched- 
ule, initiated on the first day of prehandling, 
consisted of 1 hr. free feeding starting 20-30 
min. after the last daily trial. 

Training—Runway training was divided 
into three stages: Preshift (Trials 1-20), 
two trials daily per S; Postshift (Trials 


| 


RESIDUAL EFFECTS OF REINFORCER MAGNITUDE 


21-44), two trials per day; and Extinction 
(Trials 45-76), eight trials per day. The 
intertrial interval was approximately 15 min. 
On each conventional runway trial, S was 
placed in the start box facing away from 
the door which was opened 3 sec. later re- 
gardless of S’s orientation. 

Design—The 128 Ss were randomly as- 
signed to 8 groups of 16 Ss each, comprising 
a 2X22 factorial design in which type 
of Preshift experience, Preshift reward mag- 
nitude (1 or 16 regular 45-mg. Noyes pel- 
lets) and Postshift reward magnitude (1 or 
16 pellets) were varied. During Preshift, 
half of the Ss were given their daily trials 
in the conventional manner (Run groups)— 
by running from the start box to the goal 
box. The other half (Placed groups) were 
merely placed in the goal box facing the 
nearby food cup on their Preshift trials. 
All Postshift and Extinction trials were run 
in the conventional manner. 


Results and Discussion 


General effect of response measures. 
—Four response measures were ana- 
lyzed in each of the experiments re- 
ported in this paper. These included 
Start, Running; and Goal speeds as de- 
fined in each of the Method sections, 
as well as Total speed, the reciprocal 
of the sum of each of the component 
times. However, only the results of 
analyses on Total speed are reported 
for two reasons: First, none of the 
conclusions reached were dependent 
upon which speed measure was con- 
sidered; and second, space considera- 
tions accordingly dictated that only the 
response measure with greatest gen- 
erality be reported. 

Generally the magnitude, but not the 
direction, of the effects reported in this 
paper and elsewhere (for example, 
Spear & Spitzner, 1967; Vogel et al., 
1966) consistently differed among re- 
sponse measures in two major respects. 
First, the influence of any decrement 
in reinforcer magnitude typically is 
greatest and earliest nearest the goal 
box. That is, a reduction or removal 
of reinforcer magnitude is accompanied 
by a relatively quick drop in Goal 
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speeds, followed after a few trials by 
a decrease in Start speeds; and the 
effect of this reduction in reinforcer 
‘magnitude is typically greater in Goal 
speed than in Start speed. Second, an 
increase in reinforcer magnitude is ac- 
companied by a reverse spatial-temp- 
oral relationship in terms of its effect 
on the component speeds. This kind 
of reinforcer shift has its greatest and 
first influence on Start speeds followed 
by Running speeds followed by Goal 
speeds, Although these relationships 
among component speeds are interest- 
ing and important enough in their own 
right, they will not be discussed further 
in this paper. Nearly all analyses pre- 
sented here are in terms of Total speed 
only. 

Initial effect of preshift reinforcer 
magnitude, in placed vs. run Ss.—lt 
was important to establish that differen- 
tial reinforcer magnitude during Pre- 
shift differentially affected some behav- 
ior in the Placed Ss. This was first 
evaluated by comparing speeds on the 
first instrumental trial, that is, speed on 
the Placed Ss’ first Postshift trial 
(prior to their experience with the 
Postshift reinforcer magnitude) and 
the Run Ss’ first Preshift trial (see 
Fig. 1). If Ss in the Placed conditions 
were influenced by Preshift reinforcer 
magnitude, an interaction would be ex- 
pected such that speeds should be di- 
rectly related to Preshift reinforcer 
magnitude in the Placed conditions but 
not in the Run conditions. Accord- 
ingly three-way analyses of variance 
(including the dummy variable, Post- 
shift reinforcer magnitude) revealed 
that this interaction between Preshift 
reinforcer magnitude and Placed vs. 
Run occurred as predicted. The ap- 
propriate F for Total speed was 6.38. 
With 1 and 120 df, an F of 3.92 is re- 
quired to assert statistical significance 
at the .05 level. The overall greater 
speed by Placed Ss (which was con- 
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reinforcer magnitude influenced instrumental 
performance by Placed Ss in at least one 
respect, the magnitude-of-reinforcer function 
for these Ss is shown on their first instru- 
mental tríal (Postshift Trial 1) in contrast 
to the same function for the Run Ss on both 
the latter's first instrumental trial (Preshift 
Trial 1) and on their first Postshift trial. 


founded for this particular response 
measure by differential handling and 
time on deprivation) also was signif- 
icant on this trial, F (1, 120) = 75.77. 

The influence of Preshift reinforcer 
magnitude in the Placed conditions may 
also be demonstrated by analyses on the 
first Postshift trial for all Ss. Speeds 
on this trial reflect only the Preshift 
reinforcer. This analysis revealed that 
speed was greater the greater the Pre- 
shift reinforcer magnitude, F (1, 120) 
=7.78 for Total speed. Moreover, 
there was no interaction between this 
variable and Placed vs. Run, F (1, 
120) = 281, p > .05, while the latter 
variable had considerable main effect ; 
eg. greater Total speed for Run Ss 
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is reflected by F (1, 120) — 161.96. 
Thus to the extent that the null hypoth- 
esis may be accepted, the influence of 
Preshift reinforcer magnitude did not 
differ (in terms of speeds for that re- 
inforcer magnitude) whether Ss re- 
ceived only consummatory experience 
in the goal box or instrumental-plus- 
consummatory experience. 

On the basis of these two analyses, 
we conclude that reinforcer magnitude 
did differentially affect some behavior 
of Ss in the Placed conditions. 

Postshift performance.—Although 
the curves of Postshift performance 
shown in Fig. 2 give the appearance of 
neat, symmetrical, and substantial con- 
trast effects, the statistical differences 
obtained are best characterized by the 
term "marginal" This should not be 
surprising in view of the many failures, 
published and unpublished, to obtain 
statistically reliable successive contrast 
effects with rats, and the relatively 
weak effects obtained even when they 
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meet criteria of statistical significance 
(eg. Collier & Marx, 1959; Spear & 
Spitzner, 1966). Two analyses of 
these effects will be reported. First, 
Table 1 shows the results of individual 
t tests on each day using the pooled 
within-Ss error term to evaluate sepa- 
rately the depression and elation effects 
for the Run groups alone. We have 
concluded that these data reflect the 
reliable occurrence of a depression 
effect but not of an elation effect and 
believe that we are in less danger of 
a Type I error than we would be of a 
Type II error with the opposite con- 
clusion. First, the statistical signif- 
icance of the elation effect was marginal 
at best and occurred essentially only in 
the running-speed portion of the run- 
way. In view of the numerous other 
failures to demonstrate an elation effect 
under comparable circumstances (see, 
eg. Spence, 1956), these differences 
certainly do not warrant rejection of 
the null hypothesis. Second, it may be 
seen that the base-line control Group 
16-16-Run was running slower than 
Group 16-1-Run at the end of Pre- 
shift when both groups were receiving 
the same reinforcer magnitude. More- 


over, Ss in Group 16-16-Run were 
about as slow during Preshift as Ss 
which were receiving only one pellet as 
the Preshift reinforcer. If the mean 
speed of the former condition has been 
underestimated because of sampling 
error, the tendency toward an elation 
effect would be overestimated. 
Analyses of variance were applied to 
the Postshift data, and the following 
are representative of the results. First, 
2x2x2 analyses of variance were 
completed for each speed on each day 
of Postshift, including all three inde- 
pendent variables. If the contrast ef- 
fects (dependency of the effect of Post- 
shift reinforcer magnitude on Preshift 
reinforcer magnitude) occurred dif- 
ferentially for Placed vs. Run Ss, it 
would have been reflected by a reliable 
three-way interaction. However, this 
interaction did not attain statistical 
significance on any day for any re- 
sponse measure. As another test, the 
elation and depression effects were ex- 
amined separately in terms of mean 
speed during Days 24 of Postshift. 
That is, taking the mean speed for each 
S on Days 2-4 as the basic response 
measure, 2 X 2 analyses of variance 


TABLE 1 


EVALUATION Or THE DEPRESSION EFFECT (GROUP 1—1 MINUS Group 16—1) AND ELATION 
Errect (Group 1—16 minus GROUP 16—16) ror THE Run Groups (Exp. 1) 
IN TERMS or MEAN DAILY SPEEDS IN POSTSHIFT 


Response Measure 


Possahity Start Speed Running Speed Goal Speed Total Speed 
Fation | Depression | Elation | Depression | Elation | Depression | Elation _ | Depression 
2 09 2.45* 1.86 1.71 
3 1.26 1.63 247% 2.00* 
4 28 2.57* 1.43 2.00* 
5 68 1.14 1.00 1.57 
6 85 2.20* .86 1.14 
7 1.19 24 1.28 A3 
8 43 1.24 A3 1.43 


Note,—The scores shown are values on the £ distribution with 60 df., computed with a pooled within-groups 


error term. 
*p <.05, two-tailed test. 
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and 45-mg. Noyes pellets totaling 11 gm. 
and was given 25 min. after the last daily 
trial; (b) Each S received five trials daily 
with an intertrial interval of 4-5 min. 

Design.—The 60 Ss (30 in each replica- 
tion) were randomly assigned to six groups 
comprising a 2 X 3 factorial design in which 
runway and reinforcer magnitude during 
Acquisition (1, 3, or 9 45-mg. Noyes pellets) 
were varied. Acquisition lasted for 90 trials 
(18 days), and Extinction for 40 trials 
(Trials 91-130, Days 19-26). During Re- 
acquisition (Days 27-32, Trials 131-160) all 
Ss received 3 pellets per trial as the rein- 
forcer. 


Results and Discussion 


Mean Total speed throughout Ac- 
quisition, Extinction, and Reacquisition 
is shown in Fig. 3. Analyses of vari- 
ance were completed with type of run- 
way, replications, and magnitude of re- 
inforcer during Acquisition as orthog- 
onal variables. Although type of run- 
way consistently yielded a significant 
main effect on the variance (greater 


TOTAL SPEED (FT/SEC) 


Fie. 3. 
Exp. II. 


Mean Total speed per day in 
(Speeds are averaged across dif- 
ferential construction of runway, so each 
point represents data from 20 Ss.) 
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speed in Runway 0), replications did 
not; and neither source contributed to 
any important interactions. Therefore, 
further attention will be given only to 
the effect of reinforcer magnitude. 

Mean Total speed was significantly 
affected by reinforcer magnitude on the 
last day of Acquisition, F (2, 48) — 
6.53, p < .01. It may be seen in Fig. 3 
that the majority of variance is contrib- 
uted by the slower speeds for Ss re- 
ceiving 1 pellet relative to Ss receiving 
Sor 9. 

Throughout the first five days of Ex- 
tinction, resistance to extinction was 
significantly influenced by prior mag- 
nitude of the reinforcer—for example, 
on the first day of Extinction, F (2, 
48) = 6.68, p < .01. It may be seen 
that the greater part of the variance is 
contributed by the lesser resistance to 
extinction of Ss which had 9 pellets 
relative to the other two groups. Dur- 
ing the last three days of Extinction, 
Acquisition reinforcer magnitude no 
longer influenced speed. The Fs indi- 
cating the influence of reinforcer mag- 
nitude during the last three days of Ex- 
tinction were 1.18, 1.44, and 1.05, re- 
spectively. With 2 and 48 df, an F 
of 3.19 is required for significance at 
the .05 level. 

Although the influence of the original 
reinforcer magnitude did not immedi- 
ately reappear on the first day of Re- 
acquisition, the effect did appear on 
the second day of Reacquisition and 
was maintained throughout the remain- 
ing five days. For each of Days 2-6, 
the Fs reflecting the influence of the 
magnitude of reinforcer experienced 
during Acquisition were 4.25, 4.27, 
6.98, 5.15, and 5.82, respectively. With 
2 and 48 df, these Fs are all significant 
at better than the .025 level. 

These results demonstrate retention 
of the influence of Acquisition rein- 
forcer magnitude throughout a series 
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of extinction trials. This contrasts 
with the complete absence of retention 
of the Preshift reinforcer's influence 
in Exp. L At this point, it appeared 
likely that the major difference between 
these two experiments was the kind of 
experience through which the influence 
of the original reinforcer was to be re- 
tained. In Exp. I the interpolated ex- 
perience with another reinforcer re- 
sulted in no effective retention; but in 
Exp. II interpolated experience with 
extinction did not erase retention of 
the initial reinforcer magnitude, In 
other words, it appeared that if S has 
some interpolated experience with a 
*new" reinforcer magnitude, the effect 
of the original reinforcer magnitude 
will be less likely to show up at some 
later time if the interpolated reinforcer 
magnitude is different from zero. 

On this basis, a theory was con- 
structed, the details of which are not 
really relevant for this paper. Briefly, 
the then-presumed greater effect of 
initial reinforcer magnitude following 
interpolated extinction was interpreted 
to imply lesser unlearning or extinction 
of the representational response of the 
original reinforcer than when some re- 
inforcer magnitude different from zero 
had been interpolated. The degree of 
unlearning was supposed to be a con- 
sequence of the extent of “recovery” 
and subsequent re-unlearning of S’s 
representational response for the orig- 
inal reinforcer during the intervals be- 
tween presentation of the interpolated 
reinforcer. This “recovery” of S’s 
representational response for the orig- 
inal reinforcer was presumed to be 
greater following a trial with the same 
kind of reinforcer than following an 
extinction trial. 

This led us to examine more directly 
another residual effect of initial rein- 
forcer magnitude—Effect 2 above, the 
recovery of initial reinforcer magnitude 
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(producing forgetting of the second re- 
inforcer) over an interval following 
presentation of the second. 


ExpertmMent IIT 


This experiment was designed to 
determine whether the recovery of an 
initially experienced reinforcer magni- 
tude (and resultant competition with 
a subsequent reinforcer) might be dem- 
onstrated in a reinforcer shift paradigm 
if massed trials were given with a 
minimum number of 24-hr. intervals 
between daily sessions. Previous stud- 
ies of the effects of reinforcer shifts 
on rat behavior necessarily had em- 
ployed widely distributed trials com- 
posed of several daily sessions of one 
or more trials each; and daily sessions 
are, of course, separated by 24-hr. 
intervals. Because recovery of various 
sorts may take place during all stages 
of training when such widely distrib- 
uted trials are employed, it has been 
suggested that specific effects of the 
recovery of proactively competing asso- 
ciations are difficult to demonstrate if 
they occur at all (e.g., Kehoe, 1963). 
Moreover, Underwood and Ekstrand 
(1966, 1967) have convincingly dem- 
onstrated that conventional interference 
effects, possibly resulting from recover- 
ing associations, do not occur in hu- 
man verbal learning when the compet- 
ing associations are acquired under 
widely distributed practice. 

Therefore, Exp. III employed four 
groups of rats with conditions com- 
parable to those of the Run groups of 
Exp. I: In terms of their Preshift- 
Postshift reinforcer magnitude, the 
various groups received 1-1, 1-15, 
15-15, or 15-1 20-mg. pellets. But in 
contrast to the distributed trials of Exp. 
I, Preshift consisted of eight massed 
trials (15-sec. intertrial interval), Post- 
shift consisted of six massed trials, and 
Extinction included eight massed trials. 
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To minimize differential food satiation, 
a 24-hr. interval was interpolated be- 
tween Preshift Trials 6 and 7. An- 
other 24-hr. interval was interpolated 
between the end of Postshift and the 
beginning of Extinction to permit 
assessment of the recovery of the effects 
of Preshift reinforcer magnitude dur- 
ing this interval following adjustment 
to Postshift reinforcer magnitude. To 
test the possibility that recovery of the 
effects of reinforcer magnitude might 
take place over intervals less than 
24 hr., half the Ss in each reinforcer 
magnitude condition had a 1-hr. inter- 
val interpolated between Postshift 
Trials 4 and 5, while the remaining Ss 
continued on massed practice through- 
out Postshift. 

Several predictions were possible : 
(a) It was expected that no successive 
contrast effects would occur because of 
the small number of Preshift trials; 
and, following the shift in reinforcer 
magnitude, Ss should rapidly adjust 
their running speeds to the level de- 
fined as appropriate by the base-line 
control (b) If recovery of the effects 
of Preshift reinforcer magnitude occur- 
red over short intervals, it was ex- 
pected that Preshift reinforcer magni- 
tude would contribute a significant 
Source of variance on Postshift Trial 
5 for Ss with the 1-hr. interval between 
Trials 4 and 5 but not for Ss continued 
on massed practice during Postshift, 
(c) Similarly, to the extent that the 
effects of Preshift reinforcer magnitude 
recovered over the 24.hr. interval be- 
tween Postshift and Extinction, it was 
expected that Preshift reinforcer mag- 
nitude would contribute a significant 
Source of variance to running speeds at 
least on the first trial of Extinction. 
(d) Finally, it was expected that resist- 
ance to extinction would be directly re- 
lated to reinforcer magnitude for Ss 
in Groups 1-1 and 15-15 because of the 
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relatively small number of acquisition 
trials, 


Method 


Subjects and apparatus.—The Ss were 96 
naive rats with the identical specifications 
of those in Exp, I and IL The runway was 
Runway 0 of Exp. II, Previously described 
by Spear and Spitzner (1967, Exp. IV), 

Maintenance, prehandling, and training — 
Experiment III differed from Exp. I and II 
in the following respects : (a) The Ss were 
trained in six replications, each containing 
two Ss from each of the eight groups (see 
below). (b) The daily food ration con- 
sisted of 10.2 gm. of ground chow and 20-mg. 
pellets, including reward pellets consumed in 
the runway. (c) Within each of three con- 
secutive days, trials were massed with a 
negligible intertrial interval, On Day 1, each 
S received six trials. On Day 2, each S 
received eight trials: for half the Ss there 
was a l-hr. interval between the sixth and 
seventh trial of Day 2; while for the remain- 
ing Ss the start of the seventh trial occurred 
immediately after the sixth (as on all other 
trials). On Day 3, all Ss received eight 
extinction trials, with only an empty food 
cup present in the goal box, 

Design—The Ss were randomly assigned 
to eight groups comprising a 2X2X2 fac- 
torial design in which Preshift reward mag- 
nitude—reward on Day 1 and the first two 
trials of Day 2 (one 20-mg. regular Noyes 
pellet or 15 such pellets to which 5 drops 
of water were added); Postshift reward 
magnitude—reward on Trials 3-8 of Day 2 
(1 pellet or 15 pellets plus 5 drops of 
Water); and  intertrial interval between 
Trials 6 and 7 of Day 2 (1 hr. or immediate 
replacement in the start box) were varied. 
Letting the first number indicate Preshift 
reward, the second number Postshift reward, 
M immediate replacement in the start box 
after Trial 6 on Day 2, and S a 1-hr. interval 
between Trials 6 and 7 of Day 2, the eight 
groups were: 1-1-M, 1-1-S, 15-1-M, 15-1-S, 
1-15-M, 1-15-S, 15-15-M, and 15-15-S. 


Results 


The results generally confirmed the 
predictions, including the recovery of 
the effect of Preshift reward over the 
24-hr. interval between Postshift and 
Extinction. However, the 1-hr. inter- 
val interpolated between Trials 4 and 
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5 of Postshift for half the groups did 
not permit similar recovery: this inter- 
val did increase running speeds uni- 
formly, but the extent of increase did 
not depend upon prior or present re- 
inforcer nor upon their interaction. 

'The effect of Preshift and Postshift 
reinforcer magnitude in terms of mean 
Total speed for each trial is shown in 
Fig. 4. An analysis of variance con- 
tributed by Preshift reinforcer magni- 
tude, Postshift reinforcer magnitude, 
and interval between Postshift Trials 4 
and 5 was performed on these speeds 
for each day of training. 

On the last trial of the first day 
(Trial 6), the larger Preshift rein- 
forcer magnitude was already accom- 
panied by greater Total speed, F (1, 
88) = 10.90, p < .01. On the trial in 
which S was to experience a shift in 
reinforcer magnitude upon entry into 
the goal box (Trial 9) the effect of 
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Fic. 4. Mean Total speed per trial in 
Exp. III. (Speeds are averaged across the 
differential in the interval between Postshift 
Trials 4 and 5, so each point represents 
data from 24 Ss.) 
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Preshift reinforcer magnitude was 
sizable and significant in terms of Total 
speed, F (1, 88) = 37.38, and the Fs 
for all other (dummy) sources of vari- 
ance ranged from .07 to 1.58. 

The effect of Postshift reinforcer 
magnitude altered speeds after only a 
single experience with the new rein- 
forcer (see Fig. 4). In fact, on Trial 
10 the effect of Postshift reinforcer 
magnitude on Total speed attained sta- 
tistical significance at the .01 level, 
F (1, 88) = 9.95. On this trial, how- 
ever, Preshift reinforcer magnitude still 
had a significant effect on Total speeds, 
F (1, 88) =9.74, p « .01, primarily 
attributable to the slower change 
in behavior farthest from the goal. 
By the second trial following the 
reinforcer shift, only the effect of Post- 
shift reinforcer magnitude was signif- 
icant—all Fs reflecting the effect of 
Preshift reinforcer magnitude were less 
than 1. 

On Trial 13, immediately after the 
variation in intertrial interval, the 
effects in terms of reinforcer magnitude 
remained identical to those on the pre- 
vious trial although Ss ran faster after 
the longer interval between Trials 12 
and 13, F (1, 88) = 13.91, p « .01. 
The longer interval of 1 hr. did not 
permit differential recovery of the 
effects of Preshift reward: No inter- 
action between reinforcer magnitude 
and intertrial interval approached sta- 
tistical significance and the main effect 
of Preshift reinforcer magnitude yielded 
an F value less than 1. 

Recovery of the effect of Preshift 
reinforcer magnitude did occur after 
the 24-hr. interval between Postshift 
and the first trial of Extinction (see 
Fig. 4): On this trial significantly 
greater mean Total speeds were asso- 
ciated with large reinforcer magnitudes 
than with small reinforcer magnitudes, 
F (1,88) = 15.30, p < .01. The influ- 
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ence of intertrial interval was still pres- 
ent after the 24-hr, interval, as Ss with 
the longer interval continued to show 
faster Total speed, F (1, 88) — 5.85, 
b «.05. This was surprising since 
earlier work had indicated that similar 
effects of intertrial interval were rela- 
tively transient (Spear & Spitzner, 
1967, Exp. II). No other Sources of 
variance approached statistical signifi- 
cance on this trial, After the first Ex- 
tinction trial, neither the effect of 
Preshift reinforcer magnitude nor of 
intertrial interval contributed signifi- 
cant variance to speed; in fact, the Fs 
Íor these effects rarely exceeded 1. 
However, Ss with the larger Postshift 
reinforcer magnitude contributed to 
run faster, a signifiant (p < .05) ef- 
fect continuing from the second 
through the seventh trial of Extinction, 


Discussion 


Experiments I-III demonstrated cer- 
tain residual effects of initial reinforcer 
magnitude remaining throughout instru- 
mental trials for a different reinforcer 
magnitude. First, differential behavior 
attributable to initial reinforcer magni- 
tude reappeared following extinction 
trials but not after trials with a nonzero 
reinforcer, Assuming that the differences 
between these two phenomena are attrib- 
utable to the differential retroactive inter- 
ference effects of extinction trials vs. 
trials with some nonzero reinforcer mag- 
nitude, a theory was suggested; but it 
required assumptions concerning recovery 
of S's representational response for the 
first reinforcer magnitude during an 
interval following the second. This led 
us to Exp. III which demonstrated that 
although such recovery does occur over 
a 24-hr. interval, shorter intervals (1 
hr.) apparently are not sufficient to per- 
mit such recovery. Obviously, the source 
of the difference required a different 
theory or emphasis on different param- 
eters. 

At this point, it was desirable to in- 
vestigate the apparently contrasting re- 
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tention phenomena of Exp. I and II 
within a single experiment. An experi- 
ment similar in details to Exp. II was 
then conducted in which rats in a straight 
Tunaway received either 10 or 1 pellets 
during the first Stage (60 trials), 
orthogonally combined with second-stage 
(40 trials) reinforcer magnitude of either 
1 or 0 pellets (10 naive Ss in each of 
the four groups). Finally, a third stage 
of 30 trials with 10 pellets was given all 
Ss. It was expected that original rein- 
forcer magnitude would affect third-stage 
behavior for Ss given interpolated trials 
with zero reward, but Ss given inter- 
polated training with 1 pellet would not 
differ in third-stage performance as a 
function of initial reinforcer magnitude, 
Initial reinforcer magnitude significantly 
affected behavior during the first stage 
but not at the conclusion of the second. 
However, the predicted interactions did 
not occur: The effect of original rein- 
forcer magnitude on third-stage behavior 
did mot differ between Ss given inter- 
polated experience with 0 pellets and 
those given interpolated experience with 
1. In fact, regardless of the reinforcer 
magnitude during the second stage of 
training, third-stage performance (speed) 
Was greater for Ss given 10 than for 
those given 1 pellet during the first stage. 

How did this fit in with the results 
of Exp. I and III vs. those of Exp. II 
and others (cf. Vogel et al, 1966 vs. 
Zeaman, 1949) ? Apparently, the nature 
of the reinforcer magnitude during inter- 
polated experience is not so critical a 
determinant of third-stage performance as 
is the similarity between the magnitude of 
the reinforcer employed during the first 
and third stages. Essentially all experi- 
ments finding no retention of original 
reinforcer magnitude measured this effect 
in terms of performance during extinction 
(Zeaman, 1949; Metzger et al, 1957; 
Exp. I and III of the present report). 
However, experiments showing retention 
of original reinforcer magnitude evaluated 
it in terms of third-stage performance 
for a magnitude also given one of the 
groups during initial training (Vogel 
et al., 1966; Ison & Rosen, 1965; Logan, 
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1960; Exp. II of the present paper; 
Wagner & Thomas, 1966; and the experi- 
ment described in the above paragraph). 
This suggests that third-stage perform- 
ance may be facilitated if the reinforcer 
magnitude employed has previously been 
experienced by S, relative to the per- 
formance of an S which has not pre- 
viously experienced that reinforcer mag- 
nitude. To assess this possibility, Exp. 
IV (described below) was completed with 
the basic rationale of that described in 
the above paragraph, but in which each 
S received a different reinforcer magni- 
tude in each of the first, second, and third 
stages of runway training. Thus Exp. 
IV included Ss given either 16 or 1 pel- 
lets during the first stage of training 
orthogonally combined with either 0 or 
2 pellets during the second stage. Then 
all Ss were given third-stage training 
with 4 pellets. If behavioral differences 
occur in this stage because of the ad- 
vantage of prior experience with the spe- 
cific magnitudes, then in this experiment 
there should be no effects of initial rein- 
forcer magnitude on third-stage perform- 
ance under any conditions. 


EXPERIMENT IV 
Method 


Subjects—The Ss were 32 rats with the 
same specifications as those employed in 
Exp. I, II, and III. 

Apparatus.—The Ss were trained in a 
straight runway consisting of a start box, 
14 in. long and 5 in. wide; an alley, 68 in. 
long and 4 in. wide; and a goal box, 14 in. 
long and 5 in. wide. The start and goal 
boxes were separated from the alley by clear 
Plexiglas doors used to prevent retracing. 
All portions of the 18-in. deep runway 
Were covered by Plexiglas. The wooden 
floor and walls were painted flat black. 
Reward pellets were placed in a shallow 
black dish at the rear of the goal box. A 
2i-in. high "shield" attached to the front 
of the pellet container prevented S from 
seeing the contents of the dish until it had 
reached the rear of the goal box. 

Response measures included: (a) Start 
time—from the start-box door to 6 in. into 
the alley, (b) Running time—from Start 
time to 13 in, before the goal box, a total 


of 49 in., (c) Goal time—from Running time 
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to a point 22 in. farther, 9 in. inside the 
goal box and 2 in. before the pellet container, 
Total time was derived from the sum of 
Start, Running, and Goal times on each 
trial. All times were converted to speeds 
in feet per second. 

Procedure.—Maintenance, prehandling, and 
runway training were essentially identical to 
that of Exp. I and II. 

Design.—Training was divided into three 
stages: Preshift (Trials 1-60; two trials 
were given on Day 1, three on Day 2, and 
five on each of Days 3-13), Postshift (Trials 
61-105, Days 14-22) and Reacquisition 
(Trials 106-135, Days 23-28). The 32 Ss 
were randomly assigned to four groups of 
eight Ss each comprising a 2 X2 factorial 
design in which Preshift reinforcer magni- 
tude (16 or 1 20-mg. Noyes pellets) and 
Postshift reinforcer magnitude (2 or 0 pel- 
lets) were varied. During Reacquisition, all 
groups received 4 pellets per trial as the re- 
inforcer, 


Results and. Discussion 


The results in terms of Total speed 
are shown in Fig. 5. Analyses of vari- 
ance indicated that each of the Pre- 
shift and Postshift reinforcer magni- 
tudes yielded effects typically obtained 
with this relatively large number of 
Preshift trials: (a) The Ss given 16 
pellets had greater speeds during Pre- 
shift than those given 1 pellet (for 
Total speed on the last day of Preshift, 
F (1, 28) = 38.17); (b) After the 
third day of Postshift (Day 16), Post- 
shift performance was inversely related 
to Preshift reinforcer magnitude until 
Day 22, at which point this variable no 


TOTAL SPEED (FT/SEC) 


.Fic. 5. Mean Total speed per day in 
Exp. IV. 
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longer had a significant effect on 
speeds. For Days 17—21, the source of 
Preshift reinforcer magnitude yielded 
Fs ranging from 4.89 to 12.35, all sta- 
tistically significant at better than the 
-05 level with 1 and 28 df. On the final 
day of Postshift (Day 22), Preshift 
reinforcer magnitude had no statist- 
ically significant effect on behavior, 
F (1, 28) = 242. It also might be 
noted that at no time did the effect 
of Preshift reinforcer magnitude inter- 
act significantly with the Postshift re- 
inforcer magnitude; that is, the influ- 
ence of Preshift reinforcer magnitude 
did not differ for Ss shifted to 2 pel- 
lets compared to those shifted to 0 
(extinction). The only exception to 
to this occurred on the first day of 
Postshift (Day 14) after which the Fs 
for the interaction between Preshift and 
Postshift reinforcer magnitudes ranged 
from .01 to 1.85 in terms of mean daily 
"Total speed. 

Because previous data had indicated 
that initial reinforcer magnitude would 
be retained through experiences with 
another reinforcer magnitude but would 
affect subsequent performance only if 
that initial reinforcer magnitude were 
also employed as the Reacquisition re- 
ward, and since the 4-pellet Reacquisi- 
tion reinforcer in the present experi- 
ment was equally similar to a 1- and 
16-pellet reward (at least it was mid- 
way on the log scale between 1 and 
16), it was expected that Preshift re- 
inforcer magnitude would not affect 
Reacquisition behavior regardless of 
prior conditions. To the extent that 
the null hypothesis can be accepted, 
this prediction was upheld.  Preshift 
reinforcer magnitude contributed no 
significant variance component in any 
speeds on any Reacquisition day. For 
example, in terms of Total speed, the 
Fs for the main effect of Preshift re- 
inforcer magnitude ranged between .24 


and 1,77 during each of the six days 
of Reacquisition. 

We believe that the null hypothesis 
may be accepted with a little less than 
the usual caution in this case. First, 
other data in this experiment are or- 
derly and reliable in terms of agreement 
with previous studies. Second, the null 
hypothesis was predictable from the 
previous set of experiments reported 
here. Third, this particular effect (i.e., 
lack of effect) has been replicated by a 
student, William Hoyer, in our labora- 
tory. This unpublished experiment in- 
cluded a different runway, different re- 
inforcer magnitudes (though similar 
relative amounts) and also was inter- 
nally consistent in other respects. We 
conclude that initial reinforcer magni- 
tude will affect behavior following an 
equalizing treatment to the extent that 
the magnitude employed in the third 
stage is identical (or perhaps similiar) 
to that of the first. 


GENERAL DISCUSSION 


This set of experiments has demon- 
strated that some representational response 
of reinforcer magnitude may be retained 
throughout experience with, and adjust- 
ment to, a different reinforcer magnitude. 
The resultant residual effects of the initial 
reinforcer may be reflected in two ways: 
(a) by recovery of the first representa- 
tional response over an interval of rela- 
tive inactivity following acquisition of the 
second, and (b) by reacquisition perform- 
ance which is dependent upon the similar- 
ity between the reinforcer magnitude 
employed during reacquisition and that 
initially experienced. These generaliza- 
tions may be extended to zero reinforcer 
magnitude. If the initial magnitude is 
nothing (that is, no food pellets) and 
food pellets are subsequently given in the 
same situation, the initial representational 
response for 0 pellets may spontaneously 
recover over a 24-hr. interval between the 
rewarded sessions (Spear et al, 1965; 
Hill, Erlebacher, & Spear, 1965). Fur- 
thermore, if initial nonrewarded trials are 
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given and followed by trials with reward, 
subsequent speed for zero reward (that 
is, extinction performance) is greater 
than if initial nonrewarded trials were 
not given, a fact which Spear and 
Spitzner (1967) have concluded is best 
explained by the similarity between initial 
and subsequent reinforcer magnitudes. 

Similar facts have been organized by 
Capaldi into a workable theory (eg. 
Capaldi, 1967). They may be sum- 
marized at one level by stating a some- 
times recognized, but not widely used, 
generalization: Properties of specific mag- 
nitudes of reward, including zero reward, 
may result in relatively permanent changes 
in the rat’s behavior. It does not seem 
unreasonable to suppose that S’s repre- 
sentational response (say, 7) of the rein- 
forcer may function in retention and for- 
getting as any other “memory” (cf. Spear, 
1967). Thus S learns about the magni- 
tude of the reinforcer, an acquisition 
which may subsequently affect his ad- 
justment to a second reinforcer magni- 
tude, retention of the second reinforcer 
magnitude and reacquisition for the first 
reinforcer magnitude. 
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3 predictions derived from the Spence postulate of habit-drive multi- 
plication were tested by orthogonal variation of paired UCS intensity 
(H) and average UCS intensity (D) over trials in a 2 X 2 design 
using 25 Ss per group. It was predicted that paired and average inten- 
sities would interact over all 100 conditioning trials and at asymptote, 
resulting from a larger H effect at high D. There was no evidence 
for either of these hypothesized interactions. In addition, it was 
assumed that the D effect would increase over trials. While this latter 
interaction was reliable, its form was not in accord with an hypothesis 


of H X D. 


A recent review of eyelid condition- 
ing data by Prokasy (1967) indicated 
that there was little evidence favoring 
Spence's (1956) E = H X D postulate 
for classical aversive conditioning. The 
majority of studies examined by Pro- 
kasy manipulated D through manifest 
anxiety; however, Spence has also 
assumed that D is dependent on the 
average UCS intensity over trials 
(Spence, Haggard, & Ross, 1958), 
which can be varied independently of 
H by presenting the UCS alone. 

Given a constant number of condi- 
tioning trials, the H x D interaction 
can be tested by taking D as a function 
of the average UCS intensities on 
paired (CS-UCS) and unpaired 
(UCS-alone) trials and H as a func- 
tion of the UCS intensity on CS-UCS 
trials. 

The present study was designed to 
test the relationship between H and 
D by using two widely separate levels 
of H, defined by paired UCS intensity, 
and two levels of D, defined by average 


1 This investigation is based upon the first 
author’s MS thesis under the direction of the 
second author, The research was supported 
by the National Institute of Mental Health 
Grant 12582-01 to the second author. 


UCS intensity over all trials. Given 
the present design and the assumption 
that H and D multiply, three predic- 
tions concerning the interaction of H 
and D were made. First, the inter- 
action between paired (H) and average 
(D) UCS intensity, collapsed across 
trials, should yield a larger difference 
between Groups HH (high H, high D) 
and LH (low H, high D) than between 
Groups HL (high H, low D) and LL 
(low H, low D); in other words, the 
difference between high and low H 
groups should be larger at high D 
than at low D. Secondly, this inter- 
action should be most evident over the 
terminal training trials where H is also 
at a maximum. Finally, the interaction 
between average intensity (D) and 
trials, the other source of H, should 
show a larger performance increment 
over trials for combined Groups HH 
and LH than the combined HL and 
LL groups. 


METHOD 


Subjects—The Ss were 108 female intro- 
ductory psychology students. Five Ss were 
discarded as voluntary responders, using the 
latency criterion of Spence and Ross (1959). 
Two additional Ss were dropped for E error 
and one S for failure to meet the criterion 
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of 20% CR's over the last 20 conditioning 
trials. The remaining 100 Ss were parti- 
tioned equally among the four cells of the 
2X2 design. 

Apparatus—All Ss were seated in a 
dental chair in a 5X7-ft. sound-insulated 
room, with a 55 db., 660-cps tone as a mask- 
ing noise. The room was illuminated by a 
7.5-w. amber light located behind and above 
S. Movements of the right eyelid activated 
a microtorque potentiometer via a nylon 
thread and plastic eyepiece taped to S's eye- 
lid. The output of the potentiometer, acting 
as a voltage divider, was recorded on a Grass 
Model 5 polygraph. A buzzer was used as 
a ready signal on each conditioning trial. 
A small neon bulb, located 3 ft. from S 
behind a 2-in. diameter ground-glass disc, 
served as fixation point. Four additional 
bulbs, also behind this disc, served as a CS 
of 550-msec. duration. The air puff was de- 
livered to the right eye during the last 
50 msec. of the CS. The sequence of ex- 
perimental events was controlled by decade 
interval timers and a tape programmer. 

Design—The UCS intensities given each 
of the four groups on paired and unpaired 
trials are summarized in Table 1. In com- 
paring the column means for paired and 
average intensity, the only variation is in 
paired intensity and thus H. Since average 
air puff intensity has been equated, there 
should be no differences in D (Spence et 
al, 1958). Similarly, the row means of 
Table 1 indicate that H was held constant 
by equating paired puff intensities but that 
D was manipulated by differences in average 
puff strength. 


The difference in paired intensities was of 
a magnitude which has consistently generated 
reliable performance differences (Spence & 
Platt, 1966) without ceiling effects. The 
lower value of average intensity was pri- 
marily based on Group HL, such that the 
unpaired puff was perceived on all trials. 
The upper bound on average UCS intensity 
was chiefly determined by the LH condi- 
tion, such that the unpaired value for Group 
LH did not reach a painful level. 

Procedure.—After being instructed to re- 
lax, to fixate on the disc, and not attempt 
to control the responses of her eyes, S was 
given three CS-alone trials and one UCS- 
alone trial. The Ss were discarded for two 
or more responses to the CS alone. All Ss 
were given 200 trials: 100 conditioning (CS- 
UCS) trials and 100 UCS-alone trials. The 
order of paired and unpaired trials was re- 
stricted such that each occurred twice in a 
block of four trials. Trials were separated 
by an interval of 10, 13, or 16 sec., defined 
as the interval between successive UCS pre- 
sentations. On a conditioning trial, the 
ready signal preceded the CS by 2, 3, 
or 4 sec. Trial interval and ready signal 
lead were randomly varied over the three 
values given for each. 

The data were scored using the latency 
criterion for a CR (Spence & Ross, 1959). 
Any eyelid deflection of 1 mm. or more dur- 
ing the interval 200-500 msec. after CS 
onset defined a CR. Responses in the inter- 
val 200-300 msec. after CS onset defined 
voluntary responses, but were included as 
CR’s unless S made more than 50% such 


TABLE 1 
EXPERIMENTAL DESIGN 


Paired UCS Intensity (H) 
Average UCS 
Intensity (D) m m ili 
Group HH Group LH Groups HH & LH 
High 2.5 paired 1.0 pairec AE paired y 
3.5 unpaired 5.0 unpaired .25 unpaire 
3.0 average 3.0 average 3.00 average 
Group HL Group LL Groups HL & LL 
Low 2.5 paired 1.0 paired 1.75 paired 
x 0.5 unpaired 2.0 unpaired 1.25 unpaired 
1.5 average 1.5 average 1.50 average 
Groups HH & HL Groups LH & LL 
Mean 2.50 paired 1.00 paired d 
2.00 unpaired 3.50 unpaire 
2.25 average 2.25 average 


Note.— The first letter of 


the group designation refers to level of habit, the second to level of D. N = 25 
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responses, in which case the S was dis- 
carded as a voluntary responder. 


RESULTS AND Discussion 


Three indexes of conditioning were 
obtained: percentage CR’s (Rp) in- 
cluding all conditioned and voluntary 
responses, Rp omitting voluntary re- 
sponses, and excitatory potential (E), 
using the procedure suggested by 
Spence (1956, p. 104). Separate anal- 
yses for each of the three measures 
yielded virtually identical results, con- 
sequently, only the combined-R, and E 
measures are presented here. 

Acquisition curves for the four 
groups appear in Fig. 1A. It can be 
seen that R, increased systematically 
across blocks of 10 CS-UCS trials for 
all groups. Although Group HH ap- 
proached 90% CR after 70 condition- 
ing trials, only four Ss of this group 
responded at 100% CR over the final 
30 trials ; thus ceiling effects, important 
to the analysis of H x D, were at a 
minimum, 

The main effects of paired UCS in- 
tensity (H), F (1, 96) = 94.85, aver- 
age UCS intensity (D), F (1, 96) = 
39.19, and blocks of 20 conditioning 
trials, F (4, 384) = 73.76, were all 
highly reliable, p < .001, using the R, 
measure. The corresponding F ratios 
for E were 88.58, 38.08, and 86.26, 
all yielding p < .001. The interaction 
between paired and average UCS in- 
tensities did not approach significance 
for either R,, F (1, 96) — 1.01, or 
E, F (1, 96) = 1.02, both p > .20. In 
fact, as is apparent from Fig. 1A, the 
H difference was slightly larger at low 


The interaction between H and D 
over the final 20 conditioning trials is 
presented in Fig. 1B. Given the H x 
D assumption, these curves should have 
diverged, reflecting a larger H effect 
at high D. A test of the Paired x 
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PERCENT CR 


C 


BLOCKS OF 10 TRIALS 


Fic. 1. Results for three predictions 
tested: (A) Paired X Average UCS Inten- 
sity interaction taken over trials, (B) 
Paired X Average UCS Intensity interaction 
over trials 81-100, and (C) Average UCS 
Intensity X Trials interaction. (In section C, 
the scale for the solid lines is given along the 
left-hand axis; for the broken line, the scale 
appears along the right-hand axis.) 


Average UCS intensity interaction over 
the final 20 trials showed no evidence 
for the predicted divergence, F (1, 96) 
= .22 and .01 for R, and E, respec- 
tively, both p >.20. Again, the H 
effect was slightly greater at low D. 
The relationship between average 
puff intensity and trials appears in 
Fig. 1C. Given H x D, performance 
curves should have diverged over trials. 
Although the interaction between trials 
and average puff intensity was reliable, 
F (4, 384) = 2.76 and 2.95 for R, 
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and E, respectively, both p < .05, its 
form was not that specified by the H x 
D postulate. The broken line cor- 
responding to the difference between 
high and low D indicated that the 
initial divergence of the groups was fol- 
lowed by partial convergence over the 
final 50 trials, a tendency which Pro- 
kasy (1967) noted in several of the 
studies he reviewed, Following Pro- 
kasy, a rank-order correlation (rho) 
between trial blocks and the HH-HL 
performance difference yielded a rho 
of .37 for E and .18 for R,, both 
p» 20. Similarly, the rho between 
trials and the LH-LL difference was 
.02 for E and .25 for Ry, both p > .20. 

It is clear from the present results, 
as well as from previous data evaluated 
by Prokasy, that there is little evidence 
to support an E — H x D postulate in 
classical defense conditioning. There 
are at least two avenues available for 
revising the H x D postulate of Hull- 
Spence theory in line with extant data. 
It is possible to alter the postulated 
relationship between the constructs of 
the theory (H, D, etc.) and inde- 
pendent variables (UCS intensity, 
trials), or it may be desirable to alter 
the postulated relationship between the 
theoretical constructs themselves (eg., 
E = H + D rather than E = H X D). 

The H x D postulate could account 
for the present data if it is assumed that 
D decreases over trials, possibly 
through adaptation, and that this de- 
crease is greater at higher levels of D. 
Assuming a linear relationship between 
amplitude of the UCR and D, the 
course of this decrement should be re- 
flected in a decrease in UCR amplitude 
over trials and a greater decrease in the 
high D groups (Taylor, 1956). How- 
ever, a sampling of the first and last 
10 trials showed no evidence of 
greater UCR adaptation by the high D 
Ss. Of course, it remains to be seen 


153 


whether UCR amplitude is the appro- 
priate index of D, or whether some con- 
comitant measure of D, such as GSR 
activity, may be needed to adequately 
specify drive. 

At this point, it is appropriate to 
note that D may not be determined by 
a simple average of paired and unpaired 
puff intensities. Possibly, paired in- 
tensities exert greater influence on per- 
formance than do unpaired puffs, and 
as a result our averaging technique 
may not have equated D between 
groups. The design of this experiment, 
with different unpaired puffs for each 
group, does not permit an answer to 
this question. 

Turning now to the second method 
of modifying the theory, the decreasing 
difference between high and low drive 
groups late in training, noted in Pro- 
kasy’s review and apparent in Fig. 1C, 
suggests that some form of a stimulus 
sampling theory of D (Estes, 1958) 
may be applicable to the data of class- 
ical defense conditioning. Estes’ satia- 
tion and deprivation cues may possibly 
be made equivalent to adaptation and 
emotionality cues arising from the UCS 
in classical defense conditioning. 

Finally, it is clear from the present 
results and those discussed by Prokasy, 
that the theory as postulated by Spence 
is inadequate to handle the available 
data. Modifications to the theory 
should rest on further research pointed 
toward a more precise specification of 
the relationship between the constructs 
of the theory and relevant independent 
variables. 


REFERENCES 


Estes, W. K. Stimulus-response theory of 
drive. In M. R. Jones (Ed.), Nebraska 
symposium on motivation. Lincoln: Uni- 
versity of Nebraska Press, 1958. 

Proxasy, W. F. Do D and H multiply to 
determine performance in human condition- 
ing? Psychol. Bull. 1967, 67, 368-377. 


154 JAMES J. HUG AND JOHN J. PORTER 


Spence, K. W. Behavior theory and con- Spence, K. W., & Pratt, J. R. UCS inten- 


ditioning. New Haven: Yale University sity and performance in eyelid condition- 
Press, 1956. ing. Psychol. Bull., 1966, 65, 1-10. 

Spence, K. W. A theory of emotionally Spence, K. W., & Ross, L. E. A methodo- 
based drive (D) and its relation to per- logical study of the form and latency of 
formance in simple learning situations. the eyelid response in conditioning. J. 
Amer. Psychologist, 1958, 13, 131-141. exp. Psychol., 1959, 58, 376-381. 


TAYLOR, J. A. Level of conditioning and 

CL E UCS intensity and tbe associative UNS of the adaptating stimulus. J. 
(habit) strength of the eyelid CR. J. exp. — ^^! Psychol, 1956, 51, 127-131. 
Psychol., 1958, 55, 404-411. (Received May 26, 1967) 


(Continued from page 134) 


Continuing and Reversing the Direction of Responding Movements—Some Exceptions to 
the So-called “Psychological Refractory Period": John Brebner*: Department of 
Psychology, University of Dundee, Dundee, United Kingdom. 

Activation, Manifest Anxiety, and Verbal Learning: Robert E. Thayer* and Shelia J. 
e Department of Psychology, California State College at Long Beach, California 

804. 

Can the Superior Learnability of Meaningful and Pleasant Words be Transferred to Non- 
sense Syllables?: Albert Silverstein* and Richard A. Dienstbier: Department of 
Psychology, University of Rhode Island, Kingston, Rhode Island 02881. 

Recognition and Free Recall of Organized Lists: Walter Kintsch*: Department of 
Psychology, Stanford University, Stanford, California. 

An Investigation of Grammatical Class as an Encoding Class in Short-Term Memory: 
Delos D. Wickens*, Sandra E. Clark, Frances A. Hill and Roy P. Wittlinger: 
Department of Psychology, Ohio State University, 1945 North High Street, Columbus, 
Ohio 43210. 

Osgood Dimensions as an Encoding Class in Short-Term Memory: Delos D. Wickens* 
and Sandra E. Clark: Department of Psychology, Ohio State University, 1945 High 
Street, Columbus, Ohio 43210. 

Inference about a Nonstationary Process: James O. Chinnis, Jr. and Cameron R. Peter- 
son*: Engineering Psychology Laboratory, University of Michigan, P.O. Box 618, 
Ann Arbor, Michigan 48107, 

A Procedure for Reducing Orienting Reactions in GSR Conditioning: Kenneth R. 
Burstein and Seymour Epstein*: Department of Psychology, University of Massa- 
chusetts, Amherst, Massachusetts 01003. 

Reward Schedule Effects Following Severely Limited Acquisition Training: E. J. Capaldi* 
A. T. Lanier, and R. C. Goodbout: Department of Psychology, Mezes Hall 211, Uni- 
versity of Texas, Austin, Texas 78712. 

Repetition Effects in Immediate Memory When There Are No Repeated Elements in the 
Stimuli: Robert Crowder*: Department of Psychology, Yale University, 333 Cedar 
Street, New Haven, Connecticut 06510. 

Readaptation and Decay after Exposure to Optical Tilt: Sheldon M. Ebenholtz: Depart- 
aoe ee Psychology, University of Wisconsin, Charter at Johnson, Madison, Wiscon- 
sin . 

Intuitive Inference about Normally Distributed Populations: Wesley M. DuCharme* 
and Cameron R. Peterson: Engineering Psychology Laboratory, Institute of Science 
and Technology, University of Michigan, P.O. Box 618, Ann Arbor, Michigan 48107. 

Task Difficulty and the Frustration Effect: Ronald R. Schmeck and James L. Brunning*: 
Department of Psychology, Ohio University, Athens, Ohio 45701. 


* Asterisk indicates author for whom address is given. 


Journal of Experimental Psychology 
1968, Vol. 7). No. 1, 155-158 


PROCESSING OF VISUAL FEEDBACK 
IN RAPID MOVEMENTS * 


STEVEN W. KEELE Ax» MICHAEL I. POSNER 


University of Oregon 


The present study determined the minimum amount of time necessary 
to process visual feedback from a movement. The Ss rapidly moved a 


stylus from a home position to a 


target. On half the trials all lights 


turned off at the start of the movement so that they were made in the 
dark. Visual feedback did not facilitate accuracy in hitting the target 
when the movement was as short as 190 msec. For durations of 260 
msec. or longer, having the lights on facilitated accuracy, suggesting 
that it takes 190-260 msec. to process the feedback. 


Skilled motor performance typically 
involves a series of accurate move- 
ments, If a movement is made quite 
slowly, visual feedback may be used in 
guiding it to an accurate termination. 
However, visual feedback may not in- 
fluence the accuracy of very rapid 
movements. Numerous studies (Wood- 
worth & Schlosberg, 1954) have shown 
that visual choice reaction time is of 
the order of } sec. or longer. There- 
fore, it would be expected to take } sec. 
or longer to process visual error in- 
formation. 

Woodworth (1899) and Vince (1949) 
attempted to determine the processing 
time for visual feedback. They showed 
that accuracy was no better with the 
eyes open than with the eyes closed 
when movement rates were between 
100 and 180 strokes per minute. They 
both used repetitive movements which 
include time spent in reversing direc- 
tion. Thus, the stroke rate at which 
accuracy is no better with visual feed- 
back than without overestimates the 
processing time. Although both Wood- 
worth and Vince had additional experi- 
ments with interpolated time intervals 
between successive movements, they 
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were not designed to estimate visual 
processing time. 

The present study compared discrete 
movements with and without vision. 
The Ss rested a stylus on a home posi- 
tion. With the onset of a light, they 
rapidly moved the stylus from the home 
position to a small target. On half 
the trials all lights turned off as the 
home position was left and turned on 
again when the movement was com- 
pleted. On those trials, therefore, the 
movement was performed in the dark. 
Visual processing time is estimated by 
the shortest movement time at which 
hitting the target is facilitated by 
having the lights on. There are two 
advantages of the present procedure. 
(a) The estimate of processing time is 
not biased by the time to reverse move- 
ments, because movement time is deter- 
mined from the time the home position 
is left until the stylus hits either the 
target or a surrounding metal plate. 
(b) The S does not know in advance 
whether or not the light will turn off, 
insuring that his premovement strategy 
is the same under light on and light off 
conditions. 


METHOD 


Subjects —Eight undergraduate and grad- 
uate male students from the University o 
Oregon were paid $3.00 for two 1-hr. ses 
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sions. The Ss had not been in any previous 
movement experiments. 

Apparatus—A  i-in. diameter contact 
(home position) was mounted at the center 
of 16-in. X 16-in. metal plate, and the plate 
was mounted in the horizontal plane on a 
table. There were two i-in. diameter target 
contacts 6-in. from the home position. One 
target was mounted on each side of the home 
position on the 45°-225° axis. A Yy-in. 
rim of translucent plastic surrounded each 
target, and when the plastic was illuminated 
from below, a circle of light surrounded the 
target. 

Movement times were measured in hun- 
dredths of a second from the time a stylus 
left the home position until the stylus touched 
either the target or the metal plate. If the 
plate was touched before the target, a red 
light came on informing S of a miss. 

In all cases the target light turned off 
immediately upon leaving the home position. 
In addition there was a white feedback light 
positioned 8-in. above the back edge of the 
metal plate that illuminated the entire plate 
(the target lights and feedback light were 
GE No. 1819 24V. lamps with approximately 
a 10-msec. decay time to 10% of the initial 
illumination). The feedback light could be 
programmed to turn off immediately on leav- 
ing the home position. If the feedback light 
turned off when the home position was left, 
the entire room remained dark until either 
the target or the metal plate was touched 
with the stylus. 

A Western Union tape transmitter pro- 
grammed the order of target lights and 
whether the feedback light stayed on or 
turned off. A 4-sec. interval occurred be- 
tween successive targets. Eight different 
sequences of 24 targets were randomly con- 
structed with the restriction that each of the 
combinations of the two targets with the 
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feedback light on or off occurred an equal 
number of times in each sequence. 

Procedure.—On the first day Ss were given 
two practice sequences on which movement 
speeds were self-determined. This was fol- 
lowed by three sequences at each of four 
pacing conditions. The first of the three 
sequences in each condition was practice. 
The Ss attempted to produce movements of 
duration .15, .25, .35, or .45 sec. Pacing was 
facilitated by reporting the movement time 
after each movement. The order of the 
conditions was counterbalanced over the eight 
Ss by two 4X4 Latin squares. On the 
second day each S received the same con- 
ditions but in a reverse order from the first 
day. Again, the first of the three sequences 
in each condition was for practice. 

The Ss were instructed to make their 
movements as accurately as possible but at 
the same time try to keep the movement 
duration within plus or minus 10% of the 
intended time. In addition they were asked 
to try to keep durations constant whether 
the feedback light stayed on or turned off, 
because the primary interest was in com- 
paring movement accuracies. 


RESULTS 


The mean time per movement and 
the proportion of missed targets are 
shown in Table 1. In addition, stan- 
dard deviations of movement times 
were determined for each S, and the 
means of those scores are shown in the 
table. The standard deviations are 
relatively small, indicating that Ss were 
quite consistent in movement durations. 

Mean moyement times at the fastest 
pace were slower than the indicated 


TABLE 1 


PROPORTION OF TARGET Misses, MEAN-MOVEMENT TIME IN SECONDS, AND THE 
SD or MovEMENT TIMES 


Intended Movement Time in Seconds 


Light 15 225 35 ESI 
On Off On of On Off On Off 
Misses .68 .69 AT .58 28 ep AS AT 
Time .190 185 267 254 357 -338 Add 424 
SD .028 .020 .039 .036 .058 .050 .061 .057 
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time, but at other paces they were rela- 
tively close to the intended time. The 
times were slightly longer when the 
feedback light remained on than when 
it was turned off. An analysis of vari- 
ance showed the difference in move- 
ment times at different paces to be 
highly reliable, F (3, 21) — 312.0, 
p < 001, and the difference between 
feedback conditions to be significant, 
F (1, 7) 2 10.1, p < .025. Although 
the interaction between the variables 
was not significant, F (3, 21) = 1.19, 
individual ¢ tests (7 df) showed a re- 
liable difference in feedback conditions 
at the .25- and .35-sec. movement dura- 
tions (p < .05) but not at the other 
two paces. 

At the fastest movement pace there 
was essentially no difference in propor- 
tion of target misses with and without 
visual feedback. As movement time 
increased, accuracy increased under 
both feedback conditions. The increase 
was greater with visual feedback so that 
at movement times of .25, .35, and .45 
sec. the proportion of misses was much 
less than when the feedback light was 
turned off. An amalysis of variance 
showed the effects of movement time, 
F (3, 21) = 537, feedback, F (1, 7) 
= 552, and the interaction, F (3, 
21) = 10.2, to be significant at the 001 
level of confidence. Normal approxi- 
mations of the binomial distribution 
showed significant differences in feed- 
back conditions at the .25-, .35-, and 
A5-movement times (p < .001 in each 
case). 


Discussion 


The Ss in the present experiment took 
approximately 190 msec/movement when 
instructed to move for a duration of 150 
msec. At that speed there was essen- 
tially no difference in accuracy between 
light-on and light-off conditions. With 
the 250-msec. instruction, movement times 
averaged about 260 msec. At that rate 
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there was significantly greater accuracy 
in hitting the target when visual feedback 
was present. Thus, the minimum dura- 
tion for processing visual feedback from 
a movement appears to be between 190 
and 260 msec. This is about half the 
minimum duration suggested by Wood- 
worth (1899) and Vince (1949). The 
difference is probably due to their use of 
repetitive movements which include the 
time for reversal The present study 
eliminated the bias by using discrete 
movements with a 4-sec. pause between 
successive responses. 

In a recent experiment by Pew (1966), 
Ss attempted to maintain a target in the 
center of an oscilloscope by sequentially 
pressing two keys, one of which caused 
target acceleration to the right and the 
other to the left. If the oscilloscope dis- 
play was blanked out for periods up to 
410 msec. after a response, the modal 
time before the next corrective response 
was 300-350 msec. after the end of blank- 
ing. Since some of the corrective re- 
sponses required less than the modal time, 
Pew's data are consistent with the present 
data that indicate a minimum of 190-260 
msec. for processing visual error in- 
formation. 

A second major result of the present 
study is that accuracy increased as move- 
ment time increased even when the feed- 
back light was off, though not as much 
as when it remained on. One hypothesis 
for the speed-accuracy trade-off in the ab- 
sence of visual feedback is that at slower 
speeds movement corrections are made on 
the basis of kinesthetic spatial informa- 
tion. Chernikoff and Taylor (1952) and 
Gibbs (1965) found evidence for kines- 
thetic reaction time as short as .11—.13 sec. 
This is on the order of 50 msec. shorter 
than simple visual reaction time (Wood- 
worth & Schlosberg, 1954). If correc- 
tions are made on the basis of kinesthetic 
feedback, movements of too short a dura- 
tion for visual control could still be under 
kinesthetic control. 

A second hypothesis for the speed-ac- 
curacy trade-off in the light-off condition 
is that the actual motor commands issued 
to the muscles are compared to the in- 
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tended motor commands and adjustments 
are made on the basis of discrepancies. 
Alternatively, the motor command for a 
slow movement may be more detailed and 
accurate than for a fast movement. Ac- 
cording to these hypotheses, rapid move- 
ments in the absence of vision are not 
under kinesthetic contro], Festinger and 
Canon (1965) and Howard and Temple- 
ton (1966) present considerable evidence 
that motor commands play a large role in 
movement control. 'The present data do 
not discriminate between the hypotheses. 
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NONREINFORCED RESPONDING AS A FUNCTION OF THE DIRECTION 


OF A PRIOR ORDERED INCENTIVE SHIFT: 
IL A REPLICATION WITH FIXED-INTERVAL REINFORCEMENT SCHEDULE 


MELVIN H. MARX 


University of Missouri, Columbia 


14 adult albino rats were given 5 wk. of either regularly increasing or 
regularly decreasing concentrations of sucrose on a fixed-interval 
schedule of reinforcement, followed each day by nonreinforced bar- 
press tests. A reliable interaction between reinforcement-nonrein- 
forcement and concentration order confirmed previous results with 
continuous reinforcement. Moreover, by the 4th week an increasing 
number of bar presses occurred over reinforced trials for the group 
with increasing concentrations, and a decreasing number occurred 
for the group with decreasing concentrations, thus indicating the 


importance of the order variable, 


These results are interpreted as 


offering further support for a motivational account of extinction. 


A recent experimental report (Marx, Tom- 
baugh, Cole, & Dougherty, 1963) provided 
evidence that motivational level, as manip- 
ulated by varying percentages of sucrose re- 
inforcement, can be a more potent deter- 
minant of persistance of nonreinforced per- 
formance than terminal level of reinforced 
performance. This result is of special theo- 
retical interest because of the fact that the 
terminal level of reinforced responding has 
been so often shown to be a most powerful 
contributor to level of nonreinforced perform- 
ance in extinction tests (Pubols, 1960). 
The previous data were obtained under con- 
ditions of continuous reinforcement (30-sec. 
reinforced and nonreinforced bar-press ses- 
sions with 120-sec. intertrial intervals) which 
maximized differences in the terminal level of 
reinforced responding in a direction opposite 
to the observed level of nonreinforced per- 
formance. 

The purpose of the present experiment was 
to test this same proposition with a fixed- 
interval schedule of reinforcement. A major 
advantage of using this schedule is that it 
Permits control of number of reinforcements. 

Method—The Ss were 14 adult male 
Sprague-Dawley albino rats, naive with re- 
spect to sucrose reinforcers and bar pressing. 
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the National Institutes of Health, and by a Public 
Health Service research career program award, 5-K6- 
ME-22,023, from the National Institute of Mental 
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Eight operant-conditioning boxes, pre- 
viously described in detail (Collier & Marx, 
1959), were used. 

The 14 rats were randomly assigned to two 
groups of seven Ss each. Group I, in- 
creasing order, received successive reinforce- 
ments of 4, 8, 16, 32, and 6496 sucrose. 
Group D, decreasing order, received succes- 
sive reinforcements of 64, 32, 16, 8, and 4% 
sucrose. It was predicted that, as in the 
previous experiment with CRF schedule, 
Group I would respond at a lower reinforced 
but a higher nonreinforced rate. 

Five weeks of experimental training, six 
days per week, were administered to all Ss. 
Each day's training consisted of a 10-min. 
bar-press session on FI 1, Sucrose rein- 
forcements were provided for the first five 
reinforcements, followed by five empty-cup 
presentations. One-minute bar-press counts 
were recorded. The Ss were fed for 1 hr. 
after each experimental session, and at the 
same time of day on the seventh (rest) day 
of each week. 

Results and discussion.—The relationship 
earlier obtained under continuous reinforce- 
ment was confirmed, with Group I showing 
less reinforced but more nonreinforced re- 
sponding by the third week. The inter- 
action between reinforced-nonreinforced 
trials and the treatment variable (I-D) was 
statistically reliable: F (1, 48) = 4.54, p< 
.05. 

Weekly means for the 10 1-min. counts 
are shown in the four panels of Fig. 1. 
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(The curves for Week 5 are not shown 
since they are very similar to those for 
Week 4.) Here the development of the 
interaction is indicated. The two groups 
apparently were differentiated with respect 
to nonreinforced bar-press responses on the 
second week and with respect to reinforced 
responses on the third week. The inter- 
action tested for the entire experimental 
period was most closely approximated in the 
third week. 

The development during Week 4 of an 
increasing function for the reinforced trials 
of Group I and a decreasing function for the 
reinforced trials of Group D is also of inter- 
est. Group I Ss showed an increment in 
response rate, up to the 1696 sucrose rein- 
forcement; Minute X Order interaction, 
F (4, 48) —584, p « 01. This fact sug- 
gests a motivational effect of the order of 
incentive presentation. Correspondingly, the 
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even more regular decrement for the Group 
D Ss suggests a regular motivational decre- 
ment as a consequence of the decreasing 
order of incentive presentation. 

The general implication of these results 
is that they serve to support the motivational 
interpretation of extinction recently reviewed 
(Marx, 1966). 

The present results are of more specific 
interest in two major ways: (a) they extend 
the generality of the previous similar finding 
with continuous reinforcement, and (b) 
they offer a more direct measure of the 
motivational shifts that presumably occur 
during reinforced trials. 

The latter point follows from the fact 
that, with number of reinforcements con- 
trolled, rate of bar pressing under the FI 
Schedule may be considered a relatively ac- 
curate index of motivational value of the 
incentives offered. The factor of satiation, 
while not entirely eliminated, is at least 
greatly minimized, especially when compared 
with its action in continuous reinforcement, 
which was used previously. It is especially 
interesting that reinforced bar-pressing rate 
during the last two weeks of the experiment 
followed so closely the increasing or decreas- 
ing order of incentives (except for the 32 
and 64% incentives in Group I). This fact 
supports an interpretation of the differential 
nonreinforced response rates in terms of 
order, rather than merely terminal value 
(cf. Tombaugh & Marx, 1965), of incentives 
during reinforcement. 
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ORDINAL POSITION EFFECTS WITH A TWO-DIMENSIONAL 
STIMULUS ARRAY' 


ROBERT K. YOUNG anp RICHARD E. BUCK 
University of Texas, Austin 


2 groups of 32 Ss each learned a 16-pair list in which the stimuli 
consisted of 16 individually illuminated lights arranged in a 4X4 
square and the responses consisted of 16 adjectives each uniquely 
associated with a stimulus light. 1 group of Ss learned a list in which 
the stimuli consisted of 16 clear lights. A 2nd group of Ss learned 
a list in which the stimuli consisted of 15 clear lights and 1 red light. 
Ordinal position effects similar to those found in serial learning were 
found in those pairs with stimuli in the top and bottom rows while 
similar effects were not found in the learning of those pairs with 


stimuli in the middle 2 rows. 


Although the ordinal position effect 
(OPE) has long been known in serial 
learning as the bowed serial position curve, 
the OPE has also been found recently in a 
variety of learning situations not involving 
the serial presentation of items. Ebenholtz 
(1966) for example, found the OPE in 
paired-associate learning when the stimuli 
were numbers varying from 1 to 10. That 
is, pairs with stimulus numbers 1, 2, 9, 
and 10 were easier to learn than pairs with 
stimuli varying from 3 to 8. 

The present study was designed to investi- 
gate the generality of the OPE phenomenon 
in learning. Since the OPE is found when 
stimuli vary along a single dimension, the 
present study was designed to determine 
if similar effects are found when stimuli are 
varied along two dimensions. In addition, 
an attempt was made to determine if the 
von Restorff effect of isolation commonly ob- 
served in serial (and paired-associate) learn- 
ing could be produced in a two-dimensional 
learning situation. 

Method.—Each of the 64 Ss learned a 
single list of 16 pairs in which each stimulus 
consisted of a single illuminated light ina 
group of 16 arranged in a 4X 4 square and 
in which the responses consisted of two- 
syllable adjectives. Two levels of response 
similarity (High or Low) were used and 
within each level of similarity two samples 
of adjectives were used. The responses 
were randomly paired with the light stimuli 
and two arrangements of the light-adjective 
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pairs were employed. In addition, all lights 
for the Control group were clear as were 
those for the Experimental group with the 
exception of a single red light in the third 
row from the top and third column from 
the left, Finally, two Es were employed. 
Thus the design of the experiment can be 
thought of as a 2X2X2X2X2 factorial 
(N —2 per cell). 

During learning the order of pair presenta- 
tion was random. The stimulus light was 
presented (ie. lighted) for a 2-sec. interval 
and then both the response (printed on a 
white card) and the stimulus light were 
presented together also for a 2-sec. interval. 
A 2-sec. interpair interval was used, and 
an 8-sec. intertrial interval allowed E time 
to vary the order of presentation from trial 
to trial. A stopwatch was used for all 
timing. The anticipation method was em- 
ployed and the criterion of learning was 
one errorless recitation. 

The S was seated in front of a 20-in. X 
24-in. midgray screen upon which was 
mounted 16 panel lights arranged in a 4x4 
square 54 in. on a side. The square was 
horizontally centered with the top row 1 in, 
from the top of the screen. Eight inches 
from the bottom of the screen also hori- 
zontally centered was a 34 in, X14 in. 
rectangular slot in which the responses, typed 
on white cards (also 3} in. X 14 in.) were 
manually presented and retrieved by E from 
the back of the screen. The lights consisted 
of 6-v. jeweled panel lights and were clear 
except for the single red light in the Experi- 
mental group. 

The adjectives were taken from Under- 
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wood and Goad (1951). The items in the 
high similarity samples consisted of 16 
synonyms such as ELATED, GLEEFUL, JOLLY, 
NAPPY, LAUGHING, etc, while the samples 
of items with low similarity were as dis- 
similar as careful inspection would allow. 
All Ss were taken from introductory psy- 
chology classes and were naive to verbal 


of errors by E and each was replaced by the 
next S to appear in the laboratory. 
Results and discussion—In the analysis 
of trials to learn, only similarity was found 
an effective variable with F (1, 32) 
0S. The mean trials to learn 
Low similarity lists were 
, respectively. The difference 
Experimental an! Control groups 
approach significance with F < 1.00. 
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which accounts for the quadratic 

trend interaction between rows and columns. 

Consistent with other studies (cf. New- 
the i 


54, respectivel igh! 
significant with F (1, 95, b 
Lord (1, 32) = 57.95, P <,001 


vs. pair 
comparison The 
difference between the red-light pair. and 
its control was reflected in the trend analysis 
quadratic trend and the x quad- 
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Fro, 1. Mean correct per trial for the 16 palm 
of the Control group and the red light palr of the 
Experimental group as a function of stimulus 


P «05, respectively. The Group X Row 
quadratic trend did not approach significance - 
with F « 1.00. 

The control group trends were analyzed. 
independently of the Experimental 
and essentially the same results were ob- 
tained as in the overall analysis. The Row, 
Column, and Row X Column interaction | 
quadratic trends were all significant, with 
F (1, 16) 21421, 15.78, and 7.29, p « 05, 
respectively. 

Although this last analysis is not orthog- 
onal to the other analysis made in the 
Control group, the mean correct per trial 
for the four corner pairs (the pairs with 
Lights 1 and 4 in Rows 1 and 4 as stimuli) 
was compared to the mean correct per trial 
for the other 12 pairs. These means were - 
69 and .55, respectively, and the difference - 
is highly significant, F (1, 16) 3483, p< 
01, It is apparent that a large portion of 
the variance accounted for by the significant 
Row, Column, and Row X Column quadratic 
trends in previous analyses was related to 
the ease of learning of the four corner pairs. 
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OPE along both dimensions then the trends 
observed in the present experiment are some- 
what more complex than those which would 
be expected by Ebenholtz. It is, of course, 
not clear how S perceives the present 

ulus array. However, if the Ebenholtz 
proposition is to be extended beyond a single 
dimension, to be consistent with the present 
results it must be assumed that $ does not 
perceive the array as a two dimensional form. 
As an alternative consistent with the data, 
S might see the 4 corner lights as the outer 
portion of a single stimulus dimension and 
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the other 12 lights as the inner portion of 
the same dimension. 


EFFECT OF A NOVEL STIMULUS DIMENSION 
ON DISCRIMINATION LEARNING * 


CHARLES L. RICHMAN awp JOHN TRINDER 
University of Cincinnati 


shift training. 


Mediational theories differ in how they 
explain why school-aged and adults 
perform a reversal shift (RS) more 
than an extradimensional shift (EDS). 
Kendler and Kendler (1962) emphasized 
the verbal nature of the mediational process. 


PR but pes EDS; resul from pre- 
a t not an d * ^ 


experience with 
stimulus dimensions. It is therefore of inter- 
the effect 
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so as to be low in discriminability and dif- 
ficult to categorize. Furthermore, they 
possessed no characteristics that would pre- 
dispose either of the two responses (“soft” 
and "strong") to be associated with them 
in the learning task, "There was no initial 
response bias associated with responding 
on Trial I to any of the stimuli, X* (7) — 
10.55, p.10. The figures consisted of 
geometrical relationships such as acute and 
obtuse angles, circles, triangles, and filled in 
spaces randomly associated in each figure. 

To make the categorizing task difficult, 
the four stimuli associated with the particular 
response did not have obvious (to E) di- 
mensions in common. In order to maintain 
this spread of dimensions and yet use the 
same eight stimuli for each S, the subset of 
figures associated with a particular response 
were identical for each S$ in original learn- 
ing, RS, and HR-shift. 

The Ss' task was to learn an association 
between four of the stimuli and the word 
"soft" and the other four stimuli and 
the word "strong." A trial consisted of the 
presentation of the eight stimuli with S re- 
sponding to each figure with the words 
"strong" or "soft" On each trial the order 
of the stimuli was randomized. The dura- 
tion of each figure was 5.0 sec. with an 
interstimulus interval (ISI) of 600 msec. 
and an intertrial interval (ITI) of 20.0 sec. 
The following instructions were read to 
each S. 


"You are a S in a learning experi- 
ment. Your task is to learn the associa- 
tion between the figures flashed on the 
wall in front of you and the word 
‘strong’ or ‘soft.’ That is, when a figure 
is flashed on the wall, you will imme- 
diately say the word ‘strong’ or ‘soft,’ 
at which time I will say ‘right’ or 
‘wrong,’ indicating that you have made 
the correct or incorrect association. 
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Remember, as soon as a figure is flashed 
say the word ‘strong’ or ‘soft’ If 
there are no questions, we will begin.” 


The experiment consisted of a 2X4 fac- 
torial design. The two main treatments 
were defined by the type of shift involved, 
Half of the Ss were assigned to a RS group 
while the other half to HR-shift group. RS 
involved the reversal of all associations. 
That is, all figures originally associated 
with “soft” were reversed to “strong” and 
vice-versa. A HR-shift was one in which 
half of the stimuli associated with a par- 
ticular response were reversed to the other 
response while the other half retained the 
same association (two of the “strong” 
changed to “soft” and two of the “soft” to 
"strong"). 

Within each main treatment 12 Ss were 
assigned to each of four groups distinguished 
in terms of the amount of training S received 
before the shift took place. The four groups 
were as follows: the criterion group was 
defined as reaching a criterion of one entirely 
correct trial; overtraining two (OT2) was 
criterion plus two trials; and overtraining 
four (OT4) was criterion plus four trials. 
The undertraining group (UT) was defined 
as four preshift trials (four trials were 
approximately half the average number of 
trials Ss required to achieve a completely 
correct trial for the other groups. This was 
not determined until the other groups had 
been run). It should be noted that there 
was a problem in determining the number 
of preshift trials; the moment of the shift 
cannot be determined in terms of the indi- 
vidual S's performance as in the other groups, 
but rather as a group function. 

Results.—Table 1 presents the mean num- 
ber of errors and trials to criterion in both 
original and shift conditions for all groups. 
Separate analyses of variance were per- 
formed on both the total errors and trials 


TABLE 1 


MEAN NUMBER OF TRIALS AND ERRORS TO CRITERION IN ORIGINAL LEARNING AND 
NUMBER OF TRIALS AND ERRORS TO SHIFT 


Original Training 


Shift Training 


Groups RS HR-shift RS HR-shift 
Errors Trials Errors Trials Errors Trials Errors Trials 
UT 15.1 4.0 iz 4.0 25.4 9.2 16.7 T4 
C 21.5 8.4 19.1 8.5 13.1 6.8 15.4 6.2 
OT2 21.2 9.2 22.5 9.1 7.6 4.0 15.4 6.3 
OT4 21.1 10.0 23.3 8.3 4.8 2.6 19.3 8.3 
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to criterion measures. Since the reversal 
data based on errors and trials did not meet 
the requirement of homogeniety of variance, 
the data in both cases were treated with a 
square root transformation, (Myers, 1966). 
The effect of original learning on the re- 
versal phase was significant, for errors, 
F (3, 88) —1249, »<.001 and trials, 
F (3, 88) 21410, p<.001. A Duncan 
multiple-range test for the RS groups indi- 
cated that for errors OT2 was significantly 
different from criterion, p < .01 (6, 60) and 
UT, p<.01 (7, 60). The OT4 group also 
had significantly fewer errors than the cri- 
terion group, p<.01 (7, 60) and the UT 
group, ? «.01 (8, 60). The trials measure 
gave identical significance, and are not re- 
ported. These results demonstrate that a 
RS is substantially enhanced by overtraining 
in original learning. Analysis of variance 
also indicated a statistically significant dif- 
ference between RS and HR-shift with the 
latter having more errors, F (1, 88) = 4.68, 
~<.05, yet a nonsignificant difference with 
regard to trials, F (1, 88) —.08. A signif- 
icant interaction was found for both the 
errors analysis, F (3, 88) = 14.32, p < .001 
and trials, F (3, 88) — 10.61, p<.001. The 
interaction demonstrated that as a function 
of the amount of preshift training the RS 
and HR-shift groups responded differentially, 
with the former decreasing errors and trials 
and the latter remaining relatively stable. 
With undertraining the RS group had a 
slower rate of learning when compared with 
the HR-shift group. A Duncan multiple- 
range indicated that this difference was sta- 
tistically significant for both errors $ < 05 
(4, 60), and trials, p <.05 (3, 60). With 
criterion training this difference was not 
statistically significant for either errors, 
p>.10 (5, 60) or trials, p>.10 (3, 60). 
It was also found that the RS, OT2 group 
had less errors and trials to shift than the 
OT2, HR-shift group and with even greater 
differences found between the OT4 groups. 
The differences were significant for groups 
OT2 and OT4 for both errors, p<.01 (3, 
60), p< .01 (6, 60) and trials, ~<.05 
(3, 60), p<.01 (7, 60), respectively. 
Discussion —This study demonstrates 
faster RS with increasing levels of over- 
training using a novel stimulus dimension. 
Kendler et al. (1967) did not find facilita- 
tion of a reversal with overtraining. A 
likely variable accounting for the difference 
is the use of verbal labels (“strong” and 


"soft") in the present study as opposed to 
motor responses (Kendler et al., 1967). The 
data indicate that an important considera- 
tion is the availability of the verbal label 
when one is not available either in the lan- 
guage system, or artificially; S does not 
necessarily produce his own. It is also 
suggested that a verbal response may pro- 
vide a more effective mediator than a motor 
response, as used by Kendler et al. (1967). 
This does not mean that motor responses 
using human Ss cannot give an ORE, or 
that a verbal mediator could not be produced. 

Other factors differed between the two 
experiments and should be considered. 
Kendler et al. (1967) used trigrams whereas 
this study used nonsense figures. This would 
be an important factor to the extent that 
either one of the types of stimuli is more 
likely to spontaneously produce mediating 
responses. Secondly, Kendler et al. (1967) 
used either l-sec. or 4-sec. ISI with each 
stimulus acting as one trial. - The present 
study involved an ISI of 600 msec. and an 
ITI of 20 sec, a trial being each of the 
eight figures presented once. The effect of 
the discrete trials may have been to fa- 
cilitate the grouping of the stimuli and 
thus the development of a mediator. 

The fact that with increasing preshift 
training speed of reversal shows a progres- 
sive increase supports the view that the 
verbal labels provide an appropriate media- 
tor for the stimuli. This conclusion is fur- 
ther suggested by the faster HR-shift as 
compared to a RS, with undertraining. 
Kendler et al. (1967) found, with criterion 
training no difference between the two 
groups and concluded that Ss were respond- 
ing in terms of both mediating responses 
and single unit bonds. The present data 
agree with this conclusion with regard to 
criterion training, but with less preshift 
training (UT), and presumably less oppor- 
tunity to develop the mediator, Ss respond 
primarily at a unit-bond level as indicated 
by the faster HR-shift. 
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STIMULUS SELECTION AND RETROACTIVE INHIBITION + 


NINA G. SCHNEIDER ? ANp JOHN P. HOUSTON 
University of California, Los Angeles 


This study was designed to demonstrate that, in a 2-list paired-asso- 
ciate situation, stimulus selection during 2nd-list learning can influence 


losses in Ist-list recall. 


6 conditions differed in terms of the nature 


of the 2nd-list stimuli and in terms of the 2nd-list learning instruc- 


tions, 


pressure was increased to atten: 


It was predicted that retroactive inhibition would increase as 
d to 2nd-list stimulus components 


which were identical to those stimuli utilized in Ist-list learning. 


This study was designed to test the hy- 
pothesis that, in a two-list paired-associate 
(PA) situation, stimulus selection during 
second-list learning can influence losses in 
first-list recall, ie, amount of retroactive 
inhibition, For example, assume different 
responses are learned in two successive tri- 
gram-adjective lists. In addition, assume 
that the second-list trigrams are surrounded 
by colors. If S selects a component of the 
second-list stimulus which is identical to the 
stimulus utilized in the learning of the first 
list (viz, the trigrams) then the situation 
corresponds to the A-B, A-C paradigm. 
Considerable unlearning and competition are 
expected to occur in this situation. On the 
other hand, if S utilizes a component of the 
second-list stimulus which is different from 
that used in first-list learning (viz, the 
colors) then the situation corresponds to 
the A-B, C-D paradigm and much less RI 
is expected. Partial attention to both com- 
potee might result in intermediate amounts 


All Ss learned two paired-associate (PA) 
lists and were then tested for immediate 
recall of the first-list material. The first 
list in all six conditions was composed of 
trigrams as stimuli and adjectives as re- 
sponses. The groups differed in terms of 
the nature of the second-list stimuli and in 
terms of the kind of second-list learning in- 
structions they received. The second-list 
responses in all conditions were composed of 
a set of adjectives different from those 
learned in the first list, In Cond. 1 the 
second-list stimuli were colors and Ss re- 
ceived mo special second-list instructions. 

* This was 
te gs ab EE 
Supported, in part, by a Public Health Service 
Grant (MH 11109-02) to the second author. 

? Now at the University of California, Berkeley. 


This situation corresponds to an A-B, C-D 
paradigm and was expected to produce 
maximal first-list recall, In Cond. 2-5 the 
second-list stimuli were compound stimuli 
composed of the first-list trigrams sur- 
rounded by different colors. Condition 2 Ss 
were advised to attend to the colors during 
second-list learning as, following learning, 
they would be asked to recall the responses 
when the colors alone were presented. Little 
RI was also predicted in this condition as 
selection of the colors would render the 
situation similar to the A-B, C-D paradigm, 
Condition 3 Ss were given no special in- 
structions concerning second-list learning. 
Here the notion was that some unlearning 
of the first-list associations and some com- 
petition at the time of recall might occur 
in connection with incomplete selection of the 
colors. In Cond. 4 Ss were told that they 
would be tested with both the colors alone 
and the trigrams alone following learning. 
A greater loss was expected in this condi- 
tion as the instructions force at least partial 
attention to the trigrams. Condition 5 Ss 
were instructed to attend to the trigrams, as 
they would constitute the ensuing test cues. 
In this case considerable RI was expected as 
the use of the trigrams render it an A-B, 
A-C situation. Finally, the trigrams alone 
served as stimuli in Cond. 6. First-list recall 
was expected to be poorest in this A-B, 
A-C paradigm. Thus the overall prediction 
was of decreasing first-list recall as one 
moved from Cond. 1 to Cond. 6. 
Method.—Sixty undergraduates enrolled 
in an introductory psychology course served 
as Ss. The six conditions were randomized 
10 times such that each occurred once in 
each succeeding block of six conditions. 
Assignment to this arrangement was made 
on the basis of S's appearance in the testing 
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situation. One set of eight trigrams and 
two sets of eight adjectives were used in 
the construction of the lists. These were 
the A-B, A-C materials used and described 
by Barnes and Underwood (1959), The 
following eight colors were used: 

black, green, red, yellow, purple, orange, 
and blue, Strips of these colors were drawn 
above and below the appropriate syllables 
with felt marking pencils. The stimulus 
and response materials were paired such 
that inter- and intralist similarity was as 
minimal as possible. In addition, the first 
letters of paired trigrams, words and colors 
were not the same. Five presentation orders 
were developed to avoid serial learning and 
each was used as the starting order equally 
frequently. All learning was at a 2:2-sec. 
presentation rate with a 4-sec. in i 

a l-min. interlist interval on a Stowe drum. 
First-list learning was taken to a criterion 
of one perfect trial while second-list learning 
was carried for 14 anticipation trials, Dur- 
ing the recall test, which began 1 min, after 
the completion of second-list learning, the 
trigram stimuli were each presented for 
2 sec, Two-sec, rest intervals intervened 
between the presentations of the stimuli. 
The responses were not exposed. The 
stimuli were presented in the order S would 
have received them had he had one more 
first-list learning trial. 

Results and discussion.—An analysis of 
variance of the numbers of trials to criterion 
(M =9.88) revealed no significant differ- 
ences among the first-list means, F (5, 54) 
—109. The mean numbers of correct 
second-list responses were 97.10, 89.20, 89.60, 
81.90, 72.60, and 74.70, for Cond. 1-6, re- 
spectively, F (5, 54) =3.35, p € 05. The 
pattern of these means seems to be related 
to the expected selection effects and, in fact, 
closely parallels the recall data reported be- 
low. The mean numbers of first-list re- 
sponses recalled correctly were 6.00, 6.10, 
6.00, 3.60, 2.50, and 4.00 for Cond. 1-6, 
F (5, 54) =9,73, p < .01. Subsequent New- 
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man-Keuls comparisons indicated that Cond. 
, 2, and 3 did not differ and that Cond. 4, 
6 did not differ, On the other hand, 
and 3 all differed significantly from 
, and 6, p<.05. In general, these data 
iggest that stimulus selection during second- 
String can influence losses in first-list 


as predicted. However, while the overall 
analysis was significant, the individual com- 
parisons failed to reveal the predicted smooth 
increase in recall from Cond. 1-Cond. 6. 
Instead, the conditions appeared to bunch 
into two distinct groups. This may have 
been due to the fact that in Cond. 1, 2, 


insuring some, but apparently not differential, 
use of the trigrams. 
The relative contributions of unlearning 


pre: 
these factors, Although 


, 1989, 68, 97-105. 
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SELECTIVE REINFORCEMENT OF RESPONSE SPEEDS IN CHILDREN: 


ROBERT B. CAIRNS anp STEWART PROCTOR E 


Indiana University 


In this investigation of the immediate determinants of response latency 
in children, 48 children were differentially reinforced for speed in 
performing a button-press response. 


Studies of response differentiation indi- 
cate that the response latencies of rats can 
be influenced directly through discriminative 
reinforcement, Although the evidence is 
clearest for the slowing down of responses 
(compared with levels obtained on rate- 
independent reinforcement contingencies, 
€g., Anger, 1956; Logan, 1960), one report 
indicates that the selective reinforcement of 
fast responses increases their probability of 
occurrence (Church & Carnathan, 1963). 

The purpose of this study was to deter- 
mine whether response speed differentiation 
through selective reinforcement could be 
demonstrated in children. In view of the 
recent widespread use of response speed 
and rate of responding as motivational in- 
dexes for children (e.g, Hill & Stevenson, 
1964; Rosenbaum & Bruning, 1966), an 
examination of the immediate determinants 
of these response dimensions seems necessary, 

Method—Each S was given 160 trials 
(8 blocks of 20 trials) on an apparatus that 
required a button-press response. The Ss 
were assigned at random to one of four 
reinforcement conditions : selective reinforce- 
ment for relatively fast responses (FR); 
selective reinforcement for relatively slow 
responses (SR) ; yoked reinforcement (YR), 
with half of the Ss paired to FR Ss, and 
half to SR Ss; and no reinforcement (NR). 


The ay it vailabli 
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Six male Ss and six female Ss were assigned 
to each condition. 

Forty-eight children, 24 boys and 24 girls, 
enrolled in the upper level ungraded classes 
of the University Elementary School served 
as Ss. Their ages ranged from 96 mo. to 
153 mo. (M = 120.19 mo.; SD = 12.96 mo.). 
Four Es each tested an equal number of Ss 
in the four groups. 

The button-press apparatus was a gray 
metal box, with two buttons and correspond- 
ing indicator lamps located symmetrically on 
the 8 X 133-in. stainless steel panel facing S. 
The box was kept in a fixed position on a 
23 X 28-in. piece of Masonite placed on the 
table between E and S. Immediately in front 
of S's position, equidistant from the two but- 
tons, an inset plate with a child's handprint 
painted on it was embedded in the Masonite. 
A capacitance relay was adjusted so that 
when S's hand was removed i-in. above 
the handprint, a Standard Electric clock 
(S-1) was activated. The clock continued 
running until S depressed one of the buttons. 
The distance from the handprint to the two 
buttons was constant, fixed at 15 in. The 
clock and control panel were located in front 
of E, and out of S's view. 

The Ss were tested individually. Follow- 
ing a demonstration, Ss were given standard 
instructions in how to operate the apparatus. 
The instructions were summarized in the 
sentence, "Remember, you put your hand on 
the pattern, wait until the light comes on, 
press the button beside the light, then put 
your hand on the pattern until the light 
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comes on again.” The S then completed 20 
trials (Block 1). Children in the FR, SR, 
and YR conditions were then told by E, 
“This time I'm going to let you try to earn 
points. You will know that you have re- 
ceived a point whenever I say ‘correct.’ Try 
to earn as many points as you can.” Testing 
resumed, and continued in blocks of 20 trials, 
with approximately 40 sec. rest between each 
successive block. A one-way vision window 
permitted a second E to observe Ss’ behavior. 

For Ss in the FR conditions, a response 
in Block n was reinforced if the response 
speed was equal to or faster than the 75th 
percentile of Block n-1. An identical graded 
criterion was followed in the SR condition, 
except that to qualify for reinforcement the 
response speed must have been equal to or 
slower than the 25th percentile of the pre- 
ceding block. The Ss in the YR condition 
were individually paired with respect to sex, 
E, and exposure to reinforcement, to Ss in 
the FR and SR conditions. Yoked Ss were 
reinforced only on the trials that their FR 
or SR counterparts had been reinforced. 
No responses were reinforced by E in Block 
1, and Ss in the NR condition were never 
reinforced. 

Results and Discussion—The two YR 
subgroups did not differ, either in terms of 
number of trials reinforced or in terms of 
mean log response time, and their data were 
combined for the major analyses. A re- 


+0,1 


— REINFORCED: FAST RESPONSES 

^*^» REINFORCED: SLOW RESPONSES 
-0.5- ——— REINFORCED: YOKED 

—-— NO REINFORCEMENT 


MEAN RESPONSE LATENCY( LOG SECONDS) 


XR 


CEPSUNRONEE ET 
BLOCKS OF 20 TRIALS 


Fic. 1. Performance of the two experimental and 
two control groups over eight blocks of 20 trials 
(differential treatment only in the final seven blocks). 
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peated measures analysis of the data shown 
in Fig. 1 (Blocks 2-8) indicated that the 
experimental treatment was significant, both 
as a main effect F (3, 44) — 855, p<.001, 
and in the interaction with trial blocks, 
F (18, 264) —819, p<.001. While the 
groups did not differ in Block 1, the dif- 
ferences were highly significant in Block 8, 
F (3, 40) =24.37, p<.001. A Newman- 
Keuls analysis of Block 8 scores indicated 
that all of the two group comparisons, ex- 
cept the one between the two control groups 
(YR and NR), were statistically signif- 
icant at p<.01. 

In the paired comparisons between the 
experimental and yoked control Ss, both 
the FR and SR groups differed from their 
respective yoked control subgroups in the 
fina! block, t (5) =2.65, t (5) = 3.09, p< 
05. 

A separate 2 X 2 analysis of the Block 1 
vs. Block 8 change scores for the two con- 
trol groups (YR and NR) indicated that 
both tended to produce faster responses in the 
final block. The combined 1 vs. 8 difference 
only approached significance, however, AE 
22) = 4.20, p < .06. 

The between-E and sex-of-subject differ- 
ences were not significant, either as main 
effects or in interaction with the experi- 
mental conditions. 

Behavior observations indicated that Ss 
reinforced for the performance of slow re- 
sponses learned a variety of "irrelevant" re- 
sponses. For example, after accidentally 
missing the button and being reinforced upon 
recovery, three Ss persisted in hitting their 
fingers against the apparatus panel before 
pressing the appropriate button. In a simi- 
lar fashion, another S learned to perform 
a “loop-the-loop” with her hand before press- 
ing the button. Still other Ss learned to 
depress it “slow and hard.” In short, the 
children in the condition selectively rein- 
forced for slow responses developed distinc- 
tive mannerisms that served to increase re- 
sponse latencies and, thereby, the Jikelihood 
of reinforcement. 

The performance of those Ss reinforced 
randomly with respect to response latency 
points up the importance of the contingency 
between response speed and reinforcement 
delivery in children. In this study, the non- 
contingent delivery of a reinforcement event 
did not systematically lead to increases in 
response speed. Only a slight (nonsignif- 
icant) gain in response speed was observed 
when the delivery of the reinforcer was not 
correlated with S's response latency. And 


L/U 


the magnitude of the gain was matched by 
a nonreinforced control group. 

What implications do the present results 
have for the use of response speed as an 
index of “drive” or “incentive magnitude” for 
children? At the least, they suggest that 
caution should be exercised in such applica- 
tions. The failure to find a significant dif- 
ference between the tandom-reinforced and 
nonreinforced groups suggests one serious 
limitation of the response latency measure in 
human studies, If only these two groups had 
been run, ie, a Partial reinforcement con- 
dition and a nonreinforcement control, the 
conclusion might have been reached that 
the reinforcement stimulus had little or no 
incentive value for these Ss. The inclusion 
of two speed-dependent groups, however, 
demonstrated that the event was a remark- 
ably powerful outcome event. The lesson 
seems clear: If response speed is used as an 
index of reinforcement magnitude, 
schedule upon which the reinforcement event 
is delivered should be speed dependent. 

though this experiment was not directly 
concerned with the use of response rate as a 
dependent variable, it seems not unreasonable 
to expect that a similar pattern of results 
rate had been manipulated. 
Some of the obscurities in the recent social 
reinforcement. literature may be traced to 
this methodological problem. A review of 


SUPPLEMENTARY REPORTS 


1965, 1967; 
a procedure might be 
effectiveness of the "social reinforcer” treat- 
ments relative to nonreinforcement conditions, 
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Cornell 


When asked to indicate whether 
had appeared before (+) or not 
common associates and synonyms 
words. In a 2nd experiment, fal 
when the preceding words associ 
lation was 


plexes of attributes or features. 


Even the bulkiest dictionaries pre- 
sent vocabulary as a list of unrelated 
items, Linguists, however, have made 
it clear that although the lexical com- 
ponent of language manifests less sys- 
tematicity than syntax or phonology, it 
nevertheless has much more structure 
than is suggested by the arrangement 
of words in dictionaries (e.g, Chom- 
sky, 1965, especially Ch. 4.; de Saus- 
sure, 1959; Katz & Fodor, 1963 ; Wein- 
reich, 1964). Within the generative 
approach to language (Chomsky, 1965) 
a theory of semantics has recently been 
developing which has as its central no- 
tion the cross classification of words 


1The research reported herein was con- 
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United States Department of Health, Edu- 
cation, and Welfare, Office of Education. 
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each of 200 orally presented words 
(—), students gave more plusses to 
of preceding words than to control 
se recognition errors were obtained 
jatively elicited the test words and 
bidirectional but not when only the 
The results of the 2nd 


into syntactically relevant categories, 
such as animate-inanimate, and human- 
nonhuman. 

Psychologists too have recognized 
that words are organized and they have 
concerned themselves with the analysis 
of the psychological processes under- 
lying this organization and with its 
consequences in verbal behavior. Ex- 

eriments on semantic generalization 
(Feather, 1965), clustering in free re- 
call (e.g., Cofer, 1965), not to mention 
work on free and controlled associations 
(e.g., Deese, 1965), can all be viewed 
as reflecting this concern. In these 
areas of psychological investigation, 
there is a general tendency, with some 
notable exceptions (e.g. Deese, 1965), 
to explain findings by reference to as- 
sociative bonds between words. The 
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present study was designed to challenge 
this approach. 

In particular, our interest was 
aroused by a study of Underwood 
(1965) in which he used a method of 
continuous recognition originally intro- 
duced by Shepard and Teghtsoonian 
(1961). Underwood had Ss indicate 
for each of 200 words, presented by a 
tape recorder, whether it had occurred 
earlier in the list or not. He found that 
common associates of words which ap- 
peared earlier in the list were falsely 
recognized more often than control 
words, 

The purpose of the first experiment 
to be reported below was to investigate 
whether relations of synonymity have 
similar effects to those of association, 
The constant use of paraphrasing in 
everyday life communication suggests 
that in coding for memory under nor- 
mal conditions speakers retain pri- 
marily the semantic content of a mes- 
sage. Since synonyms have a large 
area of meaning in common, they would 
seem natural candidates for confusion 
in the kind of task used by Underwood. 
Of course, this experimental situation, 
unlike everyday life, puts a premium 
on verbatim coding, but it was rea- 
soned that since associates proved to 
intrude in this situation Synonyms cer- 
tainly should, 

The second experiment was sug- 
gested by Razran's study (1949) on 
Semantic generalizations of conditioned 
Tesponses. He found greater general- 
ization when the test word strongly 
evoked, in a free association task, the 
conditioned word than when the con- 
ditioned word strongly evoked the test 
word. ‘For instance, when a salivary 
Tesponse was conditioned to the word 
dog there was less generalization to 
the superordinate animal than to the 
subordinate terrier. In a free associa- 
tion task, subordinates tend to evoke 
superordinates more strongly than 
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superordinates evoke subordina e 
His finding led Razran to the conclu 
sion that semantic generalization 
not an automatic process taking plac 
during original conditioning but rath 
was an artifact of subsequent testing 
Using unidirectional and bidirectiona 
associates the present authors attempted. 
to see whether false recognition v 
due to processes involved in ini 
coding of the stimulus or to confusi 
resulting from the presentation of 
associate. 


METHOD 
Subjects , 


There were 28 male university students 
in the first experiment and 34 female stud- - 
ents in the second. A 


Materials and procedure 


Each of the two experiments contained | 
200 words which were recorded on magnetic - 
tape at 10-sec. intervals. Each word was $ 
recorded twice in immediate succession to 
make sure that it would be heard. The 
tape was played to groups of five or six 
Ss. For each word, S had to indicate by | 
a + or — whether it was “old” or “new.” 
The Ss were instructed to guess when in 
doubt. The lists were constructed to con- 5 
tain three major categories of words: Pre- 
ceding (P) words, Experimental (E) X 
words, and Contro| (C) words. For each 
P word the list contained one or two E 
words, and for each E word one C word. 

Experiment I.—The E words in the first 
experiment related to the P words in two 
Ways: 24 as common associative responses 
(As) and 25 as synonyms (Ss). Table 1 
presents the list of words used in this ex- 
periment. The words included as As were 
given as responses to their respective stimuli 
in a free association test by 27-70% of the 
500 male college students Palermo and 
Jenkins (1964) used for their association 
norms. The words used as Ss were given 
as snynonyms by 30-86% of the 50 stud- 
ents Jenkins and Palermo (1965) used for 
their synonym norms, As can be seen in 
Table 1, the E words. were of four kinds: 
15 As and 15 Ss had common P words 
(eg, P: chair, A: table, S: seat), 4 As 
were related to Ps which had no obvious 
synonym (eg, P: bed, A: sleep), 5 Ss 
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TABLE 1 


Wonps Usep IN Exp. I, THEIR ORDINAL Positions AND NUMBER 
OF FALSE RECOGNITION ERRORS 


e A CA S cs 
Posi- Posi-| Er- Er- -| Er- èr- 
word | Ps) wora | Bask) Er | word |Æ] wo | Bos! Et | wed | Ee 
always 40 | never 74 1 | beside 1 |forever 
black 142 | white 185 4 | soft 3 |dark 169 i Horde 1 
girl 119 | boy 167 6 |air 5 |female 157 1 marble 1 
chair 60 | table 88 7 | valley 2 |seat 78 0 | job 2 
high 132 | low 173 1 |in 1 |tall 163 2 | safe 1 
king 141 | queen 181 4 | dance 0 |ruler 191 0 | statue 0 
live 32 | die 52 1 | see 3 | exist 68 2 | inform 0 
needle 129 | thread 153 2 | nest 3 |pin 175 1 | jar 1 
scissors 89 | cut 110 0 | look 0 | shears 128 1 | brine 0 
thirsty 44 | water 86 | 3 | paper 0 | dry 67 | 0 | nice 0 
whiskey 95 | drink 135 7 | spend 2 |alcohol 15 1 | corridor 9 
bath 158 | clean 186 5 | small 3 |shower 198 3 | humor 1 
now 145 | then 176 2 | near 9 |immediately | 192 2 | certainly 4 
over 36 | under 61 1 | within 0 ve 83 4 | early 2 
tell 79 | me 107 0 | it 0 | relate 123 2 | deceive 0 
Error Sub- 
totals 44 32 24 13 
anger 166 | mad 199 1 | rough 3 | mad 199 1 | rough 3 
carpet 159 | rug 179 2 | she 2 |rug 179 2 |shed 2 
eagle 111 | bird 151 5 | guard 3 |bird 151 5 | guard 3 
kitten 54 | cat 100 2 | wool 0 |cat 100 2 | wool 0 
swift 39 | fast 58 O | third 0 | fast 58 0 | third 0 
Error Sub- 
totals 10 8 10 8 
bed 114 | sleep 149 2 | guess 0 
green 47 | grass 91 2 | heart 0 
his 97 | hers 136 2 im 1 
stem 35 | flower 55 0 Journal 0 
Error Sub- 
totals 6 1 
baby 134 infant 160 3 | drama 1 
citizen 37 member 57 2 | winter H 
have 144 own 193 | 4 | go 0 
jump 49 hop 75 | 0 | fetch 0 
make 162 create 182 3 | reduce 2 
Error Sub- 
totals 12 4 
Grand 
Totals 60 LH 46 25 


^ The positions indicated are for the third token of each P word. 


b The positions indicated are for the first order. 


were related to Ps which had no common 
associates (eg, P: baby, S: infant), and 
in five cases the same E word was both 
an A and an S (e.g, P: carpet, A, S: rug). 

Our procedure for selecting control words 
differed from that employed by Underwood. 
Underwood's control words were common 
associates whose stimuli did not appear in 
the list. For instance, he used dowm as a 
control word because it is commonly given 
as a response to up, but up was not included 
in the list. This method does not provide 
a specific control word for each experimental 
word. Also, it implies that the only variable 
conceivably relevant to false recognition is 
associative connection—a stronger hypothe- 
sis than Underwood's experiment was de- 
signed to test. The large number of errors 
to the control words in Underwood's study 
suggests that factors other than associative 


relation influence false recognition. Because 
of these considerations, our C words were 
selected so that each one of them would be 
similar to its experimental counterpart with 
respect to part of speech, frequency of usage 
as reflected in the Thorndike and Lorge 
(1944) G count, and number of syllables. 
In addition, the C words had to have no 
obvious relation to other words in the list. 

The list thus contained 29 P words, 44 E 
words, and 44 C words. Each P word 
appeared three times in different positions 
before its E counterpart was heard. The 
three tokens of each P word were within 
10-20 positions of each other, and the E 
word appeared within 18-50 positions after 
the last token of its corresponding P word, 
In the 15 cases where a single P word had 
two E counterparts, one A and one S, the 
two E words were located within 10-22 posi- 
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TABLE 2 


Wonps Usep IN Exp. 2, THEIR ORDINAL POSITIONS AND NUMBER 


oF FALSE RECOGNITION ERRORS 


P Positions | E Position | Errors | c | Position | Errors 
Pe 
List 1 
black 120 white 169 1 fine 168 2 
boy 70 irl 99 1 bird 101 0 
chair 177 table 199 5 duty 198 3 
king 82 queen 116 0 field 118 2 
health 127 Sickness 147 0 aspect 148 1 
butter 79 bread 114 3 fault 115 2 
igh 131 low 164 5 all 162 4 
square 102 round 142 4 young 144 3 
green 104 124 6 town 122 0 
cold 159 hot 188 2 full 190 2 
List 2 
white 120 black 169 1 fresh 168 1 
girl 70 boy 99 0 car 101 0 
table 177 chair 199 6 page 198 1 
ueen 82 kin 116 3 star 118 1 
ickness 127 ealth 147 0 church 148 1 
bread 79 butter 114 3 evening 115 1 
wW 131 high 164 4 each 162 1 
round 102 square 142 0 real 144 1 
104 green 124 1 large 122 1 
fot 159 cold 188 2 just 190 1 
Error 
Subtotals 47 Y 28 
P- E 
List 1 
bitter 125 sweet 155 2 best 153 0 
gers 78 108 1 fact 107 1 
how 156 now 180 5 why 179 2 
long 130 short 161 1 rich 160 2 
swift 149 fast 178 5 kind 176 6 
bloom 154 flower 187 7 answer 189 4 
sky 133 blue 166 1 free 167 0 
whiskey 87 drink 121 1 break 119 1 
el 62 e 90 0 it 91 0 
96 window 135 0 building 136 5 
List 2 
appear 98 see 138 1 add 139 1 
cottage 163 house 185 8 time 183 1 
m 75 sleep 111 4 price 113 0 
heavy 69 light 93 2 stone 92 0 
t 72 carry 97 0 follow 95 0 
loud 128 soft 157 4 glad 158 1 
therefore 152 because 181 2 either 182 1 
stomach: 151 food 184 2 Iw 186 1 
infant 143 baby 174 3 army 173 1 
Scissors 105 cut 146 1 ask 145 3 
Error 
Subtotals 50 30 
P4—— —— —— ——— —E 
List 1 
see 98 appear 138 1 begin 139 0 
house 163 cottage 185 0 feather 183 3 
sleep 75 dream 111 4 t 113 9 
light 69 heavy 93 2 louble 92 1 
carry 72 lift 97 0 drop 95 0 
soft 128 loud 157 2 fair 158 2 
because 152 therefore 181 0 without 182 0 
food 151 stomach 184 0 lecture 186 9 
baby 143 infant 174 1 jacket 173 0 
T ent 105 scissors 146 0 parchment 145 0 
Sweet 125 bitter 155 1 active 153 1 
hand 78 108 0 corners 107 0 
now 156 how 180 3 yet 179 2 
short 130 long 161 6 late 160 3 
fast 149 swift 178 0 cruel 176 1 
flower 154 bloom 187 0 189 0 
blue 133 sky 166 3 art 167 1 
drink 87 whiskey 121 0 antique 119 0 
me 62 tell 90 0 lay 91 3 
window 96 door 135 0 k 136 0 
Error 
Subtotals 23 17 
Error Grand 
120 75 


* The positions indicated are for the second token of each P word. 
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tions of each other. Each C word was 
within two positions of its E word, with 
half the Cs preceding and half following 
the E words. The Cs for As will be re- 
ferred to as CAs and those for Ss as 
CSs. The positions of P, E, and C words 
are indicated in Table 1. 

In addition to words in these three cate- 
gories, the list also contained 22 filler words. 
Of these, 20 appeared once each, one— 
twice, and one—three times. The fillers had 
no apparent relation to any of the other 
words in the list. They occupied positions 
1-10 and other positions mostly in the first 


` quarter of the list. The list was recorded in 


two orders. For the first order, the words 
were randomly arranged, within the con- 
straints outlined above. The second order 
was identical to the first, except that the 
positions occupied by Ss and CSs were re- 
placed by their respective As and CAs, 
and vice versa. This interchange was, of 
course, possible only for the words which 
had common Ps. 

Experiment II.—In the second experi- 
ment there were 30 E words and all of them 
had an associative relation to the P words. 
The words used in this experiment and 
their positions can be seen in Table 2. The 
As were drawn írom two sources. Some 
were given as responses to their respective 
stimuli by 22-73% of the 500 female college 
students in the Palermo and Jenkins (1964) 
norms and others by 26-74% of the 1,349 
airmen in the Bilodeau and Howell (1965) 
norms. The 30 E words fell into three cate- 
gories in terms of their relation to P. words. 
In 10 P-E pairs, the E words were common 
responses to the P words as stimuli but 
they did not commonly elicit the P words 
as responses (e.g. bitter — sweet, P> E). 
In 10 other cases, the E words commonly 
elicited the P words as responses but this 
relation was not reciprocated by the P 
words (light <- heavy, P €— E). Finally, 
in 10 cases, the associative link went in 
both directions (king €? queen, P €? E). 
Most of the words considered as nonasso- 
ciates were given by less than 1096 of the 
respondents in the norms used and only 
one word reached 22% response commonal- 
ity. For instance, while sweet was given 
by 5396 of the Palermo-Jenkins Ss in re- 
sponse to bitter, only 296 gave bitter as a 
response to sweet. The words (List 1) that 
served as Ps for about half (16) of the 
Ss served as Es for the rest of the Ss 
(18, List 2), and vice versa. The mean 
association values for the directions indi- 


cated are: P E— 5396, PP E=P<E 
= 42%, In Exp. II, as in Exp. I, the char- 
acteristic that distinguished a P word from 
an E word was its ordinal position in the 
list: the P word came before its correspond- 
ing E word. Because each E word had a 
specific C word, the Cs for the two lists had 
to be different. In Exp. II, each P word 
appeared only twice. Thus, the distribution 
of words in the list was as follows: 60 
positions were occupied by the Ps, 30 by 
the Es, and 30 by the Cs. The rest, 80 
positions, were held by filler words. Thirty- 
five fillers appeared once each, 19—twice 
each, one—three times, and one—four times. 
The first 49 positions were taken by fillers, 
the rest of the fillers were scattered through- 
out. The second token of each P word 
was within 10-20 positions of the first 
token, and each E word was placed 20-50 
positions following the second token of its 
P counterpart. In other respects the de- 
sign of the second experiment was similar 
to that of the first. 


RESULTS 


Two types of errors were possible in 
these experiments : (a) identification of 
a new item as an old one (positive er- 
rors, or false recognition errors), and 
(b) identification of an old item as a 
new one (negative errors). Negative 
errors could have been made to the 
repetitions of the P words and filler 
words. False recognition errors could 
have been made to all other words. 
The number of errors of any kind was 
minimal. In Exp. II, it amounted to 
6% out of the possible number of er- 
rors, and in Exp. I the number of er- 
rors reached only 5%. 

Our interest in this paper is focused 
on the comparison of the number of 
errors made to the E words vs. the 
number made to the C words. The 
errors made in these two categories of 
words can be seen in Tables 1 and 2 
for Exp. I and II, respectively. In- 
spection of the distribution of errors 
in Exp. I showed an atypically large 
number of errors to the P word “bath” 
(12 errors, while the next highest fig- 
ure for a P word was 6) and to the CA 
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word "near" (9, next highest in this 
category being 5). For Exp. I, there- 
fore, the comparisons between E and C 
words excluded the two rows (10 
words) in which these two words ap- 
peared. After this exclusion, both the 
association and synonymity variables 
produced significant results, for the 
S-CS difference, ¢ (27) = 294, p< 
.01, and for the A-CA difference, t (27) 
= 2.52, p < .02. Comparisons for the 
complete data without exclusions 
yielded significant results for S-CS, t 
(27) =2.42, p<.05, but not for 
A-CA, t (27) = 1.85, p < .10. 

The exclusions thus do not alter the 
status of synonymity as a relevant vari- 
able, they affect seriously only the as- 
sociation variable. Since the main ob- 
ject of this study was to weaken the 
omnipotence of associative relations in 
verbal behavior, the exclusion proce- 
dure seems theoretically conservative. 
As will be seen below, another type of 
experimental error, “misperception,” 
militated in the present study against 
the association variable. 

The means for the 23 entries involved 
in the S-CS comparison are: S = 1.46, 
CS =.71, and the means for the 22 
A-CA entries are: A —1.89, CA= 
1.08. The five E words which were re- 
lated to the P words as both associates 
and synonyms, and their corresponding 
C words, were included in both the 
A-CA and the S-CS comparisons, but 
as can be seen in Table 1 these words 
contributed very little to the differences 
obtained. 

The higher error rate for the A and 
CA categories as compared to the S and 
CS categories may be due to the higher 
frequency of these words. While only 
5 A and 5 CA words had Thorndike- 
Lorge frequencies under AA, 16 S 
words and 15 CS words fell below this 
level. The more frequent the words, 
the greater the likelihood that they will 
have interrelationships amongst them- 


selves. Such relationships, beyond 
those under experimental control, will 
tend to contribute to false recognition. 
Generally, it is practically impossible 
to construct a list of 200 words which 
will not contain relations other than 
those built-in experimentally. The best 
one can achieve is the elimination of 
any glaring nonexperimental relations 
and then depend on chance to distribute 
the remaining relations equally among 
all conditions. , 

In Exp. II, the statistical analysis for 
the pooled data of Lists 1 and 2 reveals 
that significantly more false recognition 
errors were made to the E words than 
to the C words when the E words were 
bidirectionally associated with P words, 
P<>E; means: E—1.38, C = 82, t 
(33) = 2.81, p< .01, and when the 
associative link was in the forward di- 
rection, P>E; E = 1.47, C = 88, t 
(33) = 2.23, p < .05, but not when the 
association was backward, P «— E; E 
= .68, C= .50, t (33) = 65, ns. In 
separate analyses for List 1 and List 2, 
the pattern of results for List 2 was 
identical to the pattern for the pooled 
data but none of the E-C comparisons 
for List 1 reached significance. 

Why was the pattern of results in 
List 2 not duplicated in List 1? It 
may be seen by inspecting Table 2 that 
the number of errors to the E words 
in List 1 was roughly the same as in 
List 2. The difference between the two 
lists is that the number of errors to the 
C words in List 1 was more than 
double that in List 2 for the PoE 
and P — E categories. It is possible to 
blame the higher error score of the C 
words in List 1 on gratuitous relations 
between some of these words and other 
words in the list. For instance, bind 
which was used as a control for the 
adjective fast contributed a very high 
number of errors (6). This control 
could have been perceived by Ss as a 
synonym of the noun type which ap- 


INITIAL CODING AND FALSE RECOGNITION 177 


peared earlier in the list as a filler. 
Although such ad hoc explanations are 
of little value, because one can probably 
find some nonexperimental relation for 
most of the words in the list, the point 
stands that uncontrolled (and uncon- 
trollable) relations, whatever they are, 
played a role in this experiment, and 
could account for the disproportionate 
number of errors in the C category of 
List 1. 

After completion of the experiments 
the suspicion arose that some words 
may not have been heard as the words 
they had been intended to be. To 
check on this possibility, different Ss 
were asked to listen to the tapes and to 
write down what they heard. Eight Ss 
did this for the tape used in Exp. I 
and four for the tape used in Exp. II. 
This test revealed some misperception, 
mostly due to homophones. The mis- 
perceptions of one or more Ss were 
distributed among the word categories 
as follows: Exp. I, two P words, four 
A words, six CA words, one S word, 
and one CS word; Exp. II, seven P 
and E words, and four C words. 

It is apparent that in Exp, I there 
was a greater concentration of misper- 
ceptions in the A category words than 
in the other categories. Because of this 
it is likely that the t value for the differ- 
ence between A and C words under- 
estimates the true contribution of as- 
sociative relations to false recognition. 
However, this problem is not too seri- 
ous for the present study because its 
purpose was not to ascertain the exact 
weight of associations in determining 
false recognition but rather to introduce 
synonymity as ah additional variable. 


Discussion 


In interpreting the results of the second 
experiment it must be kept in mind that 
the forward (PE) and backward 
(P & E) conditions differed not only in 
temporal order of the free association 


stimuli and responses relative to each 
other but also in the number of times the 
associative stimuli appeared. In the for- 
ward condition where the associative 
stimuli served as P items, they appeared 
twice, but in the backward condition, 
where they served as E items, they ap- 
peared only once. For this reason it is 
not possible to conclude that backward 
associative relations cannot produce false 
recognition. But one can conclude that 
forward relations do result in such an 
effect, thus establishing that initial coding 
plays a role in false recogniton, and that 
it is not merely an artifact of testing for 
it. The discrepancy between this conclu- 
sion and Razran's is not too disturbing, 
because of the basic methodological flaws 
in his experiment recently summarized by 
Feather (1965). In fact, despite the 
gadgetry involved in semantic generaliza- 
tion experiments—perhaps, because of it 
—it seems that the judgmental procedure 
employed in the present experiments taps 
the underlying semantic processes more 
directly than the semantic generalization 
procedure, In semantic generalization, the 
CR transfers, according to this view, to 
words judged, mistakenly in the case of 
the test words, by Ss (not necessarily in 
full awareness) as having been previously 
conditioned. The generalization gradient, 
whenever it obtains, may reflect the de- 
gree of confidence associated with such 
judgments. Such mistaken judgments re- 
flect underlying processes of word coding. 

What are these processes? A model is 
needed that would account both for Ss’ 
pronounced ability to recognize correctly 
new and old words and for the systematic 
false recognition errors found. It is clear 
that S must somehow mark off the words 
on the list from the tens of thousands of 
words in his vocabulary. This marking 
off may be achieved in the following way. 
Assume that, when activated, the neural 
processes giving a word its identity leave 
a trace which is “dark” and “heavy” at 
first and with the passage of time wears 
out and becomes fainter and fainter. A 
mechanism of this sort would keep the 
time of words and enable S to judge each 
word as to whether it was heard in the 
experimental session or prior to it. On 
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the basis of this information S could clas- 
sify a word as new or old. 

This timing notion can thus account for 
Ss’ correct identification of words. But 
it cannot explain why Ss made the sys- 
tematic false recognition errors described. 
In order to account for these errors it 
must be assumed that the word is not the 
ultimate unit of. coding. If it were, asso- 
ciates and synonyms should not produce 
more errors than control words, Rather, 
our finding, along with the common phe- 
nomenon of paraphrasing and related ob- 
servations (e.g., Broadbent, 1964; Brown 
& McNeill, 1966; Yavuz & Bousfield, 
1959), supports the conception of words 
as complexes of features. According to 
this view, each word consists of a set of 
features or attributes which uniquely char- 
acterize it and distinguish it from all 
other words in the vocabulary system, 
The features are of many different kinds 
and involve semantic, syntactic, phono- 
logical, and for literates, orthographic 
aspects. For instance, the feature char- 
acterization of table would describe it as 
“a piece of furniture,” “a noun,” “having 
a [t] sound in initial position,” “an (e) 
letter in final position" etc. On this 
conception, the encoding of a word 
would correspond to a simultaneous ac- 
tivation of a set of features, Many of the 
semantic features and all of the features 
in the other three categories would be 
common to large segments of S’s lexicon, 
but some semantic features would be spe- 
cific to a single item or to a small group 
of items (see Katz & Fodor, 1963). The 
general features serve to relate various 
lexical items to each other. For instance, 
many English words begin with a (t), and 
all these words constitute, in some sense, 
a category whose members share this fea- 
ture. But the fact that a table stands at 
a certain height, broadly defined, in rela- 
tion to its other dimensions could be 
thought of as an idiosyncratic feature. 

Within the framework of the feature 
analysis, different responses to the same 
word share to a great extent the same 
Processes. When an S is asked in a free- 
association test to give the first word that 
comes to his mind, he focuses on some of 
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the features of the word, perhaps the ones 
which appear to him most salient at the 
time. When he is asked to give a 
synonym, he focuses again on a subset of 
the feature complex. In this case the 
features relevant for the determination of 
the response are semantic. When he is 
asked to produce rhymes, he focuses on 
phonological features, and when asked to 
respond with words of the same letter 
length as the stimuli, he focuses on ortho- 
graphic features. According to this view, 
associative responses and So-called cate- 
gorical responses do not entail totally dif- 
ferent processes, they result mainly from 
different selections of features. 

The idea of feature coding can account 
for the false recognition errors resulting 
from associative and Synonymy relations, 
When a new word is heard which shares 
some significant features with an old 
word, S may be led mistakenly to “dis- 
regard” the distinguishing features and 
consider the two as identical, The as- 
sumption implied here is that not all 
features carry equal weight. When a 
word is heard, some of the features poten- 
tially associated with it may not get ac- 
tivated and even if activated not all fea- 
tures leave noticeable traces, 

In conclusion, we would like to restate 
the hypothesis that words are not stored 
as words but as complexes of features, 
When words are used they are not repro- 
duced from memory but rather recon- 
structed from their component features, 
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measured from 3 muscle groups are attenuated at about the time the 
UCS is expected. When respiratory activity is not controlled under 
these conditions, both the Írequency and the magnitude of respiration 


acceleration of heart rate, observed on test trials to follow a small 
deceleratory response, is associated with an increase in EMG and 
respiratory activity. These data are considered to be consistent with 
an hypothesis which views cardiac and somatic-motor events as dif- 


ferent aspects of the 
position for behavioral processes 


This research was concerned with 
whether the heart-rate changes in 
human Ss observed in anticipation 
of aversive stimuli during simple 
classical conditioning are related to 
the modification of striate muscle 
activity. Two complimentary hypoth- 
Ses were evaluated. One, the more 
dominant anticipatory deceleration of 
heart rate observed with interstimulus 
intervals (ISI) in excess of 4.0 sec, 
(Hastings & Obrist, 1967 ; Notterman, 
Schoenfeld, & Bersh, 1952; Obrist, 
Wood, & Perez-Reyes, 1965; Wood & 
Obrist, 1964; Zeaman, Deane, & Weg- 
ner, 1954) would be concomitant with 
an inhibition of EMG activity. Sec- 
ond, the more dominant acceleration of 
heart rate observed on test trials with 
a short ISI of 1.0 sec, (Hastings & 
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Same response process. 


Implications of this 
are discussed, 


Obrist, 1967) would be concomitant 
with an increase in EMG activity. 
These hypotheses were suggested by 
several lines of evidence, F irst, cardiac 
and striate muscle effects appear to 
have common central and peripheral 
mediating mechanisms with respect to 
both inhibitory (Gellhorn, 1964; 
Kaada, 1960; Lofving, 1961) and ex- 
Citatory effects (Bard, 1960; Hoff, 
Kell, & Carroll, 1963; Landau, 1953; 
Rushmer, Smith, & Franklin, 1959; 
Wall & Davis, 1951). Second, vaga- 
tonic effects on the heart, i.e., cardiac 
deceleration, have been observed to 
occur along with a loss of muscle tone 
or a cessation of somatic-motor activity 
in anticipation of or to certain stressful 
situations (Engel, 1950; Gellhorn & 
Loofbourrow, 1963; Richter, 1957). 
Third, cardiac changes observed in 
dogs during classical and operant con- 
ditioning were consistently related to 
somatic-motor activity (Obrist & 
Webb, 1967; Webb & Obrist, 1967). 
These various lines of evidence sug- 
gested: (a) that neurophysiologically, 
cardiac and somatic-motor events can 
be interrelated or coupled events such 
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that inhibition or excitation in the 
motor system would be concomitant 
with inhibition or excitation in the 
cardiovascular system, and (b) that 
because of this coupling, the cardiac 
response in anticipation of or to stres- 
sors of various sorts would be con- 
comitant with the direction and ampli- 
tude of the somatic-motor response 
initiated by the organism in coping 
with the stress. Therefore, it would 
be expected that if the anticipatory 
cardiac deceleration during aversive 
conditioning in human Ss is one aspect 
of an inhibitory cardiac-somatic re- 
sponse process, then one should find a 
parallel inhibition of anticipatory striate 
muscle activity. Also, if the more 
dominant acceleratory response ob- 
served on test trials with a short ISI 
(Hastings & Obrist, 1967) is one 
aspect of an excitatory cardiac-somatic 
process, then an increase of anticipa- 
tory striate activity should be seen. As 
a means of evaluating this position, 
three experiments were carried out in 
human Ss in which striate activity 
was evaluated by electromyography 
(EMG). 


EXPERIMENT I 


A long 7.0 ISI was used to produce 
a deceleratory cardiac response in order 
to determine whether this effect and an 
inhibition of striate activity were con- 
comitant. 


Method 


Subjects.—The Ss were 19 healthy under- 
graduate males randomly selected from the 
Introductory Psychology course. Technical 
difficulties prohibited analysis of the EMG 
recordings in one S and of the heart-rate 
data in two additional Ss. i 

Apparatus—The methods for measuring 
heart rate and respiration were similar to 
those previously described (Wood & Obrist, 
1964) with one exception. EKG was mea- 
sured using Beckman biopotential electrodes 
from a precordial lead with a reference 
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electrode on the opposite side of the chest. 
Striate muscle activity (EMG) was mea- 
sured from muscles of the chin, the neck, 
and the left arm. The positions of the neck 
and the arm (pronator teres muscle) leads 
were the same as those described by Malmo 
and Surwillo (1960). On the chin, one 
electrode was directly behind the point on 
the chin where the muscle attaches to the 
bone, while the second electrode was on the 
muscle to the inside of the right jaw bone 
and about 2 in. from the first. The chin 
lead is able to detect activity in several 
muscle groups involved in jaw and tongue 
movements and in swallowing. These three 
positions were selected because it was 
thought that they would be more active than 
other muscle groups under the experimental 
conditions; that is, Ss were seated in a re- 
laxed position in a lounge chair. The head 
was considered to be the part of the body 
that might be most readily subject to spon- 
taneous movement. The left arm was mea- 
sured because the UCS, shock, was adminis- 
tered on two fingers of the left hand. This 
lead should detect any anticipatory tensing 
of the arm. 

Procedure.—Conditioning procedures were 
identical to those described in previous work 
(Obrist, Wood, & Perez-Reyes, 1965) with 
two principle exceptions. First, simple trace 
conditioning rather than differential pro- 
cedures was used (see Hastings & Obrist, 
1967; Obrist & Wood, 1966). Second, 12 
rather than 10 nonreinforced test trials were 
given with the extra two trials occurring as 
the last two of a total of 28 reinforced and 
test trials. 

The CS-UCS interval (ISI), measured 
from the onset of the CS to the onset of the 
UCS, was 7 sec. The CS was a dim blue 
light of 2-sec. duration. The UCS was a 
6-sec. shock. Prior to the start of condition- 
ing S adjusted the intensity of the shock 
to be "very painful" Ten CS alone adapta- 
tion trials preceded conditioning. Respira- 
tion was controlled at a normal, but constant, 
frequency and amplitude throughout adapta- 
tion and conditioning except for two an- 
nounced 90-sec. rest periods. 

Data Quantification—The anticipatory 
heart-rate changes on the 12 test trials were 
quantified in a second-by-second analysis as 
previously described (Wood & Obrist, 1964, 
1968). Deceleration of heart rate is indi- 
cated by this technique as a lengthening of 
the R-R interval which is measured in milli- 
seconds. An increase of 10 msec. within the 
range of heart rates observed under these 
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Fic. 1. Example of cessation of EMG activity and heart-rate deceleration on a test trial. 


conditions approximates a deceleration of 
about 1 beat/min. 


Respiration 
was quantified 
deviate signifi 


amplitude following CS onset 
9n test trials and found not to 
cantly from the base period. 


For this purpose, the amplitude was evaluated 
for all inspirations in the 15 sec. following 
CS onset, for the largest inspiration in the 
first 6 sec. following CS onset, and for all 
inspirations occurring during the periods 6-12 
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sec. following CS onset or the period of 
maximum deceleration (for further details, 
see Wood & Obrist, 1964). 

The EMG was quantified on test trials 
in two ways. The tension level was read 
from the integrated output of an Offner 
EMG integrator coupler and was analyzed 
similarly to the post-CS second-by-second 
analysis used on heart rate. The gain used 
for measuring chin activity was adjusted 
sufficiently high in order to detect small 
changes in tension level. This was such a 
high gain, however, that any bursts in EMG 
activity exceeding 50-100 & v. drove the pens 
off the record. The tension level which 
was usually in the range of 10-30 v. varied 
little either in the base period or following 
CS onset with the exception of these bursts 
of activity. A second analysis consisted of 
determining from the raw or direct output 
of the EMG coupler (which had been re- 
corded on separate channels from the inte- 
grated output) the frequency and duration 
of all bursts of EMG activity which ex- 
ceeded 50, v. for all three muscle groups. 
A sustained burst of activity sometimes ap- 
peared to be composed of two or more con- 
secutive bursts. In these instances, the ac- 
tivity was counted as separate bursts if the 
first or preceding burst returned approxi- 
mately halfway or more to the base line. 
The minimal criterion of 50 uw v. used to count 
the presence of a burst was selected because 
the tension level was usually less than this 
value making it easy to detect the presence 
of any change in activity. It is the EMG 
activity, as shown by this latter analysis, 
that is reported in the results section of 
each experiment. 


Results 


An inhibition of bursts of EMG ac- 
tivity was observed to be concomitant 
with the anticipatory deceleration of 
heart rate. An extreme example of 
the effect is illustrated in Fig. 1. This 
shows a test trial with almost contin- 
uous small bursts of chin EMG activity 
which ceased shortly after CS onset 
and commenced again several seconds 
after the point in time in which the 
UCS would occur. The effect, although 
typical for this S, was never found to 
be as pronounced in other Ss because 
most Ss never had such a continuous 
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display of background activity. For 
example, of the 216 base line or 15-sec. 
pre-CS periods (ie. 12 trials/S X 
18 Ss) 140, 50» v. or larger distin- 
guishable bursts occurred in 86 of 
these periods. The remaining periods 
showed no activity. Typically, when 
EMG activity occurred on either or 
both the arm and neck leads, there was 
corresponding activity on the chin. On 
the other hand, there were numerous 
occasions when bursts of activity were 
noted to occur only on the chin. For 
example, of the 140 EMG bursts in the 
base period, 60 of the 63 occurring on 
the neck and/or arm were accompanied 
by activity on the chin while there were 
another 77 bursts which were recorded 
only on the chin leads. 

Following CS onset on test trials, 
an appreciable reduction in the occur- 
rence of EMG bursts was noted at 
about the peak of the deceleration. The 
cardiac response itself was a uniphasic 
deceleration. It was a highly reliable 
effect as evaluated by a one-way trend 
analysis for group data with significant 
linear, F (1, 224) = 27.45, p < .001, 
quadratic, F, (1, 224) = 94.32, p< 
.001, and cubic, F (1, 224) = 20.69, p 
< .001 components. Figure 2 depicts 
the second-by-second changes from CS 
onset of both heart-rate and EMG 
changes on test trials. Heart rate is 
displayed as the change from the pre-CS 
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Fic. 2. Second-by-second changes in heart- 
rate and EMG activity on test trials—7.0-sec. 
ISI, respiration controlled. 
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base period? EMG bursts are depicted 
as the number of times in any one 
second that a 50» v. or larger burst 
Occurred (a burst-duration measure). 
"Therefore, following CS onset there 
was an attenuation of EMG activity 
which peaked at the time the UCS 
would occur. At this point, only one 
instance of a 50 » v. or larger burst was 
observed. The overall change from the 
base to the post-CS period was as fol- 
lows. The number of trials with EMG 
bursts decreased from 84 to 42, the 
number of distinguishable EMG bursts 
decreased from 140 to 54, and the total 
duration of EMG bursts decreased 
from 275 to 100 sec. In all, 17 of the 
18 Ss had a reduction in EMG activity 
following CS onset with the pre-post 
CS difference being highly reliable— 
t (17) = 5.87, p< .001, duration; t 
(17) 24.90, »<.001, number of 
bursts. 

There was little evidence that Ss 
braced or tensed in anticipation of the 
shock. The arm lead, particularly, might 
be expected to detect such increased 
activity. In only one S was anticipa- 
tory EMG activity observed on the 


2The heart-rate changes in this figure 
as well as in all following figures have been 
corrected for the lag inherent in the integra- 
tion of R-R interval (see Wood & Obrist, 
1968). This involved omitting from the 
figures the € heart-rate change obtained from 
the second-by-second analysis in the first 
second after CS onset and moving all mean 
second changes thereafter 1 sec. nearer to 
CS onset. Thus, 14 rather than 15 measure- 
ment periods are depicted. Because the first 
second following CS onset in the second-by- 
second analysis actually depicts heart-rate 
change in the last second before CS onset, 
these changes were included in the trend 
analysis to provide a zero change base line 
for detecting trends. It should be noted that 
the correction in these figures has not been 
used in previously reported experiments 
(Hastings & Obrist, 1967; Obrist & Webb, 
1967; Obrist & Wi , 1966; Obrist et al, 
us j Wood & Obrist, 1964; Wood & Obrist, 
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arm. On most trials, this S demon- 
strated a 20-30 v. response which 
commenced around Sec. 3 and was sus- 
tained for five or more seconds. How- 
ever, in this S, there was no anticipa- 
tory cardiac deceleration as averaged 
over the 12 test trials. Furthermore, 
this was the only S failing to show the 
deceleratory effect. 

The results, therefore, are consistent 
with the hypothesis that the cardiac de- 
celeration observed in anticipation of an 
aversive stimulus may be part of an 
inhibitory  cardiac-somatic response 
process. This possibility was then fur- 
ther evaluated by determining more 
precisely the concomitance between 
cardiac and somatic-motor effects. 
Specifically it was hypothesized that 
the anticipatory deceleration of heart 
rate, at least in part, represents a mo- 
mentary reduction in heart rate to a 
level one would find when no EMG 
activity is present. 

The coincidence of cardiac accelera- 
tion and EMG activity was evaluated 
by determining the magnitude of the 
cardiac change coincident with EMG 
bursts which occurred in measurement 
periods when S had been instructed not 
to make any unnecessary movements. 
These periods included the 10-min. rest 
period at the beginning of the experi- 
ment and the intertrial interval during 
adaptation and conditioning. How- 
ever, the cardiac changes were not eval- 
ulated in periods which overlapped the 
CS, UCS, or a preceding EMG burst. 
In all, 261 instances were evaluated. 
Almost without exception, heart rate 
was found to accelerate during EMG 
bursts. The difference between the 
mean heart rate in the 10 sec. preced- 
ing an EMG burst and the 10 sec. com- 
mencing with the onset of an EMG 
burst averaged 35 msec., t (15) = 6.34, 
P < 001. The cardiac response was of 
a similar magnitude even when the in- 
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stances were omitted which had any 
type of respiratory irregularity. 

The pre-CS base levels of heart rate 
were also found to be more accelerated 
during base periods when EMG bursts 
were present. For each S, test trials 
where dichotomized into those with and 
without EMG bursts of 50 » v. or more, 
and then the heart rate on each type of 
trial was averaged. Fifteen of the 16 
Ss had both types of base periods. 
Heart rate was found to be reliably 
more accelerated when bursts occurred, 
the mean within-S difference being 
37 msec., t (14) = 4.00, p < .01. 

Finally, the deceleration following 
CS onset was found to be largest on 
trials which had EMG bursts in the 
pre-CS period and a cessation of EMG 
activity following CS onset. In evalu- 
ating this possibility, the magnitude of 
the deceleration following CS onset was 
compared between test trials which 
showed no activity during the pre-CS 
period and those which had 50 y v. or 
larger bursts during the pre-CS period. 
Any trial was eliminated from this 
analysis if following CS onset there 
were any bursts of EMG activity, even 
if they were less than 50 V., or ir- 
regularities in respiration. The latter 
was defined as any inspiration which 
either increased or decreased by 20% 
or more of the base-period average. Of 
the 192 test trials, 78 met this criterion 
of which 35 had EMG activity and 44 
had no EMG activity during the base 
period. Both types of trials occurred in 
varying numbers in most Ss. A 
second-by-second post-CS heart rate 
analysis was done for each condition. 
The deceleration at its peak was about 
twice as large when EMG activity was 
present in the pre period than when no 
EMG activity occurred. Figure 3 de- 
picts these changes. In order to obtain 
an idea of the reliability of the post-CS 
difference between these two types of 
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——No EMG Bursts in Base Period, N«44 (Trials) 
EMG Bursts in Bose Period, N=38 (Triola) 
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Fic. 3. Second-by-second changes in heart 
rate on test trials with and without EMG 
activity in the base period—7.0-sec. ISI, 
respiration controlled. 


trials, the cardiac change during the 
3 sec, of maximum deceleration (ie., 
Second 7, 8, 9) was averaged sepa- 
rately for each trial The mean dif- 
ference between conditions of 45 msec. 
was highly reliable, t (77) = 378, p < 
001. This analysis treated each trial 
within a coridition, as well as each con- 
dition as independent, because any 
alternative method of data analysis 
which did not violate these assumptions 
was not practical, Thus, this statistic 
must be qualified accordingly. 

These analyses demonstrated fairly 
conclusively that heart rate accelerated 
when bursts of EMG activity occurred, 
They further suggest that the decele- 
ration following CS onset will be larger 
when EMG bursts are present in the 
base period and cease following cS 
onset. But even here, it should be 
noted that a deceleration of heart rate 
was still present, although smaller, 
when neither respiratory changes mor 
EMG activity were encountered in 
either period of measurement. 


EXPERIMENT IT 


A short 1.0-sec. ISI was used in 
this experiment in order to produce an 
acceleratory effect on nonreinforced 
test trials to determine whether an 
increase in striate activity was con- 
comitant with the acceleration. 
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Fic. 4. Second-by-second changes in heart 
rate and EMG activity on test trials—1.0-sec, 
ISI, respiration controlled. 


Method. 


The same apparatus, procedures, and meth- 
ods of data quantification were used as in 
Exp. I except the interval between CS and 
UCS onset was 1.0 sec. The Ss were 20 
undergraduate men selected from the Intro- 
ductory Psychology course. 


Results 


Cardiac and EMG activity were 
again found to be concomitant, but in 
this case the dominant aspect of the 
cardiac response was an acceleration of 
heart rate, while bursts of EMG activ- 
ity increased rather than decreased fol- 
lowing CS onset. Figure 4 presents 
the second-by-second changes on test 
trials for cardiac and EMG activity 
similiar to Fig. 2. The cardiac re- 
sponse was an initial, briefly sustained, 
deceleration of heart rate which was 
followed by a larger and more sustained 
acceleration. This change is essentially 
a perfect replication of results previ- 
ously obtained with a short ISI (Hast- 
ings & Obrist, 1967). One-way trend 
analysis for group data performed on 
these cardiac changes produced a highly 
significant linear component. F (1, 
266) = 134.97, p< 001, and less sig- 
nificant quadratic, F (1, 266) = 7.51, 
$ < 01, and cubic, F (1, 266) = 8.55, 
p < .005 components. Thus the bi- 
phasic trend was significant as was the 
large acceleratory response. 
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Bursts of EMG activity were noted 
to increase relative to the base period 
starting with Second 3 and to follow a 
path that was fairly parallel with the 
acceleration of heart rate. The overall 
change from the base to the post-CS 
period was as follows. Of the 240 test 
trials, the number of trials with bursts 
increased from 29 to 48, the number 
of bursts increased from 40 to 74, and 
the total duration of bursts increased 
from 70 to 146 sec. The latter change 
was not quite significant, t (19) = 2.00, 
p « .10. 

These data were also evaluated with 
respect to whether the post-CS accele- 
ration of heart rate was specific to 
those occasions when EMG bursts 
occurred following CS onset. For this 
purpose, cardiac activity was compared 
on test trials with and without EMG 
activity following CS onset. In order 
to increase the N, trials were con- 
sidered to have EMG activity even if a 
distinguishable EMG burst was 
< 50 uv. However, any trial was ex- 
cluded from this analysis if there was 
EMG activity in the base period. Also 
omitted were trials which had no EMG 
activity following CS onset, if irreg- 
ularities in respiration occurred. Of 
the 240 trials, 76 with no EMG bursts 
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Fic. 5. Second-by-second changes in heart 
rate on test trials with and without EMG 
activity in the post CS period—1.0-sec. ISI, 
respiration controlled. 
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and 54 trials with EMG bursts met 
these criteria. Figure 5 depicts the 
second-by-second change in heart rate 
for both types of trials. Although an 
initial deceleration was observed in 
both instances, there was essentially no 
acceleratory response when EMG 
bursts were not present, but a large 
acceleration when bursts of EMG ac- 
tivity were present. The difference 
between the two types of trials was 
evaluated by averaging the cardiac 
change between Seconds 5-9. In this 
case, only Ss were used who had four 
or more such trials of each type in 
order to increase the reliability of the 
estimate of cardiac change for each S. 
Eight of the Ss had four or more trials 
with EMG bursts and another 11 had 
four or more trials without bursts. 
The mean difference of 41 msec. be- 
tween trial types was found to be sig- 
nificant, ¢ (17) = 2.60, p < .02. Also 
evaluated were the post-CS cardiac 
changes on test trials with increases in 
respiratory amplitude which were > 
20% but when no EMG activity was 
present either before or after CS onset. 
Here also an appreciable acceleration 
of heart rate was observed following 
the initial deceleration on the 46 trials 
used in this analysis. Therefore, it 
would appear that the acceleration ob- 
served on test trials when the CS-UCS 
interval is shortened to 1.0 sec. is asso- 
ciated with an increase in either EMG 
activity or respiratory amplitude or 
both. In the absence of such somatic 
changes, there is essentially no cardiac 
acceleration. 

The frequency or number of EMG 
bursts in the base period for this ex- 
perimental group was considerably less 
than that for Ss used in Exp. I. 
Whether this difference was due to ex- 
perimental conditions, ie, shortening 
the ISI, or to inherent differences in 
reactivity of the individuals in each 


group was evaluated by determining 
the amount of EMG bursts during the 
base period of adaptation trials which 
occurred prior to pairing the CS and 
UCS. It was found that Ss in the 
short ISI group were significantly less 
reactive during adaptation. Thus, 
these EMG base-level differences be- 
tween groups during conditioning re- 
flect individual differences in reactivity 
for which the two groups were not 
matched, 


Experiment III 


This experiment was carried out to 
determine whether the concomitance 
between inhibitory cardiac and somatic- 
motor effects as seen in Exp. I with 
a 7.0-sec. ISI could be accentuated by 
increasing base levels of striate activ- 
ity. This, it was thought, could be 
accomplished by eliminating instruc- 
tions which had been given in the 
previous experiments for S to sit still 
and make no unnecessary movements. 
The Ss were also not required to main- 
tain a normal depth and frequency of 
respiration. It was felt that such a 
requirement might impose restraint on 
other types of striate activity. 


Method 


The same apparatus, procedures, and meth- 
ods of data quantification were used as in 
Exp. I with the following two exceptions. 
First, respiration was in no way controlled. 
Second, all instructions were omitted which 
might tend to suppress somatic-motor activ- 
ity. In the previous groups Ss were in- 
structed at several points in the experiment 
to sit still and make no unnecessary move- 
ments except at designated rest periods. 
This instruction was not given or implied in 
any way. The Ss were 19 undergraduate 
male volunteers recruited from summer- 
school students. 


Results 


The results tend to replicate those of 
Exp. I with respect to the coincidence 
of heart rate and EMG changes. How- 


188 


ever, in certain respects the effects 
were not as consistent, nor were they 
more pronounced as had been antici- 
pated. There was even evidence in 
some Ss that EMG activity and heart 
rate could be out of phase. Also, the 
inhibition of respiratory events and the 
deceleration of heart rate appeared to 
be coincident. 

The anticipatory cardiac response on 
test trials was a large, essentially uni- 
phasic deceleration of heart rate. The 
one-way trend analysis for group data 
revealed significant linear, F (1, 252) 
= 133.60, p < .001, quadratic, F (1, 
252) = 101.74, p< .001, and cubic, 
F (1, 252) = 125.31, p < .001, com- 
ponents, Bursts of EMG activity dur- 
ing the pre-CS base line occurred on 
117 of 228 test trials, with 269 distin- 
guishable bursts having a duration of 
553 sec. This is about twice as much 
activity as occurred during the base 
period of Exp. I. Thus, the elimina- 
tion of both respiratory control and the 
instructions to minimize movement ap- 
pear to have resulted in the expected 
effect, namely an increase in base level 
EMG activity. Following CS onset 
EMG activity showed a triphasic trend, 
with an increase from the base line on 
the initial few seconds following CS 
onset, then a decrease which parallels 
the deceleration, followed by another 
increase. Figure 6 depicts the second- 
by-second changes on test trials for 
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rate and EMG activity on test trials—7.0-sec. 
ISI, respiration not controlled. 


PAUL A. OBRIST 


both measures similar to Fig. 2 and 4, 
Therefore, as in Exp. I, a decrease of 
EMG activity occurred at about the 
time the UCS was expected. How- 
ever, in contrast to Exp. I, the decrease 
in EMG activity was essentially only 
a return to the pre-CS base line and 
not a reliable effect, ie. pre vs. post 
change, ¢ (18) = < 1.0 for both the 
duration of bursts and number of 
bursts. Also, the magnitude of the 
cardiac deceleration observed in the 
present experiment was larger than 
that observed in Exp. I, and a decele- 
ration was observed in several Ss who 
showed no decrease in EMG activity 
following CS onset. These results sug- 
gested the possibility that the decelera- 
tory effects and EMG effects were in 
fact unrelated, or alternately, that the 
inhibition of EMG activity was directly 
related to cardiac events, but that the 
cardiac effects were also related to 
still other events which were not a sig- 
nificant factor in the first study. 
Therefore, several additional analyses 
of the data were carried out in order 
to evaluate these possibilities. 

First, it was thought necessary to 
determine whether, in the present 
study, the magnitude of the cardiac 
deceleration and inhibition of EMG 
could be demonstrated to be directly 
related. The Ss were dichotomized 
into two groups, depending on whether 
EMG activity was or was not inhibited 
following CS onset, and then the 
cardiac changes of each group were 
compared. Eight Ss clearly showed 
an inhibition of EMG activity which 
was below the base line and similar to 
the effects observed in Exp. I. Of the 
remaining 11 Ss, 5 showed an overall 
increase in EMG activity following CS 
onset, while 6 showed little or no ac- 
tivity either in the base period or fol- 
lowing CS onset. The second-by-sec- 
ond cardiac changes for each group are 
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presented in Fig. 7. As can be seen, 
the deceleration was larger in the eight 
Ss whose EMG activity decreased 
most. The difference between groups 
averaged over the three consecutive 
seconds of peak deceleration for each 
S was 40 msec. which is a reliable 
effect, t (17) = 2.73, p < .02. There- 
fore, in this respect these effects are 
consistent with the results of the pre- 
vious two experiments. However, they 
suggested that some other events might 
be related to the deceleratory cardiac 
effects. As one of the two principle 
differences between the first and pres- 
ent experiment was the control of 
respiratory activity, a systematic eval- 
uation of respiration changes was then 
carried out. 

In general, there appeared to be a 
decrease on test trials of both respira- 
tory frequency and amplitude which 
was coincident with the cardiac decel- 
eration. In order to depict accurately 
any respiratory changes that might be 
coincident with the cardiac decelera- 
tion, the 15-sec. period following CS 
onset was divided into three blocks of 
5 sec. each. The middle block closely 
approximates the period in which the 
deceleration occurs. With respect to 
frequency there was a triphasic effect 
following CS onset. The mean changes 
in each consecutive block of 5 sec. 
relative to the base line were, +2.6, 
—0.8, and +1.6 inspirations per min- 
ute, The initial increase was a reliable 
effect, t (18) = 4.38, p < 0.1, while the 
decrease during the period of decelera- 
tion from the base line was not, f (18) 
= 1.63, p < .20. However, in the five 
Ss showing no EMG inhibition of base 
level activity a decrease was seen dur- 
ing the period of deceleration which 
averaged three inspirations per minute 
with respect to the base line. On a 
number of trials in these Ss, expiration 
was held for periods of 5-10 sec. 
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Fic. 7. Second-by-second changes in heart 
rate on test trials for Ss with and without 
a cessation of EMG activity following CS 
onset—7.0-sec. ISI, respiration not con- 
trolled. 


Respiration amplitude showed a 
somewhat similar effect with the most 


.pronounced change being a decrease 


during the period of deceleration. The 
mean amplitude of all inspirations 
peaking in each block of 5 sec. ex- 
pressed as a percent change from the 
base line was, Seconds 1-5, —1%, 
t (18) < 1.0; Seconds 6-10, -11%, 
t (18) = 277, p < .02; Seconds 11-15, 
+2%, t (18) < 1.0. This reduction 
in amplitude during Seconds 6-10 was 
again particularly pronounced in the 
five Ss whose EMG activity was not 
inhibited following CS onset, the mean 
change being —29%, In these five Ss 
the relationship between the magnitude 
of the cardiac change and the decrease 
in respiratory activity was also eval- 
uated. For this purpose, a score was 
obtained for respiratory activity which 
would reflect both amplitude and fre- 
quency changes. On each trial this 
consisted of totaling the amplitude of. 
all inspirations peaking within the post- 
CS Seconds 6-12, the period of maxi- 
mum deceleration and dividing this by 
the mean base-period amplitude. Then 
for each S the trials were dichotomized 
into the six showing the most and least 


190 


respiratory decrease. A second-by-sec- 
ond analysis of heart-rate changes was 
then done over all Ss for each type of 
trial. The deceleration was found to 
be both more sustained and about twice 
as large on trials having the greatest 
decrease in respiratory activity. The 
mean decrease in msec. over the period 
6-12 sec. following CS onset was —73 
and —35 for the two types of trials. 


Discussion 


The results of these experiments indi- 
cate a concomitance between cardiac and 
somatic-motor events in the conditioning 
situation which holds even when the 
direction of the cardiac change is experi- 
mentally manipulated. At about the time 
the aversive UCS was expected with a 
long 7.0-sec, ISI, heart rate decelerated; 
bursts of EMG activity if present de- 
creased; and respiration if not experi- 
mentally controlled became shallower and 
less frequent. Second, the small bursts of 
EMG activity which occurred between 
trials were accompanied by an accelera- 
tion of heart rate. One result of intertrial 
EMG activity was an elevation of heart- 
rate base levels which, in turn, increased 
the size of the anticipatory deceleration 
of heart rate. Finally, with a short 1.0- 
sec. ISI, the cardiac and EMG effects 
were reversed, with an increase in both 
types of activity on test trials commenc- 
ing shortly after the point in time in 
which the UCS would normally occur. 

There are several aspects of these re- 
sults which require some qualification. 
First, the fact that EMG activity is in- 
hibited in some muscles is no assurance 
that activity might not have increased in 
still other muscles from which no record- 
ings were taken (Malmo, 1959; 1960, on 
symptom specificity). ^ Although there 
appears to be no evidence available which 
bears directly on this problem, there is 
some which would suggest that the cessa- 
tion of EMG activity following CS onset 
may be indicative of a lack of muscle ac- 
tivity elsewhere on the body. For 
example, a general factor of musele; ten- 
sion has been found during rest and non- 
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aversive stimulation (Balshan, 1962). 
Also in pilot studies now in progress, 
the chin muscles have been found some- 
times to be sensitive to movements in 
other parts of the body, such as in me 
leg and arm. 

Instances of cardiac deceleration were 
clearly seen, particularly in Exp. I, even 
when there was no background EMG ac- 
tivity to be inhibited and when respira- 
tion was experimentally controlled. How- 
ever, before one can conclude that these 
cardiac events are independent of somatic- 
motor events, it would seem necessary to 
evaluate the latter more systematically. 
It would seem that the inhibition of so- 
matic activity cannot be detected from 
muscles which are already in a state of 
minimal activity, a problem commonly 
encountered in the present study, but one 
not faced with cardiac activity. 

Finally, the respiratory changes ob- 
served in Exp. III were a little sur- 
prising in the light of our previous work 
in which there appeared to be no con- 
sistent relationship between cardiac detec- 
tion and respiration (Wood & Obrist, 
1964). This discrepancy may be ac- 
counted for by the fact that in the 
earlier study, the quantification proced- 
ures were less extensive and were con- 
cerned primarily with determining 
whether an increase in respiratory ampli- 
tude was related to the cardiac change. 
The present results also appear to be in- 
consistent with data reported by Smith 
(1966) in which no heart-rate decelerá- 
tion was observed in anticipation of an 
aversive UCS when respiration was 
suspended between CS and UCS onset. 
Because of the incompatibility of the 
various experimental procedures, the re- 
lationship between respiratory and cardiac 
effects must remain uncertain. We are 
inclined to believe, on the basis of some 
available evidence, (Hastings, 1966; 
Obrist, et al, 1965; Hastings & Obrist, 
1967), that the deceleratory response can 
be in part independent of respiratory 
changes. 

The results of this study as well as of 4 
those recently reported concerning the 
relationship between cardiac and somatic- 
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motor activity in dogs using classical 
(Obrist & Webb, 1967) and operant 
(Webb & Obrist, 1967) conditioning pro- 
cedures, leave one reasonably impressed 
that somatic and cardiac activity can be 
highly related events in some situations. 
This possibility is consistent with several 
lines of physiological and psychophysio- 
logical evidence some of which have been 
referred to in the introduction as well as 
reviewed elsewhere (Obrist & Webb, 
1967). What these data seem to mean 
is that conditions which modify somatic- 
motor activity will also modify heart 
rate. The likelihood of a pronounced re- 
lationship between cardiac and somatic- 
motor activity, it should be noted, is not 
contraindicated by the recent studies dem- 
onstrating the modification of heart rate 
by classical and operant conditioning 
procedures when somatic-motor activity is 
controlled by curarization (Black, 1965; 
Miller & DiCara, 1967). Such studies 
demonstrate that cardiac changes are not 
necessarily dependent on the peripheral 
occurrence of a somatic-motor response. 
They do not indicate the extent to which 


* cardiac changes are controlled by the 


same central processes which are involved 
in the simultaneous initiation and con- 
trol of somatic-motor activity. There is 
reasonably impressive evidence for this 
possibility (see Obrist & Webb, 1967). 

Tf one assumes that the cardiac events 
observed in the conditioning paradigm are 
significantly related to  somatic-motor 
events either through peripheral or cen- 
tral mechanisms or both, then some idea 
of the basis for the direction of anticipa- 
tory cardiac changes can be formulated as 
well as for their relevance to behavior. 
For example, it can be hypothesized that 
the direction and magnitude of the antici- 
patory cardiac response will be, to a sig- 
nificant degree, determined by whether 
he organism initiates or inhibits somatic- 
motor activity preparatory to the UCS. 
Thus, the influence of the affective and 
motivational processes initiated by the 
CS on heart rate will be determined in 
part by whether an organism initiates 
or- inhibits activity in coping with the 
anticipated UCS. The type of activity 
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would seem to be a function of several 
variables such as the species of organism 
and the nature of the conditions. For 
example, where flight or fight appear pos- 
sible, activity might increase; where a 
recourse to activity is not possible, then 
a cessation of activity is resorted to, e.g., 
an animal freezing. In this respect, 
Engel (1950) has suggested that vaso- 
pressor syncope during stress, where there 
are very appreciable cardiac and somatic 
inhibitory effects, occurs “. . . during the 
experiencing of fear when action is in- 
hibited or impossible [p. 11].” This would 
appear to be the situation in which hu- 
man Ss frequently find themselves. Tt 
may be that vasopressor syncope is an 
exaggeration of the effects observed dur- 
ing aversive conditioning. The two 
types of situations where these inhibitory 
cardiac-somatic motor effects occur have 
at least two things in common, namely 
fear or at least an unpleasant anticipated 
affective experience, and a condition 
where the organism has to sit there and 
take whatever events are forthcoming. 

In one sense, the cardiac events in this 
context can be most simply viewed as re- 
flecting rather momentary and quite dis- 
crete alterations in somatic-motor activity 
which in and of themselves may have little 
direct relevance to behavioral processes. 
However, in another sense, these cardiac 
events, particularly the deceleratory ef- 
fects observed in human Ss, might be 
viewed in ways which are quite relevant 
to an understanding of behavioral pro- 
cesses. For example, the results of the 
present experiments suggest that the 
cardiac deceleration is an aspect of an 
inhibitory state. That the classical condi- 
tioning paradigm initiates inhibitory 
processes which are manifested in other 
response systems is indicated by several 
other lines of evidence, some of which 
have been recently reviewed by Kimmel 
(1966). There is commonly found an 
attenuation of both the CR and UCR of 
such excitatory responses as the GSR and 
eye blink which is clearly not attributable 
to adaptation effects. Perhaps the 
cardiac response as well as respiratory 
and EMG effects like those observed in 
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the present study are more direct mani- 
festations of these inhibitory states. The 
possible significance of such inhibition is 
not altogether clear though Kimmel 
(1966) suggests they are adaptive. Here 
the implication is that inhibitory mech- 
anisms act to attenuate the aversive quali- 
ties of the UCS. Another possibility is 
that these inhibitory mechanisms are also 
involved in attention processes which in 
turn are related to the facilitation of 
organism-environmental interaction. This 
is suggested by the fact that a decelera- 
tion of heart rate-is observed in condi- 
tions which involve nonaversive UCSs 
(Wood & Obrist, 1968), as well as con- 
ditions in which an S$ is preparing to re- 
spond somatically to some relevant stimu- 
lus such as in a reaction-time task (eg., 
Lacey & Lacey, 1964). In this case, the 
inhibition may be with respect to ongoing 
activity which is not relevant for either 
the reception of or response to the antic- 
ipated event. To the extent that such 
inhibition involves ongoing somatic ac- 
tivity, it might be expected to be reflected 
in cardiac rate changes, Therefore, the 
deceleration of heart rate may sensitively 
reflect preparatory alterations in at least 
one aspect of organismic activity which 
with respect to environmental interaction 
can be either inhibitory or facilitory in 
function. 
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The covariation of reaction time and alpha desynchronization was in- 
vestigated at foreperiods of 200, 500, 1,500, and 4,000 msec. The 
results demonstrated that reaction time and alpha desynchronization 
are significantly influenced by variations in the foreperiod condition. 
Reliably faster RT and maximal alpha desynchronization occurred 
500 msec. after the warning signal; however, within the context of 
the foreperiod, the 2 variables did not covary. A unitary under- 
lying arousal process was rejected in favor of multiple neural 


arousal processes. 


EEG activation and behavioral 
arousal have been used interchangeably 
as indexes of attention or alertness 
(Feldman & Waller, 1962). Implicit 
in the synonymous use of these con- 
cepts is the assumption that a unitary 
mechanism, the ascending reticular 
activating system (ARAS), subserves 
both processes (Lansing, Schwartz, & 
Lindsley, 1959; Lindsley, 1958). 
However, results are accumulating 
from an increasing series of behavioral 
and surgical studies (Belanger, 1965; 
Feldman & Waller, 1962; Hermelin & 
Venables, 1964) that challenge the de- 
pendence of EEG activation and be- 
havioral arousal on the excitation of the 
ARAS. 

Several reaction-time (RT) experi- 
ments ( Fedio, Mirsky, Smith, & Parry, 
1961; Lansing et al, 1959) with nor- 
mal Ss have yielded results in support 
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of the relationship between EEG acti- 
vation and behavioral arousal. Speec 

of response has been employed as the 
behavioral index of arousal with the 
following results: fast response with. 
low voltage, fast EEG activity (alpha 
desynchronization), and show response 
with high voltage, slow EEG activity 
(alpha rhythms). The work of Lansing 
(Lansing et al., 1959) is representative 
of the RT model employed and the re- 
sults obtained. Visual reaction times 
were measured under three conditions. 
Foreperiods (300 to 1,000 msec.) re- 
sulting in the highest percentage of 
alpha desynchronization produced the 
fastest mean reaction time (206 msec.). 
Shorter foreperiods (50 to 250 msec.) 
with a larger percentage of trials intro- 
duced prior to alpha desynchronization 
yielded a mean RT of 255 msec. When 
no warning signal was employed the 
mean RT was 280 msec. The savings 
in response speed were ascribed to the 
differential percentage of trials intro- 
duced during alpha desynchronization 
under the three conditions. 

These findings suggest that: (a) it 
is presumably most advantageous to 
perform during the period of alpha 
desynchronization and (b) the fore- 
period that is optimal in assuring that 
the presentation of the stimulus and 
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alpha desynchronization coincide will 
produce the fastest mean RT. Such 
findings would be strengthened with 
the demonstration that RT varies with 
the percentage of alpha desynchroniza- 
tion at other durations of foreperiod. 
This is testable, since it is well docu- 
mented (Hohle, 1967) that lengthening 
the period between warning signal and 
response reduces speed of response. 

The critical issue is whether the tem- 
poral coincidence of reduced RT and 
maximal alpha desynchronization at 
the same foreperiod is adequate evi- 
dence that the measures covary. A 
major shortcoming in previous studies 
has been the failure to consider fore- 
period duration as an important deter- 
minant of speed of response when 
ample evidence exists for a relationship 
between RT and foreperiod. Compari- 
sons were made between RT and per- 
centage of alpha desynchronization at 
different foreperiod conditions, yet no 
attention was paid to the effects that 
variation in foreperiod produce on 
either of the proposed measures of 
arousal. It can be proposed that warn- 
ing signals initiate separate internal 
processes that independently result in 
faster response and alpha desynchroni- 
zation. An alternative explanation then 
is that speed of response and alpha de- 
synchronization are independent pro- 
cesses temporally locked at short fore- 
periods. If RT is related to foreperiod 
duration and independent of alpha de- 
synchronization, then the two processes 
should not covary when evaluated un- 
der the same foreperiod condition. 


MeTHOD 


In the course of this study, 
records of 64 normal male college students 
were examined. Fifty males (or 78% of 
the group) exceeded the criterion set for 
the introduction of the warning signal dur- 
ing the presence of alpha waves on at least 
75% of the trials and served as Ss. The 
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Ss were paid volunteers and ranged from 
18 to 25 yr. of age. 
EEG tracings were obtained from the right 


approximately 

national “10-20" system. The data were 
monitored on a 10-channel Gilson Medical 
Electronic 


The warning signal was a 45-msec. flash, 


synchronization with S’s eyes closed. 

The stimulus to respond was an 80 db., 
1,000 c/sec tone produced by a standard audio 
of presentation and the 


vated by a 1,000 c/sec tone, The clock 
and tone both release 
of a standard telegraph key. In the absence 
of tone, the key was main! in a de- 


| 
H 
HH 


perceptible. 
Sixty-four RTs were obtained from each 
S, 16 at each foreperiod. 


warning signal was mechanically introduced 
by E during a train of alpha waves. The 
reaction tone was electronically presented 
following the warning signal The inter- 
trial interval between the reaction stimulus 
and the following warning signal varied 
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from 8 to 24 sec. plus the time mecessary 
for E to detect visually the presence of the 
next train of alpha waves. 

The experimental index of nonaroused 
functioning used, was alpha activity (8-13 
c/sec. activity). The index of aroused func- 
tioning was the presence of alpha desyn- 
chronization coincident with the introduction 
of the response tones. Alpha desynchroniza- 
tion was defined by the following changes 
in EEG tracing: a decrease in the amplitude 
of the alpha rhythms to less than 50% of 
the height of the smallest of the three alpha 
rhythms present immediately prior to the 
onset of the warning signal for a duration 
of more than .2 sec. The warning signal 
was assumed to have evoked the change if 
the reduction in amplitude occurred within 
4 to .6 sec. after the onset of the warning 
signal. The onset of desynchronization 6 
sec. after the onset of the warning signal 
or following the dissipation of a period of 
evoked desynchronization was not attributed 
to the effects of the warning signal and was 
denoted as spontaneous desynchronization. 


RESULTS 


The reciprocal of the raw score was 
employed to normalize the distribu- 
tions. The effect of foreperiod dura- 
tion on RT was determined by treat- 
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between foreperiods as a function of blocks 
of trials. 


ing the 16 RT at each foreperiod as 
four blocks of four trials each. An 
analysis of variance for repeated mea- 
sures yielded an F (3, 400) = 775 
for foreperiod effects, significant at bet- 
ter than the .01 level. For each fore- 
period condition, Fig. 1 shows the 
average response time with each point 
representing the mean of four trials. 
High scores on this figure denote fast 
reaction time. Speed of response was 
significantly faster with the 500 msec. 
foreperiod presentation (p «.01 by 
Duncan's new multiple-range test). 
In contrast, no reliable differences in 
speed of response were found among 
the 200, 1,500, and 4,000 msec. con- 
ditions. Although slight increments in 
the speed of response can be seen dur- 
ing the second block of trials (Fig. 1), 
these fluctuations were not significant : 
F (3, 400) =.91. 

In Fig. 2, the mean RT and median 
percentage of trials introduced during 
evoked alpha desynchronization are 
plotted by foreperiods. The effects of 
foreperiod duration on RT are de- 
picted by the solid line. The median 
percentage of evoked alpha desyn- 
chronization ranged from a high 8796 
at the 500 msec. foreperiod condition to 
a low 096 at the 4,000 msec. foreperiod 
condition. The Wilcoxin matched pair 
sign-rank test (Siegel, 1956) applied 
to the desynchronization data revealed 
that significantly different (p< .01) 
percentages of stimuli were introduced 
during evoked alpha desynchronization 
for each foreperiod condition. No sys- 
tematic association can be observed be- 
tween RT and percentage of evoked 
alpha desynchronization. The fastest 
RT (246 msec.) and the highest per- 
centage (8796) of evoked alpha desyn- 
chronization resulted with the 500 
msec. condition ; however, the percent- 
age of evoked alpha desynchronization 
and RT did not covary in the remain- 
ing foreperiod conditions. 
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the median percentage of trials introduced 
during evoked alpha desynchronization. 


In order to determine if RT and per- 
centage of evoked alpha desynchroniza- 
tion covary, independent of foreperiod 
effects, two S groups were formed for 
each foreperiod condition. A distribu- 
tion of RTs was determined for each 
foreperiod condition and Ss were 
identified as above (slow responders) 
or below (fast responders) the median 
RT of the foreperiod. Following these 
criteria, an S could be designated a 
slow responder at one foreperiod and 
a fast responder at another. The 
median percentage of trials introduced 
during evoked alpha desynchronization 
for groups of slow and fast responders 
over foreperiods is summarized in 
Table 1. The results of a series of 
Mann-Whitney U tests, corrected for 
the number of tied ranks, show that 
the percentage of trials introduced 
during evoked desynchronization was 
not significantly greater for the fast 
responders at any of the foreperiod 
conditions. In Table 2, spontaneous 
and evoked desynchronization condi- 
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TABLE 1 


MEDIAN PERCENTAGE OF TRIALS INTRO- 
DUCED DURING EVOKED ALPHA 


DESYNCHRONIZATION 
Group 
Foreperiod 
Fast Slow zZ 

Responders | Responders Score 
200 56 53 0,44 
500 93 86 1.10 
1500 13 43 —2.06 
4000 0 0 —1.05 


Note.—For .01 level of significance (one tail) Z 2 
2,33, and f is nonsignificant. 


tions were combined and considered as 
identical entities, Data are omitted for 
the 200- and 500-msec. conditions since 
the brevity of these conditions excludes 
the possibility of performance under 
spontaneous desynchronization. The 
results of a second series of Mann- 
Whitney U Tests were consistent with 
the previous findings. 

An analysis of variance for repeated 
measures was performed comparing 
mean RTs for the 200-msec. foreperiod 
during two periods of cortical activity : 
alpha rhythms and evoked desynchron- 
ization. The mean RT for each S 
under the two conditions was based 
on a minimum of five trials for each 
condition. Following these criteria, a 
separate mean RT was computable for 
30 Ss under each cortical condition. 
The results of the analysis revealed no 
significant differences in response speed 


TABLE 2 


MEDIAN PERCENTAGE OF TRIALS INTRODUCED 
DURING SPONTANEOUS AND EVOKED 
ALPHA DESYNCHRONIZATION 


Group 
Foreperiod p 
Fast, Slow, zZ 
Responders | Responders Score 
1500 35 50 —1.75 
4000 25 40 —0.78 


Note.—For .01 level of significance (one tail) Z 2 
2.33, and is nonsignificant. 
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for trials introduced during these two 
distinct periods of cortical activity (F 
—.38; p>.01). The most striking 
aspect of these data was the ability of 
Ss to respond equally well under both 
electrocortical conditions. Exactly 15 
Ss responded slightly faster during 
periods of alpha and their 15 counter- 
parts responded slightly faster during 
periods of evoked desynchronization, 


Discussion 


A comparison of foreperiods with re- 
spect to speed of response disclosed re- 
liably faster RT 500 msec. after a warn- 
ing signal. Alpha desynchronization was 
also present significantly more often with 
trials of the 500-msec. presentation, 
Such results have been previously inter- 
preted as evidence for a relationship be- 
tween EEG activity and RT and as sup- 
port for the hypothesis that EEG activa- 
tion and reduced RT reflect alerted states 
produced through the excitation of the 
ARAS (Lansing et al, 1959). Our 
findings do not favor this interpretation 
of the results, since RT and alpha desyn- 
chronization do not covary in a systematic 
manner in the context of the other fore- 
period conditions. An examination of 
the 200, 1,500, and 4,000 msec. fore- 
periods raises the possibility of the fortui- 
tous occurrence of maximal desynchron- 
ization and fastest speed with the 500- 
msec. foreperiod, since similar RT per- 
formance was associated with distinctly 
different percentages of desynchronization 
during all other foreperiod conditions, 
The Ss appear able to perform as well 
whether the median index of alpha desyn- 
chronization is 0 or 56%. 

Furthermore, when comparisons are 
made with the foreperiod variable con- 
trolled, the results do not support the 
hypothesized relationship between EEG 
activity and RT. At no foreperiod does 
the group of fast responders have a 
significantly greater percentage of trials 
introduced during alpha desynchroniza- 
tion. Within foreperiods, the speed of 
Tesponse appears to be independent of 
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whether a higher or lower percentage of 
trials is introduced during alpha desyn- 
chronization. 

The present study does not replicate 
the work of Fedio (Fedio et al., 1961) 
or Lansing (Lansing et al., 1959) since 
there are procedural differences in modal- 
ities stimulated and foreperiods selected, 
The results of this study, however, sug- 
gest that certain variables have been con- 
founded in prior work. For instance, 
the commonly employed “between fore- 
period analysis” in these studies leaves 
undecided whether to attribute reduced 
RT to foreperiod duration, EEG activa- 
tion, or a combination of the two. Thus, 
Previous support for the relationship be- 
tween EEG activation and RT may re- 
quire further evaluation. The “within 
foreperiod analysis” as employed in the 
present study, provides the means to con- 
trol foreperiod effects. Under such con- 
ditions, RT and alpha desynchronization 
appear to vary independently. Both pro- 
cesses, however, are significantly influ- 
enced by variations in foreperiod length. 

Hermelin and Venables (1964) also 
failed to obtain results comparable to 
those of Lansing. They suggested that 
the use of visual, rather than auditory, 
warning signals might account for the 
discrepancies. Their assumption is based 
on the work of Bernhaut (Bernhaut, 
Gellhorn, & Rasmussen, 1953) who con- 
siders visual stimuli to have less of an 
arousing effect than auditory stimuli even 
though visual stimulation is believed to 
be the more effective producer of alpha 
blocking. This suggestion is paradoxical 
in itself, since to accept this as a valid 
explanation of the discrepancies would 
entail accepting several suppositions 
which are contradictory to the postulation 
of a unitary arousal mechanism, the 
ARAS. First, it proposes that change 
in cortical rhythms is not a direct reflec- 
tion of the degree of activation; that a 
less aroused S could demonstrate greater 
cortical activation than a more aroused 
S. Second, it implies that the degree of 
arousal is dependent on the nature of the 
stimulus rather than the intensity of 
arousal that it induces, which is rather 
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inconsistent with the formulation of an 
unitary arousal dimension. 

A recent study by Thompson and Bot- 
winick (1966) employed an auditory 
warning signal with both visual and 
auditory stimuli and offers results in 
substantial agreement with the present 
fndings. Their procedure involved the 
computation of within-S correlations be- 
tween change in EEG amplitude and RT 
at each foreperiod. No evidence for a 
positive association between RT and alpha 
desynchronization was revealed. 

The results of this study do not sup- 
port the interpretation of an association 
between speed of response and alpha 
desynchronization. The feasibility of 
continuing to represent the two variables 
as reflecting a unitary underlying process 
can be questioned since the delimited 
covariation observed, can be reasonably 
attributed to foreperiod length. The 
postulation of several neural processes 
would be more in keeping with present 
knowledge. It should be noted, however, 
that in the present study, as well as those 
reviewed, RT has been the sole measure 
of behavioral arousal. The possibility 
exists that behavioral measures other than 
RT might correlate with alpha desyn- 
chronization. 

On the other hand, the present data 
are in accord with recent surgical (Feld- 
man and Waller, 1962) and pharmacolog- 
ical studies (Bradley, 1958; Wikler, 
1952) that provide evidence for dissocia- 
tion between electrocortical activity and 
behavioral arousal. Wikler (1952) re- 
ported that pharmacologically produced 
EEG sleep patterns were accompanied 
by marked differences in the behavior of 
animals; all which he described as alert. 
Feldman and Waller (1962) worked 
with animals with nearly complete bi- 
lateral lesions in the posterior hypo- 
thalamus. Distinct alpha desynchroniza- 
tion could be produced; however, slight 
stereotyped head and limb extensions dur- 
ing midbrain reticular stimulation were 
the only behavioral responses of the 
somnolent animals. 
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A paired-associate list of 6 CV-CV pairs was constructed so that 
stimuli and responses were systematically related in terms of a linguis- 
tic feature of the consonants (ie. voicing, as in "pa-ba"). 4 experi- 
mental groups of 20 Ss each learned the list. Instructions varied in the 
way they directed attention to the Systematic relations of the pairs. 
A control group (N =20) learned an unsystematic list of the same 
terms. Experimental Ss directed to attend to their mouths learned 
faster than the other experimental groups which in turn learned faster 


; than the control group. 

In recent years psychologists inter- 
ested in language have increasingly 
turned their attention toward the anai- 
yses and theories of language that have 
been developed by linguists. Many ex- 
perimental studies (e.g., Greenberg & 
Jenkins, 1964; Liberman, Cooper, 
Studdert-Kennedy, Harris, & Shank- 
weiler, 1965; Miller & Nicely, 
1955; Wickelgren, 1966) have indi- 
cated that Ss’ judgments of “difference 
from English,” perception and identifi- 
cation of auditory signals, errors in 
short-term memory, etc., are system- 
atically related to linguistic constructs 
having to do with the sound system of 
English, 

One way of characterizing the sound 
segments of English is by stating their 

1 This study was proposed in 1960 when 
the psycholinguistic work of Greenberg and 
enkins was sponsored by the Social Science 
Research Council. Philip Gough and David 
Duggan ran some of the early versions of 
the study. The experiments reported here 
were conducted under the auspices of the 
Center for Advanced Study in the Be- 
havioral Sciences where the authors spent 
the 1964-65 academic year. The authors 
wish to thank Mark Liberman for serving 
as experimenter, 


distinctive features (Halle, 1964a, 
1964b; Jakobson & Halle, 1956; 
Trubetskoy, 1938). The theory of dis- 
tinctive features holds, roughly, that 
speech sounds can be abstractly char- 
acterized by a small set of dimensions 
such as voicing, nasality, etc. These 
dimensions can be related by the theory 
to both acoustic and articulatory events. 
In the binary version of Jakobson and 
Halle, each of the theoretical dimen- 
sions has two possible values, plus or 
minus, e.g., “voiced” or “voiceless” for 
the voicing dimension, and every signif- 
icant sound segment of the language 
can be specified by stating a value on 
each of the dimensions. (The actual 
realizations of the segments in either 
the articulatory or acoustic domains 
may, however, have more than two 
values for some features. ) 

Under the assumptions of this theory, 
a typical phonemic symbol such as /p/ 
or /b/ is simply a shorthand notation 
for a set or vector of features. Words 
can be represented as matrixes with the 
successive columns standing for the 
sound segments and the various rows 
designating the distinctive features. 
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The cells of the matrix contain pluses 
or minuses depending upon whether 
that column is positively or negatively 
specified with respect to that feature. 
The theory states, e.g., that the ma- 
trixes of the words pat and BAT differ 
only in the first column. Within that 
column the sound segments (/p/ and 
/b/) have the same values on all of 
the features except voicing. 

The extent to which two segments 
share the same values on these dimen- 
sions (e.g. distinctive features) may 
be taken provisionally as a measure of 
their similiarity both in terms of their 
articulation and their acoustics, i.e., in 
how they are made and how they 
sound. In the case of /p/ and /b/, 
e.g., most of the activity of the mouth 
is the same for the initial phonemes 
of par and BAT. The only difference 
is that in the case of BAT, the vocal 
cords are vibrating before the /b/ 
is released, while in the case of PAT 
the vocal cords do not begin to vibrate 
until after the /p/ is released. This is 
true for all of the phone pairs that dif- 
fer only in voicing. Thus, it is possible 
to specify a systematic relation between 
all voiced-voiceless pairs (though 
linguistically naive Ss rarely can ver- 
balize this difference). 

Any systematic relation between 
paired stimuli and responses furnishes 
a potential aid to the learning of those 
associations. The present study in- 
vestigated whether systematic linguistic 
relations can have this effect on learn- 
ing. 

The experiment consisted of present- 
ing a paired-associate list in which the 
stimuli were the voiceless members of a 
voiceless-voiced pair and the responses 
were the voiced members of these pairs. 
The hypothesis was, of course, that 
such pairs would be learned more read- 
ily than pairs in which the stimulus- 
response relations were not phonolog- 
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TABLE 1 
STIMULUS AND RESPONSE TERMS 
Stimuli Responses 
pa ba 
ta da 
ka ga 
fa va 
sa za 
cha ja 


ically systematic. Further, the study 
attempted to identify the relative ef- 
fectiveness of different cues for such 
systematic learning. Since distinctive 
features are related to both acoustic 
and articulatory correlates, it was pos- 
sible to assess the effects of instructions 
designed to draw Ss’ attention to one 
vs. the other of these sources. 


METHOD 


Subjects—Five groups of undergraduate 
volunteers were utilized in this study. Fif- 
teen females and five males were in each 
group. Each S was randomly assigned to 


“a group before he appeared at the laboratory. 


The Ss were run individually. 

Material.—Each S learned a single, six- 
pair, paired-associate list in which the stimu- 
lus and response terms were single syllables. 
Each syllable consisted of a consonant-vowel 
sequence. The vowel was invariant across 
all of the pairs, only the consonants differed. 
The consonants of the stimulus terms con- 
sisted of the voiceless members of pairs of 
consonants that were identical in all dis- 
tinctive features except voicing. The re- 
sponse-term consonants consisted of the 
voiced members of these pairs (see Table 1) 

Design and procedure —Four experimental 
groups all learned the same linguistically 
systematic list in which the stimuli and re- 
sponses were identical in terms of all features 
except voicing (ie. the pairs were pa-ba, 
ta-da, ka-ga, etc.). 

Standard paired-associate learning instruc- 
tions were read to each group. For one of 
the four experimental groups, the S tandard 
group, these were the only instructions given. 
However, three of the four experimental 
groups were given additional instructions 
which told them to attend to varying cue 
sources. One of these, the System group, 
was simply told to look for a systematic re- 


202 


lation between the stimulus and response 
terms. Their additional instructions were: 
“Tt may help you to learn if you look for a 
systematic relation between the left-hand 
syllables and the right-hand syllables." 
Another group, the Sound group, was given 
the above instructions to look for a system 
and in addition was told: "Pay attention to 
the sound of the syllables." Finally, the 
Mouth group was given the above instruc- 
tions to look for a system and also was told : 
“Pay attention to what your mouth is doing 
as you say the syllables" Four Ss in each 
of the experimental groups were randomly 
assigned to each of five lists which differed 
in the presentation order of the material. 
Each of these lists had five randomizations 
to control for serial learning. 

The Control group learned a list in which 
the response terms of Table 1 were randomly 
paired with the stimulus terms. This group 
received standard paired-associate instruc- 
tions. Five different sets of random pairings 
between the stimuli and the responses were 
used with four Ss assigned to each of these 
sets, 

After instructions, the lists were presented 
on a Lafayette memory drum at a 2:2-sec, 


J. J. JENKINS, D. J. FOSS, AND J. H. GREENBERG 


rate with an 8-sec. intertrial interval. The 
learning criterion was two successive error- 
less trials except that a minimum of 10 
trials was given to every S. After criterion 
had been reached, Ss were given a short 
questionnaire which asked them how they 
had learned the list and what, if anything, 
about the syllables made the list easy or 
hard to learn. 

Two learning measures were obtained: 
(a) the number of trials to reach the cri- 
terion of two successive errorless trials, and 
(b) the number of correct responses per trial 
over the first 10 trials. Analyses of vari- 
ance tests were performed on the two mea- 
sures and orthogonal contrast comparisons 
were made between the groups. 


RESULTS 


The results are graphically shown in 
Fig. 1 and Table 2. The main effect 
due to groups was significant for both 
the trials-to-criterion measure and the 
number-correct-over-trials measure, In 
both cases only the linear component of 
the orthogonal polynomial contrast set 
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was significant, F (1, 90)— 13.58, 
p < 001 for the trials to criterion mea- 
sure and F (1, 90) = 13.03, p < 001 
for the number correct over trials mea- 
sure. There was no effect due to sex 
or the interaction of groups with sex 
for either measure. 

The effect due to trials was, of 
course, highly significant on the num- 
ber-correct-over-trials measure, F (9, 
82) = 53.06, p< 001. On this mea- 
sure there also was a significant inter- 
action of Groups x Trials, with the 
linear component of the groups’ effect 
contributing most to this interaction, 
F (9, 82) — 2.16, p < .05. 

The second contrast set permitted 
the following observations. First, on 
the trials-to-criterion measure and on 
the number-correct-over-trials measure, 
the Control group differed significantly 
from the mean of the four experimental 
groups for the trials to criterion F (1, 
90) = 576, p < .025, and for the num- 
ber correct over trials F ( 
641, p< 025. Second, this con- 
trast set indicated that the Mouth 
group differed significantly from the 
mean of the other three experimental 
groups on both learning measures, F 
(1, 90) = 10.43, p? < Ol for trials to 
criterion and F (1, 90) = 8.16, p < 01 
for number correct over trials. This 
contrast comparison set indicated that 
the Groups X Trials interaction was 
due, primarily, to an interaction across 
trials of the Mouth group with the 
other three experimental groups, FQ, 
82) = 2.79, P< Ol. No other com- 
parisons within this set proved signif- 
icant. 

The results may be taken to indi- 
cate that the five groups are arrayed 
into three families: (a) the Control 
group, (b) the Standard, System, and 
Sound groups, and (¢) the Mouth 
group. 

Table 3 summarizes the results of 


TABLE 2 
TRIALS TO CRITERION 


Group 


Mouth | Sound | System Stand- | Control 


Males 10.00 | 20.60 | 14.20 22.80 | 18.80 
Females | 10.33 | 15.33 15.73 | 16.40 20.07 
Xur 10.25 | 16.80 | 15.35 18.00 | 19.75 


Note.—Five males and 15 females were in each group. 


the questionnaire data. The rows are 
mutually exclusive and indicate 
whether Ss referred to their articula- 
tors, to some aspect of the sound of 
the stimuli, or to neither of these. The 
Ss in the “other” category typically 
said that they “just associated" the re- 
sponses of used some mnemonic mem- 
ory aid. If an S mentioned his articula- 
tors at all he was assigned to “mouth” 
category even if he also said something 
about the sound or mentioned some 
other mnemonic. Similarly, if an S 
mentioned that both the stimuli and 
the responses “sounded alike” and that 
he used some other mnemonics, he was 
assigned to “sound” category. The 
results of the questionnaire give some 
indication that the instructions were 
successful in manipulating Ss’ attention 
to the various possible cues. 

The acquisition data for the eight 
Ss in the Mouth group who made some 
reference to articulation on the ques- 
tionnaire was compared with the ac- 
quisition data of the 12 Ss in that group 


TABLE 3 
FREQUENCY OF CirixG Cug SOURCES ON THE 
QUESTIONNAIRE 
Group 
Cues 
Mouth | Sound | System Stand- | Control 
Mah E a E m 
Mouth| 8 5 zi 2 0 
Soun 8 13 9 11 3 
Other 4 2 4 7 17 
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who did not mention articulation, The 
mean number of trials to criterion for 
the former subgroup was 12.5 while 
this measure for the latter sub-group 
was 10.83. This differences was not 
statistically significant, ¢ (18) = 1.67, 
p> .10. 


Discussion 


This study indicated that relations be- 
tween sounds that are said to be regular 
within linguistic theory can systematically 
influence the learning of stimuli and re- 
sponses which involve these sounds, Thus 
it further demonstrated that linguistic con- 
Structs may play an important role in 
psychological accounts of verbal behavior, 

This experiment also gave some indica- 
tion that articulatory cues are relatively 
more effective for utilizing these linguistic 
relations. When directed to attend to 
their articulatory activities, Ss learned 
much faster than Ss in any other condi- 
tion, 

As may be seen in Table 2, the pattern 
of the acquisition data for the males does 
not conform to the pattern summarized 
above. The lack of significant overall 
interaction of groups with sex is due to 
the large variability and small N for the 
males, If a larger number of males had 
been tested, either a significant interaction 
would have resulted or the data for the 
males would have conformed to that of the 
females. It seems reasonable to expect 
the latter. 

The acquisition rate for all of the 
groups was much slower than would gen- 
erally be observed in a six-item paired- 
associate task. The magnitude of this ef- 
fect is especially marked for the Control 
group (19.75 trials to criterion). A great 
deal of interference apparently existed in 
this situation and the effects of the experi- 
mental manipulations are superimposed 
upon this interference. A question nat- 
urally arises concerning the source of the 
interference. 

The most obvious and probable source 
of the learning difficulty lies in the sim- 
ilarity among all the items, both stimuli 
and responses. Until the relevant cue is 
utilized by S, it must be difficult for him 


to form the proper response pool or to 
characterize the stimulus alternatives in 
the "right" way (ie, to discover what 
aspect of stimulus variation is important) 
much less correctly associate the specific 
pairs of items, It should be noted that 
discovering the correct cue solves all of 
these problems at once, 

If learning is conceived in terms ofa 
hypothesis-formation model then it would 
seem that S operates with the burden of 
interference until the correct hypothesis 
has been discovered. In this case, the in- 
structions to attend to various cue sources 
may be considered to be influencing the 
nature of the hypotheses examined and 
the order of examining them. The ar- 
ticulatory information cuts down the po- 
tential hypothesis space most quickly, 
This formulation would imply that sys- 
tematic linguistic relations can be utilized 
as hypotheses but that generally Ss do 
not do so. 

Of course, the above conjecture says 
nothing about whether Ss must be able to 
verbalize such hypotheses. Similarly, a 
more incrementally oriented learning 
model does not assume that Ss can ver- 
balize the relevant cues, Verbalization of 
cues is neutral with respect to whether 
those cues (or hypotheses) were in fact 
utilized. As Table 3 indicates, Ss in the 
Mouth group did not refer to their ar- 
ticulators more often than Ss in the Sys- 
tem group. Yet the former Ss learned 
much faster. Also, Ss in the Mouth 
group who referred to their articulators 
in the questionnaire did not learn faster 
than Ss in that group who did not men- 
tion articulation. Yet the group per se 
learned considerably faster than any other 
group. These results give some support 
to the assertion that Ss needn’t be able 
to accurately verbalize distinctions or 
Strategies that underlie their behavior. 
Such results will probably be common in 
psycholinguistics, 

Tf one considers what was learned by Ss 
during the experiment, it is not clear that 
what Ss in the Mouth group learned was 
to add voicing to the consonants of the 
stimulus terms. Perhaps they learned to 
return their articulators to the consonant 
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position of the stimulus terms and to do 
whatever else is possible with their vocal 
cords. It may be the case that a group 
of Ss that is given a list in which all the 
pairs differ in terms of voicing, but some 
voiceless terms are stimuli and some are 
responses, will learn just as readily as the 
present Mouth group. 

Finally, this study was not concerned 
with specifying processes involved in 
phonological perception or production. 
Rather, it was a study of the psycholog- 
ical reality of a linguistic construct and 
of the cues which might let Ss exploit the 
linguistic systematicness which was repre- 
sented in the experiment. The linguistic 
construct, distinctive features, appears to 
be a good candidate for process type in- 
vestigations. Articulatory cues seem to 
be relatively more effective in correctly 
guiding Ss' behavior. 
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The hypothesis is examined that the essential factor in the size cue 
to depth from familiar objects is the perceived size (S') per unit 
of retinal size (9) of each of the objects, with the object having the 
largest value of S'/O appearing to be the more distant object. 5 
familiar objects (a box of cough drops, a half-dollar, a tape dispenser, 
a door key, and a tube of tooth paste) were simulated. The objects 
were presented in pairs and the Os indicated their perceived widths 
(S'), the perceived distance (D') of each from himself, and the per- 
ceived depth (d') between the objects in each pair. In agreement with 
the above hypothesis, the results indicate that the perceived depth 
between the objects was a monotonic function of the difference between 
the values of S'/9. The average values of d' and D' were similar to 
the simulated values for distances up to approximately 3 ft. from the 
observer. For greater distances the error in the average d' and D' 
values increased with an increase in the simulated distances. The con- 
cept of S'/9 as the significant factor in the familiar size cue to depth 
permits this cue system to be of significance in a wide variety of 


naturally occurring situations. 


If two luminous rectangles of the 
same shape but with different retinal 
Sizes are viewed monocularly in an 
otherwise dark room, the retinally 
smaller rectangle will be perceived to 
be the more distant rectangle. This is 
an example of the relative size cue to 
the perceived depth between objects, 
with the perceived depth between ob- 
jects being termed a perception of exo- 
centric distance (Gogel 1968). The 
rectangles in this case are nonfamiliar 
Objects in the sense that there is no 
reason why they would be perceived 
as having one size rather than another. 
If the rectangles are patterned, how- 
€ver, so as to appear as familiar objects, 
for example, as playing cards, they will 
have a perceived size which, it would 
be expected, would be the same as the 
physical size of a playing card; and 
again the card with the smaller retinal 


3 This study was conducted at the Civil 
Aeromedical Institute, Oklahoma City, Okla- 
homa. The authors wish to thank John J. 
Coyle, Jr. for his assistance in analyzing the 
data of the study. 


size would appear to be the more dis- 
tant. This is an example of the fa- 
miliar size cue to perceived exocentric 
depth. The similarity of these two 
cases raises the question as to whether 
the exocentric distance cue of familiar 
size can be subsumed under that of 
relative size (Hochberg & Hochberg, 
1952, 1953; Hochberg & McAlister, 
1955). This is answered in the affirm- 
ative in a study by Hochberg and 
Hochberg (1952) in which it was 
found that the perceived depth between 
different kinds of similarly shaped fa- 
miliar objects was determined by the 
retinal, not the assumed, sizes of the 
objects. A study by Epstein and 
Baratz (1964, Exp. II), however, came 
to a different conclusion. In this latter 
study the perceived depth was mea- 
sured between objects representing a 
dime, a quarter, and a half-dollar, 
presented in pairs. Each coin sub- 
tended three possible retinal sizes such 
that the relative and familiar size cues 
could be placed in agreement or in 
opposition. The results of this study 
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indicated that the perceived depth be- 
tween the pairs of coins when the two 
cue systems were in opposition was in 
agreement with familiar, not relative, 
size. Recently, it has been asserted 
after reviewing the evidence (Gogel, 
1964), that the familiar size cue can- 
not be subsumed under the relative 
size cue but that both can be subsumed 
under the concept of perceived size 
(S’) per unit of retinal size (visual 
angle, 0) of the object. Specifically, 
it has been asserted that in the absence 
of other distance cues, two objects 
which have the same values of S'/60, 
will appear equidistant. If the two 
objects have different values of 5'/6, 
the object with the largest value of 
5'/0 will appear to be the more dis- 
tant object. 

An advantage in defining the size 
cue to exocentric distance in terms of 
the 5'/0 values of the several objects 
is that the ratio 5/0 can be applied 
to objects of irregular as well as of 
regular shape (Gogel, 1964). For 
example, suppose that an irregularly 
shaped object such as a door key is 
presented in a plane frontoparallel to 
the O. According to the above dis- 
cussion, since all parts of the key ap- 
pear to O to be equidistant, all parts 
of the key should have the same 
value of S’/6. If the perceived size of 
any portion of the key and the 0 value 
of this same portion of the key are 
known, the value of $'/0 for all the 
key is determined. Tt follows from this 
definition of the size cue to exocentric 
distance, that this cue will occur be- 
tween any familiar objects regardless 
of their irregularity and differences in 
shape and indeed between any objects 
for which $'/0 values can be specified. 
The purpose of the present experiment 
was to test the hypothesis that the per- 
ceived depth between two familiar ob- 
jects of different retinal and perceived 
sizes will differ when the $'/0 values 
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of the objects differ, with the object 
having the larger 5'/0 appearing to be 
the more distant object of the pair. 


METHOD 


Apparatus —Five photographic positive 
color transparencies of familiar objects (cen- 
tered in 31X4 in. slides) were used as 
stimuli. The familiar objects were a box of 
Luden’s cough drops, a half-dollar, a small 
Scotch Tape dispenser, a door key, and a 
small tube of Crest tooth paste. All of the 
objects were photographed with their largest 
dimension horizontally and frontally oriented 
with respect to the camera. The objects 
were photographed against a black back- 
ground so that only the image of the object 
was transparent. The transparencies were 
mounted in front of light boxes con- 
sisting of diffusing surfaces homogeneously 
transilluminated by flourescent lamps. 
The luminances of the sources were 
adjusted to match the relative luminances of 
the real objects from which the transparen- 
cies were made. The relative luminances of 
the real objects were measured while using 
a diffuse white illumination at a constant 
distance from all the objects. When 
mounted on the light sources, the values of 
the most luminous parts of the transparencies 
were 1.2, 1.0, 0.6, and 0.6 ftl. for the half- 
dollar, tube of tooth paste, box of cough 
drops, and key, respectively. The most lu- 
minous area of the tape was too small to 
be measured with the available photometer. 
The stimuli were always presented in pairs 
in a frontoparallel plane 1.33 m. from O's 
eye with the centers of the two images of 
each pair separated by 23.3 cm. (10°). The 
largest dimension of the image was hori- 
zontally oriented with the midpoint between 
the two images in the median plane of the 
right eye. A .75 diopter lens mounted in 
the right eyepiece optically placed the photo- 
graphs at an infinite accommodative distance 
from O. The observation was always monoc- 
ular (right eye only). Considerable care 
was taken in order to eliminate all extraneous 
illumination so that the stimuli appeared to 
O to be real objects presented in an other- 
wise totally dark visual field. A head and 
chin rest were located at the observation 
position, with the observation position en- 
closed in black cloth so as to eliminate any 
extraneous light. Communication between 
E and O occurred by means of speakers and 
microphones. A continuous masking noise 
was presented to O except when E 
was communicating with him. A shutter 
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Positioned in front of the viewing 
aperture permitted E to change the stimuli 
without being observed by O. A contact 
relay on the head rest insured that O's 
head was properly positioned for the observa- 
tion. A light adaptation surface located to 
O's left was designed to provide a homogene- 
ous visual field with a brightness of 15 ftl. 
This was activated only when O’s head was 
appropriately placed in the adaptation ap- 
paratus. 

A hand-adjustment apparatus was used by 
O to indicate the perceived size (S") of the 
stimulus objects. The apparatus consisted 
of two vertical Square rods (3 in. thick) 
located at the level of O's waist. The left 
rod was fixed, and the right rod could be 
moved laterally. A meter stick was attached 
to the right rod to measure the amount of 
separation of the inner edges of the rods, 

Procedure—Twenty men served as Os. 
All Os were between the ages of 18 and 35 
yr. and had at least 20/20 acuity with their 
right eye as measured at both the near and 
far points using the Bausch and Lomb Ortho- 
Rater, The five stimuli were presented in 
pairs, with each stimulus equally often on 
the right and left. The resulting 20 pairs 
Were presented in a balanced 
design (Edwards, 
Sequence of pairs used with each O. 

The instructions to O emphasized that ap- 
Parent judgments were to be made with re- 
Spect to both size and distance. A previous 
Study had indicated that when instructed to 
apparent size of a 
familiar object in an otherwise dark visual 


Was avoided by 
showing O a simple model which illustrated 
the difference between judgments of apparent 


TABLE 1 


WIDTH AND DISTANCE 
(S AND D, iN ca.) 
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and angular size. The model consisted of an 
amorphously shaped object with threads ex- 
tending from its right and left side to a posi- 
tion indicated as the position of O (which 
also contained small posts representing the 
hand-adjustment apparatus). The threads 
were either parallel or converged to the 
indicated position of O. The correct method 
in the measurement of apparent width was 
communicated to O by means of the parallel 
threads. The O was informed that the 
parallel threads were an illustration of imag- 
inary parallel lines extending from the ob- 
ject with the separation of the lines equal 
to the apparent width to be measured by O's 
adjustment of the hand-adjustment appara- 
tus. The method to be avoided was indi- 
cated to O by means of the converging 
threads. These represented imaginary lines 
specifying the visual angle of the width 
of the object and O was informed that the 
indication of the right to left separation of 
these lines at any distance from himself 
was not the desired measurement. The 
O was also informed that the apparent 
width of the object might or might not 
be equal to the physical width but that this 
was of no concern. Following the instruc- 
tions, O sat for 10 min. in the totally dark 
observation booth, looked into the light 
adaptation surface for 30 sec., and then posi- 
tioned his head in the Observation position. 
Following this, the shutter was raised, re- 
vealing the pair of familiar objects. 

Verbal reports were used to measure the 
perceived distance to the objects and the 
perceived depth between them. According 
to a balanced order, for each pair of objects, 
O reported in feet or inches, or in some 
combination of both, the apparent distance 
of one of the two objects from himself. 
The O indicated which, if either, object ap- 
peared to be more distant and reported the 
Perceived depth between the objects (again 
in feet or inches, or a combination of both). 
Following this, O, using the hand-adjustment 
apparatus, adjusted the distance between the 
rods until this lateral distance seemed to be 
the same as the apparent right-left extent of 
the object. This adjustment was completed 
twice for each of the objects of the pair, 
This total Process, except for the 10 min. 
in the dark, was repeated for each pair of 
familiar objects. 


RESULTS 


The angular sizes (8) of the five fa- 
miliar objects are given in Table 1 to- 
gether with the simulated sizes (S) 
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and distances (D) of the objects. The 
simulated sizes are the widths (left-to- 
right extents) of the real objects 
oriented as in the transparencies and 
the simulated distances are the dis- 
tances from O at which the objects 
would have to be placed to subtend the 
particular angular width (@ in radians 
—S/D). The value of My is the aver- 
age reported distance, converted to 
centimeters, at which O perceived the 
object to be from himself. The value 
My is the average reported size 
(width) of the object, as determined by 


using the hand-adjustment apparatus. 
From Table 1, S and My are similar 
except in the case of the half-dollar and 
key. However, D and Mp, are only 
similar for the smaller values of D. As 
D increases, D-Mp, increases until at a 
D approximately 3 m., Mp is only 
about half the value of D. 

The relation between average values 
of 5'/0 and average values (and stan- 
dard deviations) of the reported depth 
(d') between the objects of a pair is 
given in Table 2 under “Obtained 
Values." The average values of 5'/0 


TABLE 2 
OBTAINED (PERCEIVED) DEPTH (Ms AND gS OF d’ IN CM.) AND SIMULATED DEPTH (d IN CM.) 


BETWEEN OBJECTS IDENTIFIED 


IN Row AND COLUMN HEADINGS 


Obtained Values 
5/9 
88 134 152 211 
s'/e 
Key Tape 504 Drops 
Ma’ oat Ma oat Ma od! Mar oar 
134 Tape 26 20 
152 50¢ 22 14 -6 14 
211 Drops 65 42 37 21 46 39 
239 Crest 89 54 77 49 68 50 35 25 
Simulated Values 
5/8» 
63 87 128 234 
5/0 
Key 50€ Tape Drops 
d d d d 
87 50¢ 24 
128 Tape 65 41 
234 Drops 171 147 106 
299 | Crest 236 212 171 65 


Note.—Numbers in row and column headings are 
unit of visual angle (0 in rad.) for each object. 


perceived size (S’ in cm.) or physical size (5 in cm.) per 
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for each object are shown in the outer 
column and upper row. The average 
values of d' are given to the right or 
below the object names, with each 
value of My being the average re- 
ported depth between the pair of ob- 
jects comprising that column and row. 
Each value of My is the average of 
20 scores, one from each O, with each 
score being the average perceived depth 
between the two objects with their 
right-left positions systematically re- 
versed. Table 2 is arranged to allow 
examination of the validity of the hy- 
pothesis that the perceived depth be- 
tween the objects is determined by the 
magnitude of the difference of the 5'/8 
values of the objects. The objects are 
arranged in the upper half of Table 2 
so that the value of 5'/0 increases from 
left to right and from top to bottom. If 
the above hypothesis is correct, all the 
values of My within the upper half of 
Table 2 should be positive and should 
decrease from left to right and from 
bottom to top. Nine of the 10 values 
of M, are positive, and the tendency 
for these values to change in the ex- 
pected directions seems clear. The 
data in the upper portion of Table 2 
can be summarized as follows: (a) in 
general, the familiar object appearing 
to be more distant had the greater 
value of 5'/0; (b) the greater the dif- 
ference between values of $'/0, the 
greater, in general, was the perceived 
depth between the familiar objects. 
The lower portion of Table 2 speci- 
fies the physical (simulated) size per 
unit of retinal size (5/0), and the 
physical (simulated) depth (d) be- 
tween real objects which would be re- 
quired when using actual objects (not 
photographs) in order to produce the 
visual angles of the stimuli. It is clear 
from Table 2 that, although the 5'/8 
and S/@ values were reasonably simi- 
lar, the My values were often consider- 
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ably smaller than the d values, partic- 
ularly at the larger values of d. 

The upper portion of Table 2 indi- 
cates the importance of relative values 
of S’/@ in determining the perceived 
depth between two familiar objects, It 
will be noted, however, that the form 
of the relation between relative values 
of S’/@ and d' has not been specified. 
One possible expression of this rela- 
tion can be obtained from the size- 
distance invariance hypothesis (Kil- 
patrick & Ittelson, 1953) which states 
that 


S-—K0D' [1] 
where D' is the perceived distance to 


the familiar object of retinal size (vis- 
ual angle) ð and perceived size S", 


and K is an observer constant. It fol- 
lows from Equation 1 that 
ra (Su. Se) 
da=- p(%-%) uu 


where e and f refer to the two objects 
whose separation in depth is being 
judged and d',, = D'; — D',. In order 
to test Equation 2, the average per- 
ceived depth between the two objects 
of a pair was determined and related 
to the average (algebraic) difference 
between the 5'/0 values for the two 
Objects of the same pair with the 
smaller value of 5S’/@ always sub- 
tracted from the larger. The Pearson 
product-moment correlation coefficient 
(r) between .$',/0,—5',/0, and d'y 
is 81, £ (8) = 391, p< 01. Values 
of r were also computed for the data 
from each O. The average value of 
Y (.54) was significant, t (19) = 7.24, 
b «.0l. But, it should be pointed out 
that large individual differences some- 
times occurred in the r values (o, = 
.33) with a range of r from —.42 to .94. 


Discussion 


The results from this experiment sup- 
port the view that in the absence of other 


= 
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depth cues, the occurrence of different 
values of S’/@ between two objects isa 
sufficient condition for the perception of 
depth between the objects, with the object 
having the larger value of S’/® being 
perceived as the more distant object. It 
follows from this experiment that it is 
neither the difference in perceived size 
nor in retinal size which defines the sig- 
nificant variable in the size cue to depth, 
but rather the change in perceived size 
per unit of retinal size between the ob- 
jects. The size cue to relative distance 
occurs, therefore, between irregularly as 
well as regularly shaped familiar objects. 
It is probable that it occurs between any 
objects which have perceived sizes regard- 
less of their shape or complexity. 

It appears from the significant correla- 
tion coefficient between My and differ- 
ences in S/O that Equation 2 is at least 
consistent with the obtained results. This 
does not mean, however, that the per- 
ceived distances D' and d' were neces- 
sarily veridical, i.e., similar to D and d. 
The perceptions of depth tend to be cor- 
rect only for the smallest values of D and 
d with the perceptual error (defined by 
D-D' or d-d’) increasing with increases 
in D and d. Clearly K in Equations 1 
and 2 was not unity. 

It will be recalled that D’ and d’ were 
determined by separate judgments. Never- 
theless, the perceptual errors in D’ and d^ 
are closely related. This can be seen by 
comparing the average obtained d’ with 
the d’ values obtained by subtracting the 
average values of D' of the pairs of ob- 
jects from each other. It appears that Os 
on the average are consistent in their 
judgments, For example, if one object 
of a pair were perceived to be at 5 ft. and 
the other at 3 ft., the perceived depth 
between them as directly indicated by a 
separate report would tend to be 2 ft. 

The results from this experiment lend 
support to the notion that the use of a 
comparison field or comparison objects 
can invalidate the resulting measurement 
of the perceived distance to objects or the 
perceived depth between them (Gogel, 
1964). Tf each object is perceptually 
localized in the comparison field by S’/® 
differences occurring between the experi- 


mental objects and the objects in the com- 
parison field, it would be clear that the 
perceived depth in the comparison field 
is determining, not measuring, the per- 
ceived distance to or between the experi- 
mental objects. 

If the familiar size cue to exocentric 
distance can occur between any familiar 
objects, it exists as a potential cue to 
depth in a variety of situations. Usually 
this cue system is studied in the labora- 
tory as occurring between familiar, sim- 
ilar objects that are geometrically regular 
in shape. Probably regularly shaped, 
similar objects are used because of the 
difficulty in specifying or comparing the 
visual angle © of irregularly or differently 
shaped objects. The concept that S^/0 is 
the significant event in the familiar size 
cue to exocentric distance both avoids this 
problem and asserts that the size cue to 
exocentric depth can occur as a poten- 
tially significant factor in a wide variety 
of situations. 
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CONDITIONAL CONCEPT ATTAINMENT AS A FUNCTION OF 
IF FACTOR COMPLEXITY AND THEN FACTOR COMPLEXITY? 


PATRICK R. LAUGHLIN 


Loyola University 


An experiment was designed to determine the effects of problem com- 
plexity upon the difficulty and use of focusing strategies for the “if 
factor” and “then factor” of conditional concepts. 135 college students 
solved 3 concept attainment problems from a 6-attribute 2-value con- 
cept universe in a 3 X 3 X 3 repeated measures factorial design with 
the variables: (a) if factor complexity (1, 2, or 3 relevant attributes), 
(b) then factor complexity (1, 2, or 3 relevant attributes), and (c) 
problems (3 per S). Major results were: (a) the main effects of both 
if factor complexity and then factor complexity were relatively less 
important than their interaction; there was a curvilinear relation- 
ship between total complexity (summing over the number of relevant 
if and then factors) and number of card choices to solution, untenable 


hypotheses, and focusing strategy, 


on the if factor than the then facto 
the 3 successive problems for 4 of 


Discussions of the logical structure 
of concepts (Haygood & Bourne, 
' 1965; Hunt, 1962; Neisser & Weene, 
1962) define conditional concepts by 
the symbolic logic operator of impli- 
cation (“if A, then B"; or, in the 
typical geometric form problems of 
experimental studies, "if square, then 
red"). Although relatively few ex- 
perimental studies have used condi- 
tional concepts, they are unanimous 
in finding that conditional concepts are 
more difficult than concepts based on 
the logical rules of conjunction, dis- 
junction, joint denial, or simple 
affirmation and denial (Haygood & 
Bourne, 1965; Haygood & Stevenson, 
1967; Hunt & Kreuter, 1962; Neisser 
& Weene, 1962). Haygood and 
Bourne (1965) suggest two reasons 
for the greater difficulty of conditional 
concepts. First, the conditional con- 


1 This research was supported by a grant 
from the Loyola University Faculty Re- 
search Committee. The author expresses his 
appreciation to Jon Anderson, Richard M. 
Jordan, Richard P. McGlynn, and Michael 
Schmitt for collection of the data. 
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(b) relatively more use of focusing 
, and (c) steady improvement over 
the 6 dependent variables. 


cept rule results in a larger number 
of positive instances than other rules 
for a given number of relevant and/or 
irrelevant attributes and values, while 
conditional concepts are theoretically 
best solved from negative instances 
(Hunt, 1962). Furthermore, there are 
two nonhomogeneous subtypes of posi- 
tive instances. For example, positive 
instances of the concept "if square, 
then red" may include either instances 
that are both square and red, or in- 
stances that are not square; the only 
negative instances are nonred squares. 
The first type of instances are positive 
because they satisfy the concept rule, 
while the second type are positive be- 
cause they do not contradict it. Sec- 
ondly, unlike all other logical concept 
rules for two-attribute or greater con- 
cepts, the order in which the same rele- 
vant attributes are specified results in 
different subsets of positive and nega- 
tive instances for conditional concepts. 
For example, the concept “if square, 
then red" partitions the total concept 
universe into different positive and 
negative subsets than the concept "if 
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red, then square." For all other log- the second purpose of the study was 
ical rules the order in which the rele- to determine the effects of problem 
vant attributes are specified does not complexity on both the overall use of 
matter; e. g., the positive and negative focusing strategy and the differential 
subsets for the conjunctive concept use of focusing for the if factor and 
“sed and square" are the same as the then factor of conditional concepts. 
positive and negative subsets for the 

concept "square and red." METHOD 


A third possible reason for the Design and Ss—The experimental de- 
greater difficulty of conditional con- sign was a 3X 3 X 3 repeated measures fac- 
cepts is the necessity for S to deter- A al nina (a) z does m 
mine separately bo d pex ; one, two, or three relevan 
ces gant 

^ * (TFC; one, two, or three relevant attrib- 
Unlike the components of the other utes), and (c) problems (three for each S). 
concepts of a comparable degree of The Ss were 135 college students of both 


complexity, these tw: sexes in introductory psychology courses 
plexity, thesia factors may amy at Loyola University. Fifteen Ss were ran- 


in their difficulty. Furthermore, the i i i 
A 2 : , domly assigned to each of the nine experi- 

relative difficulty of the if factor and ^ mental Vooditions) 

then factor may interact with increas- Stimulus display and — problems.—The 


in bli i . stimulus display was a 28 X 44 in, white 
EOR e complexity of the two fac posterboard containing an 8X8 array of 


pore in ways not predictable from pre- $4 2}X4 in. cards drawn in colored ink 
vious research on problem complexity with dark outlines. The 64 cards repre- 
(number of relevant and irrelevant di- sented all possible combinations of six plus 


mensions) with conjunctive concepts anor ped seu in Aa row. m P 
966: . facilitate reference to the six positions eac 
(e. g., Bourne, 1 ; Bourne & Hay was a different color, so that the color 


good, 1959, 1961; Bourne & Restle, name was the attribute and plus or minus 
1959; Bulgarella & Archer, 1962; the value of each color. The cards were in 


Haygood & Bourne, 1964; Walker & a tess E oeenn pn the EA 
H eg, the top four rows were iue plus ani 
Bourne, 1961). Thus, the following WE" totom four rows blue minus. 


study varied both if factor and then Nine types of conditional concepts were 
factor complexity in a factorial design. ^ generated from one, two, or three relevant 


B i iri- if factors, and one, two, or three relevant 
eyond the establishment of empi ae SN ara dia it tact 


cal laws relating independent stimulus Sad one then factor (11) was "if red 
variables and concept attainment per- plus, then green minus.” The concepts with 
formance, the ultimate objective of re- more than one if factor and/or then factor 
search in concept attainment is to were constructed by adding additional rele- 
determine the problem-solving proc- Nii Rau ons ve factor aay, 
esses or strategies involved. Selection Sek p? red plus and yellow plus, then green 
strategies have been studied for con- minus.” The nine concept types are subse- 
junctive, disjunctive, and biconditional C ANGE se) as ends 1 TEES S 
PPS . us, wii ie - 

wee d ranr con. attribute display, an increase in the number 
: 3 O, of either relevant if factors or relevant then 

1964; Laughlin, 1965, 1966; Laughlin factors necessarily meant a corresponding 


& Doherty, 1967; Laughlin & Jordan, jj decrease in the number of SE epis 
1 : i til attributes. Five different sets of three con- 
955i Lug & McGlynn, 19973 pu cepts and initial cards were used, with three 


no previous research has studied thei o per set. These concepts were randomly 
selection strategies utilized in the at- ected from the total set of possible con- 
tainment of conditional concepts. Thus, is, and the initial cards were randomly 


214 


selected from the subset of possible initial 
cards for each concept. 

Procedure.—The | instructions thoroughly 
explained the nature of the task and the par- 
ticular 1 1, 12, 1— 3, etc. subtype of 
conditional concept. "Three reference cards 
were used to explain the three possible types 
of feedback of yes, no, or does not con- 
tradict on each card choice: 


(1) A yes answer to your card choice 
means that the card you have chosen fits 
the concept rule because it exemplifies it, 
that is, the correct hypothesis is contained 
on the card. 

(2) A no answer to your card choice 
means that the card you have chosen con- 
tradicts the concept rule. The correct hy- 
pothesis is not contained on the card, On 
a no card the if factors will be present 
but not the then factor. 

(3) A does not contradict answer to 
the card you have chosen means that the 
card does not fall under the concept rule. 
This would occur when not all the if fac- 
tors are present, although some of the if 
factors and then factors might be present. 


These three reference cards remained before 
S throughout the experiment. Each prob- 
lem started with an initial positive card, fol- 
lowing which S selected cards and made one 
(only) hypothesis after each card choice. 
Feedback of yes, no, and does not contradict 
was given orally by E after each card choice 
and oral feedback of “yes” or “no” after 
each hypothesis. The problem was solved 
when S made the correct hypothesis. It was 
emphasized that the object was to solve the 
problems in as few card choices as possible, 
regardless of time (Laughlin, 1964). 


REsuLts 


The means for number of card 
choices to solution, overall percentage 
of untenable hypotheses, percentage of 
untenable if factor hypotheses, percent- 
age of untenable then factor hypo- 
theses, overall focusing strategy, and 
if factor focusing minus then factor fo- 
cusing, for the three problems and 
totals over problems, are given in 
Table 1. 

Number of card choices—The main 
effect of if factor complexity (IFC) 
was not significant, F (2, 126) <1. 
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The main effect of then factor complex- 
ity (TFC) was significant, F (2, 126) 
= 3.08, 5 < .05. By Duncan multiple- 
range comparisons concepts with two- 
attribute then factors required more 
card choices than either concepts with 
one-attribute then factors (p < .001) 
or concepts with three-attribute then 
factors (p < 05), while one-attribute 
and three-attribute then factors did not 
differ. The IFC x TFC interaction 
was significant, F (4, 126) — 6.04, 
p «.001. Results of Duncan com- 
parisons for the nine types of concepts 
are given in Table 2. 

In general, 1 — 3 and 22 concepts 
were more difficult than all of the other 
seven types, which did not differ from 
each other. 

Tn addition to a 3 x 3 factorial analy- 
sis of IFC and TFC, the concepts were 
analyzed for total complexity summing 
over both the number of relevant if 
factors and the number of relevant then 
factors. This analysis gave five levels 
of total relevant attributes: two (12 
1), three (2 1, 1-2), four (128, 
222, 3> 1), five (223, 32), 
and six (3— 3). The means and re- 
sults of Duncan comparisons for un- 
equal replications (Kramer, 1956) for 
the five levels of total relevant attrib- 
utes are given in Table 3. Concepts 
with four relevant attributes required 
more card choices than each of the 
other four types, while none of these 
differed from each other. The effect 
of successive problems was not signif- 
icant, F (2, 252) « 1. 

Overall percentage of untenable hy- 
potheses.—A. hypothesis could be un- 
tenable because of either the if factor 
or the then factor (or both). Un- 
tenable hypotheses because of the if 
factor were of two types: (a) the en- 
tire if factor appeared on a previous 
does not contradict card, and (5) the 
opposite of any value of the if factor ap- 
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TABLE 1 


Mean No. or Carp CHOICES To SOLUTION, OVERALL PERCENTAGE OF UNTENABLE 
HYPOTHESES, PERCENTAGE OF UNTENABLE IF FACTOR HYPOTHESES, PERCENTAGE 
oF UNTENABLE THEN Factor HYPOTHESES, OVERALL FOCUSING STRATEGY, 
AND Ir Factor Focustnc Minus THEN FACTOR FOCUSING, FOR 
THREE PROBLEMS AND TOTALS OVER PROBLEMS 


Then Factor Attributes 


If One Two Three 
Factor 
Attri- 
butes Problem Problem Problem 


One Two | Three | Total | One Two | Three Total | One | Two | Three | Total 


Number of Card Choices to Solution 


One 6.53 | 4.80 | 6.33 |17.66| 7.00 | 4.33| 6.13 | 17.46 | 10.47 
Two 7.73 | 5.53 | 4.73 |17.99 | 12.40 | 12.13 | 8.80 | 31.33| 6.60 
Three | 5.07 | 6.07 | 6.87 |18.01| 4.80 | 7.87| 7.20 | 19.87 | 5.53 


TINO 
Suo 
oo 
oe 
eS 

[zi 

- 

[3 

ie 

[3 


Overall Percentage of Untenable Hypotheses 


One .523 | .307 | .236 |1.066| .363 195 | .270 | .828| .487 | .485 | .341 | 1.313 
Two 437 | .200 | .096 | .733| .540 | ‘415 | .366 | 1.321] .266 | .199 | .160 | .625 
Three | 193 | 235 | 1285 | 7.13| .237 ; .374 | .165 | .776 | .341 | -323 .232 | .896 


» Percentage of Untenable If Factor Hypotheses 


One .361 | .221 | .200 | .782 | 229 | .112 | .075 | .416 438 | .099 | .039 | .276 
Two 7368 | .199 | .073 | .640 | .320 | .276 | .229 | .825 .208 | .124 | .124 | .456 
Three | .144 | .223 | .196 salh "201 | 221 | .135 | .557 | .323 | .439 | .262 |1.024 


Percentage of Untenable Then Factor Hypotheses 


One .370 | .229 | .206 | .805 | .257 | .148 | .282 .687 | .432 | .427 | .331 | 1.190 
Two ‘167 | 103 | .023 | .293 | .416 | .297 | .241 .954 | .195 | .151 | .116 | .462 
Three | .101 | .066 | .103 | .270 | .166 | .234 | .104 .504 | .193 | .287 | .239 | .719 


Overall Focusing Strategy 


One | 371 | 521 | .572 | 1.464 | .429 | -609 | 572 | 1.610 | 124 | .319 A81 | .924 
Two | 393 | 555 | 1617 |1.565| .264 | .397 | .389 | 1.050 | .598 1645 | .764 | 2.007 
Three | [595 | ‘365 | 497 |1.657| .599 | .486 | .755 | 1.840] .500 .337 | .574 | 1.411 


If Factor Focusing Minus Then Factor Focusing 


One .0 A .042| .155| .060| .181 | -191 .432| .111 | .033| .120 | .364 
Two — 39 “093 011] .043| .073| .063 | .133 .269 | .074 | .083| .033 | .190 
Three | .007|—.006 | .000| .001|—.035 | .078 1023 | .066| .127 |—.011| .020 | .136 


peared on either a previous positive types: (a) the entire then factor ap- 


card or a previous negative card. peared on a previous negative card, and 
j E tae h ite of lue of the 

Likewise, untenable hypotheses be- (b) the opposite of any value of 

cause of the then factor were of two then factor appeared on a previous 


FOR NUMBER OF CARD CHOICES TO SOLUTION, OVERALL PERCENTAGE OF UN- 
TENABLE HYPOTHESES, PERCENTAGE OF UNTENABLE IF FACTOR HYPOTHESES, 
PERCENTAGE OF UNTENABLE THEN FACTOR HYPOTHESES, AND OVERALL 
FOCUSING STRATEGY, FOR TOTALS OVER THREE PROBLEMS 


Number of Card Choices to Solution 
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TABLE 2 
RrsuLrS or DUNCAN MULTIPLE-RANGE COMPARISONS BETWEEN NINE CONCEPT TYPES 


1 
Concept Type 

122 1-21 221 3-1 323 322 123 222 
23 ns ns ns ns ns ns 001 -001 
122 ns ns ns ns ns 001 001 
11 ns ns ns ns .001 -001 
2221 ns ns ns 001 001 
3,21 ns ns .001 001 
323 ns .01 .001 
322 01 .001 
123 .01 

Overall Percentage of Untenable Hypotheses 

3-1 2-1 322 122 33 11 13 222 
223 ns ns ns ns 01 .001 001 001 
321 ns ns ns ns 01 001 001 
221 ns ns ns .05 .001 001 
322 ns ns 05 001 001 
122 ns ns 001 001 
333 ns .001 001 
11 .05 05 
13 ns 


Percentage of Untenable If Factor Hypotheses 


Percentage of Untenable Then Factor Hypotheses 


221 223 322 122 3253 121 222 123 
321 ns .05 .05 001 .001 .001 .001 .001 
221 ns .05 .001 .001 .001 .001 «001 
223 ns 05 05 001 001 001 
322 .05 05 .01 .001 001 
122 ns ns .01 .001 
33 ns .05 -001 » 
121 ns 001 
222 05 


223 
322 A s 

3251 ns 05 .05 .001 

221 

121 

222 
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TABLE 2—(Continued) 


Overall Focusing Strategy 


222 323 121 221 122 31 32 223 
123 ns .001 .001 .001 .001 .001 .001 .001 
222 .01 01 .001 .001 .001 .001 .001 
323 ns ns ns ns .01 .001 
121 "s ns ns .05 01 
221 ns ns ns .01 
122 ns ns .01 
31 ns .05 
322 ns 


positive card. The total number of 
untenable hypotheses on each problem 
(whether on the if factor or the then 
factor) was divided by the number of 
card choices on that problem to obtain 
the overall percentage of untenable hy- 
potheses. 

Neither the main effect of IFC nor 
TFC was significant, F (2, 126) = 
2.78, F (2, 126) <1, respectively. 
The IFC x TFC interaction was sig- 
nificant, F (4, 126) = 4.62, p € 001. 
Results of Duncan comparisons for the 
nine types of concepts are given in 
Table 2, and means and Duncan com- 
parisons for total relevant attributes in 
Table 3. There was a greater overall 
percentage of untenable hypotheses for 
concepts with four relevant attributes 
than for all other types except two, 
and a greater percentage with two than 
with five or three relevant attributes. 
The effect of successive problems was 
significant, F (2, 252) = 925, p< 
.001, representing a significant linear 
decrease in untenable hypotheses over 
problems, Fin (1, 252) = 18.48, p < 
.001. 

Percentage of 
potheses.—This 
only whether the 
hypothesis was tenable, 
the two rules indicated previously. 
The number of untenable if factor hy- 
potheses was divided by the number o 
card choices on that problem to obtain 


untenable if factor hy- 
measure considered 
if factor part of each 
according to 


the percentage of untenable if factor 
hypotheses. 

Neither the main effects of IFC nor 
TFC were significant, F (2, 126) = 
244, F (2, 120) < 1, respectively. 
The IFC x TFC interaction was sig- 
nificant, F (4, 126) = 5.30, p < .001. 
Duncan comparisons for the nine types 
of concepts are given in Table 2, and 
means and Duncan comparisons for 
total relevant attributes in Table 3. 
There was a greater percentage of un- 
tenable if factor hypotheses with six 
relevant attributes than with each of 
the other four types, and more with 
two relevant attributes than the other 
three types. Three, four, and five rele- 
vant attributes did not differ. The 
effect of successive problems was sig- 
nificant, F (2, 252) = 7.04, p < .005, 
representing à significant linear de- 
crease in percentage of untenable if 
factor hypotheses over problems, Fim 
(1, 252) = 13.87, p < .001. 

Percentage of untenable then. factor 
hypotheses—This measure considered 
only whether the then factor part of 
each hypothesis was tenable, accord- 
ing to the two rules indicated pre- 
viously. The number of untenable 
then factor hypotheses was divided by 
the number of card choices on that 
problem to obtain the percentage of 
untenable then factor hypotheses. The 
main effect of IFC was significant, F 
(2, 126) = 10.97, p < .001. By Dun- 
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TABLE 3 


MEANS AND RESULTS or DUNCAN MUL- 
TIPLE-RANGE COMPARISONS BETWEEN F; IVE 
LEvELS or TorAL RELEVANT ATTRIBUTES 
FOR NUMBER or Carp CHOICES to SoLurIoN, 
OVERALL PERCENTAGE OF UNTENABLE Hy- 
POTHESES, PERCENTAGE oF UNTENABLE Ir 
Factor HYPOTHESES, PERCENTAGE OF Un- 
TENABLE THEN FACTOR HYPOTHESES, AND 
OVERALL FocusiNG STRATEGY, FOR TOTALS 
OVER THREE PROBLEMS 


Number of Card Choices to Solution 


Total Relevant Attributes 


Two Five | Three Six Four 

17.67 | 17.70 | 17.73 | 18.93 | 24.89 

Two ns ns ns -001 
Five ns ns -001 
Three ns .001 
Six -001 


Overall Percentage of Untenable Hypotheses 


Five | Three Six Two Four 

701 | .781 | .896 | 1.006 | 1.116 
Five ns ns -001 | .001 
Three ns 01 -001 
Six ns .01 
Two ns 


Percentage of Untenable If Factor Hypotheses 


Five | Three | Four Two Six 

507 | .528 | .555 | .782 | 1.024 
Five ns ns -01 -001 
Three ns -01 .001 
Four -01 -001 
Two .05 


Percentage of Untenable Then Factor Hypotheses 


Five | Three Six Two Four 

483 | .490 | .719 | .805 .805 
Five ns -01 .001 | .001 
Three 01 -001 | .001 
Six ns ns 
Two ns 

Overall Focusing Strategy 

Four Six Two | Three | Five 

1.210 | 1.411 | 1.464 | 1.588 1.924 
Four ns .05 001 | .001 
Six "s ns -001 
Two ns -001 
Three -001 
—— ee oe 
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can comparisons there was a higher 
percentage of untenable then factor hy- 
potheses for concepts with one-attribute 
if factors than two-attribute or three- 
attribute if factors (5 < 001), while 
two-attribute and three-attribute did 
not differ. The main effect of TFC 
was also significant, F (2, 126) = 
7.62, p < .001. By Duncan compari- 
sons there was a higher percentage of 
untenable then factor hypotheses for 
concepts with both two-attribute and 
three-attribute then factors than for 
concepts with one-attribute then fac- 
tors ($ < .001), with no difference be- 
tween two-attribute and three-attrib- 
ute then factors; The IFC x TFC 
interaction was significant, F (4, 126) 
= 5.90, p< .001. Duncan compari- 
sons for the nine types of concepts are 
given in Table 2, and means and Dun- 
can comparisons for total relevant 
attributes in Table 3. The effect of 
successive problems was significant, 
F (2, 252) = 3.06, p < .05, represent- 
ing a significant linear decrease over 
problems in the percentage of unten- 
able then factor hypotheses, Fun (1, 
252) = 6.10, p < .05. 

Overall focusing strategy.—Overall 
focusing Strategy was scored according 
to two rules: 

Rule 1: Each card choice had to ob- 
tain information on at least one new 
attribute. Information was obtained 
on all card choices that altered only 
one attribute not previously proved 
irrelevant from the initial positive card. 
On such one-attribute alteration cards 
a yes answer indicated that the 
changed attribute was neither part of 
the if factor nor part of the then factor. 
A no answer indicated that the 
changed attribute was not part of the 
if factor and was part of the then 
factor. A does not contradict answer 
indicated that the changed attribute 
was part of the if factor and was not 
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part of the then factor. If the card 
altered more than one attribute not 
previously proved irrelevant (a "focus 
gamble" for Bruner et al, 1956), a 
yes answer indicated that none of the 
changed attributes were part of either 
the if factor or then factor. A mo 
answer indicated that none of the 
changed attributes were part of the 
if factor, but gave ambiguous informa- 
tion on the factor. A does not con- 
tradict answer indicated ambiguous in- 
formation for both the if factor and 
the then factor. 

Rule 2: The hypothesis had to be 
tenable considering the information 
available at that time (see previous 
rules for untenable hypotheses). Each 
card choice and accompanying hypo- 
thesis that satisfied these two rules 
was counted as an instance of focusing. 
The total number of card choices that 
satisfied the rules was then divided by 
the total number of card choices on 
the problem to give a continuous focus- 
ing score from .00 to 1.00. Thus, an 
instance that obtained information on 
either the if factor or the then factor 
counted as an instance of overall focus- 
ing strategy. 

Neither the main effect of IFC nor 
TFC was significant, F (2, 126) = 
138, F (2, 126) <1, respectively. 
The IFC x TFC interaction was sig- 
nificant, F (4, 126) = 605, p< 001. 
Duncan comparisons for the nine types 
of concepts are given in Table 2, and 
means and Duncan comparisons for 
total relevant attributes in Table 3. 
In general, there was more overall fo- 
cusing strategy with five relevant 
attributes than with all other types, 
and less with four relevant attributes 
than with any other type except six 
attributes, The effect of successive 
problems was significant, F (2, 252) 
= 6.42, p < 005, representing a sig- 
nificant linear increase in overall focus- 


ing over problems, Fu, (1, 252) = 
12.08, p < .001. 

If factor focusing minus then factor 
focusing—This measure computed 
separate if factor and then factor fo- 
cusing scores by considering only the 
information and hypothesis for the if 
factor or then factor, respectively. 
The resulting then factor focusing 
score was subtracted from the cor- 
responding if factor focusing score for 
each problem, in order to determine 
the relative use of focusing for the if 
factor and then factor. As indicated in 
Table 1, the totals over problems were 
positive for all nine concept types, in- 
dicating a relatively greater use of fo- 
cusing on the if factor than the then 
factor for all types of conditional con- 
cepts. The main effect of IFC was 
significant, F (2, 126) = 3.68, p < .05. 
By Duncan comparisons there was a 
relatively greater use of focusing on 
the if factor than on the then factor 
for concepts with one-attribute if fac- 
tors than for concepts with two-attrib- 
ute if factors (p < .05) or three-at- 
tribute if factors (p < .001), and rela- 
tively greater use of focusing on the if 
factor than the then factor for con- 
cepts with two-attribute if factors than 
three-attribute if factors (p< Ol). 
The main effect of TFC was not quite 
significant, F (2, 126) — 3.02, p < .10. 
The IFC x TFC interaction was not 
significant, F (4, 126) < 1. Thus, un- 
like the analyses for the previous five 
dependent variables, multiple compar- 
isons were not performed between 
either the nine types of concepts or the 
five levels of total relevant attributes. 
The effect of successive problems was 
not significant, F (2, 252) < 1. 


DISCUSSION 


The first purpose of the study was to 
determine the effects of three levels of 
IFC and TFC on the difficulty of condi- 
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tional concepts. On the four measures of 
problem-solving products (Duncan, 1959), 
number of card choices, overall percent- 
age of untenable hypotheses, percentage 
of untenable if factor hypotheses, and 
percentage of untenable then factor hy- 
potheses, there were significant main ef- 
fects of TFC for number of card choices 
and of both TFC and IFC for percentage 
of untenable then factor hypotheses. 
However, the IFC X TFC interaction 
was highly significant for all four meas- 
ures, Inspection of the Duncan multiple- 
range comparisons between the nine con- 
cept types (Table 2) indicated that the 
important aspect was neither IFC alone 
nor TFC alone, but the total complexity 
summing over both the number of relevant 
if factors and relevant then factors, Fur- 
ther analyses of the five levels of total 
relevant attributes (Table 3) indicated 
that for all four measures concepts with 
four relevant attributes were most difficult, 
concepts with two and six relevant at- 
tributes were next in difficulty, and con- 
cepts with three and five relevant at- 
tributes were least difficult. Likewise, 
two and six, and three and five, did not 
differ from each other. Thus, in sum- 
mary, for the six-attribute, two-value con- 
cept universe employed, there was a curvi- 
linear effect of total problem complexity 
as an interaction of IFC and TFC on the 
difficulty of conditonal concepts. 

This curvilinear relationship between 
problem complexity and performance is 
extremely unusual in concept learning, in 
which performance typically decreases 
linearily as problem complexity increases 
(eg., Bourne & Haygood, 1959, 1961; 
Bulgarella & Archer, 1962; Haygood & 
Bourne, 1964; Haygood & Stevenson, 
1967; Walker & Bourne, 1961). Sev- 
eral considerations seem relevant to this 
unusual finding. Initially, with the con- 
stant six-attribute stimulus display, as 
the number of relevant if factors and/or 
then factors increased the number of ir- 
relevant attributes correspondingly de- 
creased. Since the problems could be 
solved either by determining the relevant 
attributes or the irrelevant attributes, the 
less complex problems may have been 


solved by specifying relevant attributes 
while the more complex problems were 
solved by specifying irrelevant attributes. 
This would predict the obtained result 
that four-attribute concepts were most 
difficult, since the less complex two-at- 
tribute and three-attribute concepts could 
be solved by specifying fewer relevant at- 
tributes than necessary to solve the four- 
attribute concept, while five-attribute con- 
cepts could be solved by specifying fewer 
irrelevant attributes. Although this would 
not predict the finding that six-attribute 
concepts were easier than four-attribute 
(since six-attribute have no irrelevant at- 
tributes), with six-attribute concepts spec- 
ification of an attribute as irrelevant for 
either the if factor or the then factor 
necessarily meant that it was relevant for 
the other factor. Also, to the extent that 
Ss scan rather than focus (Bruner et al., 
1956), four-attribute concepts will be the 
most difficult because there is a larger 
number of possible four-attribute com- 
binations given the initial card than for 
any other number of attributes. Secondly, 
the previous studies all involved reception 
procedures, in which the order of in- 
stances was preprogrammed and presented 
successively, while the present experiment 
involved a selection procedure in which S 
tested cards in his own order from the 
entire stimulus universe. The selection 
procedure would probably make the stim- 
ulus universe more salient, so that the 
Strategy of specifying the irrelevant at- 
tributes would be more apparent, All 
previous studies of conditional concepts 
have used reception procedures (Hay- 
good & Bourne, 1965; Haygood & Steven- 
son, 1967; Hunt & Kreuter, 1962; Neisser 
& Weene, 1962). Further, the only study 
to vary complexity with conditional con- 
cepts (Haywood & Stevenson, 1967) did 
not systematically vary if factor and then 
factor complexity. Thirdly, all previous 
studies finding a linear relationship be- 
tween problem complexity and perform- 
ance have used conjunctive concepts, 
which may involve different relationships 
than comparable conditional concepts for 
selection procedures. However, Haygood 
and Stevenson (1967) found the decrease 


IF FACTORS AND THEN FACTORS IN CONCEPT ATTAINMENT 221 


in performance with increasing complex- 
ity to be most pronounced for conditional 
concepts. Fourthly, unlike previous stud- 
ies with conditonal concepts, the present 
study distinguished the two types of posi- 
tive instances in the instructions and feed- 
back given to S: (a) yes instances, which 
were positive because they satisfied both 
the if factor and then factor, and (b) does 
not contradict instances, which were posi- 
tive because they did not have the if fac- 
tor. Finally, the present study used a 
six-attribute concepts universe with only 
two values per attribute, and problem 
complexity may have different effects for 
attributes with three or more values. 

The second purpose of the experiment 
was to determine the effects of problem 
complexity on the use of the focusing 
strategy. There were no main effects of 
either IFC or TFC for overall focusing, 
while the interaction was highly signif- 
icant. Concepts with three and five rele- 
vant attributes resulted in the most focus- 
ing, concepts with two and six relevant 
attributes an intermediate amount, and 
four-attribute concepts the least. These 
curvilinear results exactly paralleled those 
for number of card choices. However, 
the measure of if factor focusing minus 
then factor focusing indicated a relatively 
greater use of focusing on the if factor 
than the then factor. Furthermore, this 
relative difference was significantly greater 
for one-attribute if factors than two-at- 
tribute or three-attribute if factors, and 
greater for two-attribute than three-at- 
tribute if factors. Thus, as if factor com- 
plexity increased, the relatively greater 
use of if factor focusing than then factor 
focusing progressively declined. 

The improvement over problems for the 
three measures of untenable hypotheses 
(overall, if factor, and then factor) and 
overall focusing strategy is unusual in a 
selection procedure although it is typical 
with reception procedures. Thus, the dif- 
ficulty of conditional concepts relative to 
other concept rules may decrease with 
practice on the initially unfamiliar condi- 
tional concepts (Haygood & Bourne, 
1965). 


REFERENCES 


Bourne, L. E, Jm. Human conceptual be- 
havior, Boston: Allyn and Bacon, 1966. 

Bourne, L. E., Je, & Haycoon, R C. The 
role of stimulus redundancy in the identi- 
fication of concepts. J. exp. Psychol., 
1959, 58, 232-238. 

Bourne, L. E, Jm, & Haycoop, R G: 
Effects of redundant relevant information 
upon the identification of concepts. J. 
exp. Psychol, 1961, 61, 259-260. 

Bourne, L. E., JR, & RESTLE, F. Mathe- 
matical theory of concept identification. 
Psychol. Rev., 1959, 66, 278-296. 

Bruner, J. S, Goopnow, J. J., & Austin, 
G. A. A study of thinking. New York: 
Wiley, 1956. 

BULGARELLA, R., & ArcHER, E. J. Concept 
identification of auditory stimuli as a 
function of amount of relevant and irrele- 
vant information. J. ex. Psychol., 1962, 
63, 254-257. 

Conant, M. B., & TRABASSO, T. Conjunc- 
tive and disjunctive concept formation 
under equal-information conditions. J. 
exp. Psychol., 1964, 67, 250-255. 

Duncan, C. P. Recent research on hu- 


man problem solving. Psychol. Bull., 
1959, 56, 397-429. 
Havcoop R. C, & Bourne, 14:5, JR. 


Forms of relevant stimulus redundancy in 
concept identification. J. exp. Psychol., 
1964, 67, 392-397. 

Havcoop, R. C., & BOURNE, LE, JR. At- 
tribute- and rule-learning aspects of con- 
ceptual behavior. Psychol. Rev., 1965, 
72, 175-195. 

Haycoop, R. C., & Srevenson, M. Effects 
of number of irrelevant dimensions in 
nonconjunctive concept learning. J. exp. 
Psychol, 1967, 74, 302-304. 

Hunt, E. B. Concept learning: An infor- 
mation-processing problem. New York: 
Wiley, 1962. 

Hunt, E. B, & KREUTER, J. M. The de- 
velopment of decision trees in concept 
learning: III. Learning the connectives, 
Los Angeles: Western Management 
Sciences Institute, 1962. 

Kramer, C, Y. Extension of multiple range 
tests to group means with unequal num- 
bers of replications. Biometrics, 1956, 
12, 307-310. 

Laucnuum, P. R. Speed versus minimum- 
choice instructions in concept attainment. 
J. exp. Psychol, 1964, 67, 596. 

Laucmuw, P. R. Selection strategies in 
concept attainment as a function of num- 


222 PATRICK R. LAUGHLIN 


ber of persons and stimulus display. J. LAUGHLIN, P. R, & McGtynn, R. P, 
exp. Psychol., 1965, 70, 323-327. Cooperative versus competitive concept 
LaucHLIN, P. R. Selection strategies in attainment as a function of sex and stimu- 
concept attainment as a function of num- lus display. J. pers. soc. Psychol., 1967, 
ber of relevant problem attributes, J. exp. 7, 398-402. 
Psychol., 1966, 71, 773-776. Neisser, U., & WEENE, P. Hierarchies in 
Lauc, P. R., & Douerty, M. A. Dis- concept attainment. J. exp. Psychol., 
cussion versus memory in cooperative 1962, 64, 640-645. 
group concept attainment. J. educ. Psy- Warrer, C. M., & Bourne, L. E, JR. Con- 


chol., 1967, 58, 123-128. cept identification as a function of amounts 

Laveuuin, P. R, & Jorvan, R. M. Selec- of relevant and irrelevant information. 
tion strategies in conjunctive, disjunctive, Amer. J. Psychol., 1961, 74, 410-417. 
and biconditional concept attainment. J. 


exp. Psychol., 1967, 75, 188-193. (Received May 3, 1967) 


Journal of Experimental Psychology 
1968, Vol. 77, No. 2, 223-231 


PAIRED-ASSOCIATE RESPONSE LATENCIES AS A FUNCTION 
OF FREE ASSOCIATION STRENGTH ' 


S. I. SHAPIRO? 


Pennsylvania State University 


Paired-associate response latencies were studied in a series of experi- 
ments as a function of free association strength (FAS), hierarchy, 
directionality, and mediation. Response latencies varied inversely with 
forward FAS employing both unmixed (Experiments I, II, and III) 
and mixed (Experiment V) list designs. In addition, nonprimary re- 
sponses paired with stimuli eliciting high FAS primary responses re- 
sulted in shorter latencies than when paired with stimuli eliciting 
low FAS primary responses (Experiment II), and high FAS 
bidirectional associates resulted in shorter latencies than high FAS 
unidirectional associates (Experiment III). A model of paired-asso- 
ciate learning of directly and mediationally associated words was 


tested, but yielded inconclusive results (Experiment IV). 


Several studies have recently investi- 
gated paired-associate performance as 
a function of variations in the free- 
association strength (FAS) of norma- 
tively associated word pairs. The re- 
sults with adults indicate that perform- 
ance varies positively with FAS when 
stimulus members have low word fre- 
quencies (Martin, 1964; Postman, 
1962), but that FAS does not dif- 
ferentially affect performance with 
stimuli of moderate or high word fre- 
quencies (Haun, 1960; Jenkins, in 


1This report is based upon the author’s 
doctoral dissertation (Shapiro, 1966), where 
additional methodological details and results 
may be found. The research was supported 
by Research Grant No. GB-2568 from the 
National Science Foundation to David S. 
Palermo. Preparation of this manuscript 
was completed during the author’s tenure as 
a Postdoctoral Fellow at the Institute of Hu- 
man Learning, University of Califorina, 
Berkeley. The Institute is supported by 
grants from the National Science Foundation 
and the National Institutes of Health. The 
author is indebted to David S. Palermo for 
his counsel throughout the investigation and 
in preparing this report. Helpful comments 
were also contributed by Charles N. Cofer, 
John F. Hall, and Robert Seibel. Richard A 
Olsen was kind enough to assist in design- 
ing the apparatus. 

?Now at the University of Hawaii. 


press; Martin, 1964; Postman, 1962). 
However, children's paired-associate 
performance varies positively with 
FAS even when highly frequent stimuli 
are employed (e.g., Carroll & Penney, 
1966; McCullers, 1961; Palermo & 
Jenkins, 1964a; Wicklund, 1964). 
Assuming that normative associates 
reflect their cooccurrence in natural 
language, the absolute frequencies in 
natural language of associates to high 
frequency stimuli would be greater 
than to low frequency stimuli. Since 
FAS may only index the relative 
strength of natural language habits 
(Wicklund, Palermo, & Jenkins, 
1964), the absolute strengths of norma- 
tive responses to highly frequent stim- 
uli may be great enough for adult 
paired-associate learning to take place 
very rapidly. Since children, however, 
have been less exposed to language 
than adults, the absolute frequencies 
of normative associates even to highly 
frequent stimuli may be low enough for 
FAS to affect performance with high 
as well as low frequency stimuli. 
While the typical measures of trials 
to criterion or number of errors do not 
reflect variations in FAS with adults 
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when high frequency stimuli are em- 
ployed, this does not preclude the pos- 
sibility that FAS still exerts some in- 
fluence demonstrable by a more 
sensitive performance measure. One 
potentially more sensitive measure is 
response latency. Peterson (1965), 
eg. has demonstrated that adults’ 
paired-associate response latencies sig- 
nificantly decrease over trials after the 
last error. Thus, even after perfect 
performance is attained by the error 
measure, further changes in perform- 
ance may be demonstrated with the 
latency measure. 

The present investigation therefore 
employed the response latency measure 
to study the influence of variations in 
FAS upon adult paired-associate learn- 
ing using high frequency stimuli. In 
addition to varying forward FAS, the 
effects of associative hierarchies, bi- 
directional FAS, and mediated asso- 
ciative strength were studied. 

Experiment I assessed the effects of 
high, moderate, and low FAS upon 
paired-associate response latencies. 


EXPERIMENT I 
Method 


Lists—Three lists of 10 paired-associates 
each were constructed from free association 
stimuli and their primary responses. The 
mean FAS was 55% (Range = 40-7096) 
for List 1 pairs, 23.7% (Range = 21-2876) 
for List 2, and 12.1% (Range = 10-14%) 
for List 3. The mean intrapair R-S FAS 
was 16, 1.4, and 1.3% for Lists 1, 2, and 
3, respectively. Of the 360 possible extra- 
pair interitem associations in each list, List 1 
contained 11, (M=2.7%), List 2, 4 (M— 
2.2%), and List 3, 15 (M = 1.376). 

The words for Exp. I and the subsequent 
experiments were selected from collections 
of word-association data by Bilodeau and 
Howell (1965), Bousfield, Cohen, Whit- 
marsh, and Kincaid (1961), Gerow and Pol- 
lio (1965), Jones and Fillenbaum (1964), 
Palermo and Jenkins (1964b), Riegel (1965), 
Rosenberg (1965), Russell and Jenkins 
(1954), and unpublished norms of Cofer, 
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Marshall, and Deese.5 Where FAS was re- 
ported in the norms by sex, only the per- 
centages for males were used. All stimuli 
and responses throughout the series of ex- 
periments had Thorndike-Lorge (1944) word 
frequencies of A or AA, and except for Exp. 
IV, all words were nouns. No pairs were 
selected whose members had the same initial 
letter. Intrapair R-S FAS (excepting Exp. 
III) and forward or R-S extrapair interitem 
associations were minimized in all the ex- 
periments, a restriction which necessitated 
selecting words from several different sets 
of norms. 

Procedure.—The instructions were those 
typically presented for paired-associate learn- 
ing and included a study trial prior to the 
first anticipation trial. The S was told never 
to verbalize the stimulus items, to pro- 
nounce all response anticipations loudly and 
clearly, and to avoid making any other 
sounds. A throat microphone was secured 
around S’s neck, but its purpose was not 
revealed to nor, apparently, discerned by Ss. 
The lists were presented on a Stowe memory 
drum (Model 459-B) at a 2.26:3.26-sec. rate 
with a 5.52-sec. intertrial interval. Eight 
anticipation trials were presented, during 
which E recorded the latency of each re- 
sponse, correct anticipations, omissions, and 
errors. Response latencies for errors or 
omissions were assigned a latency of 227 
sec., ie, 1/100 sec. beyond the length of the 
anticipation interval. Each list was presented 
in four different random orders, with the 
restriction that no given pair appeared more 
than once in an initial and/or terminal posi- 
tion of the orders. 

The same procedure was used for Exp. II 
and III. 

Apparatus.—A. circuit was designed such 
that when the memory drum shutter de- 
scended exposing the stimulus member of an 
S-R pair, a Hunter Klockounter (Model 
120-A) was activated which timed the inter- 
val from exposure of the stimulus until S’s 
verbal response, ie, the response latency. 
A. modified Grason-Stadler voice-operated 
relay (Model E7300A-1) stopped the 
Klockounter when the relay was pulsed by 
S’s response. Latencies were recorded to 
1/100 sec. A Hunter Decade Interval timer 
(Model 100-C) automatically reset the 
Klockounter in time to measure the latency 


8The author is grateful to Charles N. 
Cofer, George R. Marshall, and James E. 
Deese for providing these three independent 
sets of norms. 
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of S's next response. The latency measuring 
apparatus and E were hidden from S's view. 
Tape recordings were made of each S's re- 
sponses such that the latency of any response 
E was unable to determine could be retrieved 
from the tape recordings. The percentage 
of responses that had to be retrieved 
throughout the series of experiments was 
relatively small (M — 1776/list). 

Subjects.—The Ss were male college stud- 
ents from introductory psychology classes at 
Pennsylvania State University. The same 
population was drawn upon in the subsequent 
experiments, but no S served in more than 
one experiment nor learned more than one 
list. In all experiments, Ss were randomly 
assigned to one of the lists with the restric- 
tion that each list was tested once before 
it was assigned again. In Exp. I, 33 Ss 
were tested per list. The data of 4 addi- 
tional Ss were discarded because of apparatus 
failures. 


Results 


Figure 1 presents the mean item re- 
sponse latencies per trial for the three 
lists and indicates that response laten- 
cies varied inversely with FAS over 
trials. A List X Trials analysis of 
variance* indicated that the list and 
trials effects and their interaction were 
all significant, F (2, 768) = 29.58, p < 
001; F (7, 768) = 4481, p < .001; 
F (14, 768) = 199, p « 05. The 
interaction of Lists x Trials reflects 
the fact that performance on the mode- 
rate FAS pairs was similar to that of 
low FAS pairs on the initial two 
trials, but on later trials was similar 
to performance on high FAS pairs. 
The latencies across trials differed sig- 
nificantly for Lists 1 and 3 and Lists 2 
and 3, but not for Lists 1 and 2, 
t (64)—301, p< .005; t= 2.37, 
p < .025; t= 99. Significantly more 
errors (including omissions) occurred 


“The limited range of the response la- 
tencies and the lack of any marked discrep- 
ancies from normality, made transformations 
of the data unnecessary for the analyses per- 
formed in the present experiment and for 


the subsequent experiments reported. 


for List 2 (N = 50) and List 3 (N = 
53) than for List 1 (N 29), $< 
01, | Kolmogorov-Smirnov Two- 
Sample Test. 

In Exp. II, the effects of two FAS 
levels and two hierarchial structures 
were studied. 


ExPERIMENT II 
Method 


Lists—Two lists of 10 paired-associates 
each were constructed from free-association 
stimuli and nonprimary responses. The 
mean FAS of List 1 pairs was 5.3% 
(Range=5-6.2%), and for List 2, 1.3% 
(Range =.8-2%). The same stimuli were 
used for both lists. Half of the stimuli had 
high FAS primary responses (M = 55.5%; 
Range = 44-70%), and the other half had 
low FAS primary responses (M = 16.8% ; 
Range = 14.7-20.7%). The mean intrapair 
R-S FAS for List 1 was .2%, and for List 2, 
0%. Of the 360 possible extrapair interitem 
associations in each list, List 1 contained 
10 (M=1%), and List 2, 8 (M=.9%). 
The primary responses to the stimuli were 
all nouns with Thorndike-Lorge word fre- 
quencies of A or AA. 

Subjects—Thirty-two Ss were tested per 
list. The data of two additional Ss were 
discarded, one for misunderstanding instruc- 
tions, and the other because of a recording 
error by E. 


Results 


Figure 2 presents the mean item re- 
sponse latencies per trial for each pair 
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Fic. 1. Mean item response Jatencies on 
each trial of the three lists of Exp. I. 
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Fic. 2. Mean item response latencies per 
trial for each pair type of the two lists of 
Exp. IL. 


type of the lists. It may be seen that 
latencies were shorter for List 1 than 
for List 2, and that within each list, 
latencies were shorter for responses to 
stimuli with high FAS primary re- 
sponses than for responses to stimuli 
with low FAS primary responses. An 
analysis of variance in which the main 
effect of lists was a between-S factor, 
and the main effects of trials and pair 
type were intra-S factors, indicated 
that all three main effects were signif- 
icant beyond the .01 level, F (1, 62) = 
18.62; F (7, 434) = 19277; F (1, 
62) = 14.95. The only significant 
interaction was Lists X Trials, F (7, 
434) = 3.14, p<.01. The latencies 
for the two pair types within List 1 
and within List 2 differed significantly, 
t (31) = 175, p <.05; t (31) = 349, 
p <.005. Significantly more errors 
occurred for List 2 (N — 251) than 
for List 1 (N —72), p< .001, Kol- 
mogorov-Smirnov Two-Sample Test. 
Within-list errors were somewhat less 
for high FAS pairs, but not statis- 
tically evaluated because of excessive 
ties. 

In Exp. III, the effects of varying 
both forward and R-S FAS were 
studied. 


ExrPERIMENT III 
Method 


Lists—Two lists of 10 paired-associates 
each were constructed from free-association 
stimuli and their primary responses. The 
mean forward FAS of List 1 pairs was 
63.6% (Range =53.3-70.7%), and for List 
2, 26.6% (Range = 19-33.8%). The mean 
R-S FAS of half the List 1 pairs was 52.5% 
(Range = 33-71.3%), and 19.1% (Range = 
11.2-32%) for half of the List 2 pairs. For 
the remaining half of pairs in each list, 
there were either no associations or only 
idiosyncratic ones (excepting one pair of 
2%). Of the 360 possible extrapair inter- 
item associations in each list, List 1 con- 
tained 9 (M — 1996), and List 2, 5 (M= 
1.596). j 

Subjects.—Thirty-four Ss were tested per 
list, 


Results 


Figure 3 presents the mean item re- 
sponse latencies per trial for the two 
pair types of each list. It may be 
seen that latencies for List 1 were 
shorter than for List 2, and that laten- 
cies for bidirectional associates were 
shorter than for unidirectional asso- 
ciates in List 1, but the trend was re- 
versed in List 2. An analysis of vari- 
ance in which the main effect of lists 
was a between-Ss factor and the main 
effects of trials and pair type were 
intra-S factors, indicated that all three 
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Fic. 3. Mean item response latencies per 
trial for each pair type of the two lists of 
Exp. III. 
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main effects were significant, F (1, 
66) 28.57, p < 005; F (7, 462) = 
119.36, p < 001; F (1, 66) = 4.35, 
p< 05. The two-way interactions of 
Lists X Trials, Lists X Pair Type, 
and Trials X Pair Type were all sig- 
nificant, F (7, 462) = 2.62, p < .025; 
F (1, 66) 21835, p < 001; F (7, 
462) 2878, p< 001. The triple 
interaction was not significant, F < 1. 
Latencies over trials for bidirectionally 
associated pairs in List 1 were sig- 
nificantly shorter than for unidirec- 
tionally associated pairs, t (33) = 
6.66, p < .005, but performance on the 
two-pair types within List 2 did not 
significantly differ, £ (33) = 125. In 
List 1, somewhat less errors occurred 
fot bidirectional than for unidirec- 
tional associates, but in List 2, the 
opposite result was obtained. These 
within-list error differences were not 
statistically evaluated because of ex- 
cessive ties. 

Experiment IV was designed to 
study the influence of FAS upon ver- 
bal mediation. Mediation was tested 
in a three stage A-B, B-C, A-C media- 
tion paradigm in which the first two 
stages were assumed on the basis of 
normative data. The experiment was 
a partial replication of a study with 
children by Palermo and Jenkins 
(19642) which supported a model of 
mediated — paired-associate learning 
(Jarrett & Scheibe, 1962) using an 
error measure. The model asserts 
that the mediated associative strength 
between any two words (mg) is a con- 
tinued product of the chain of associa- 
tive probabilities (6’,) of any words 
which may link the two words. Thus, 
my for two unassociated words a and 
c, is the product of the &j and Osc 
when b is an associate of a, and c is an 
associate of b, but c is not an associate 
of a. On the basis of the model, it 
was expected that response latencies 
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for A-B and A-C pairs would vary 
inversely with the FAS of their direct 
(Oa) or mediated (meso) associative 
links, and would not differ for pairs 
with similar 6, and Meas values. 


EXPERIMENT IV 
Method 


Lists—Two lists of 12 paired-associates 
each were constructed. List A-B was com- 
posed of free-association stimuli and their 
primary responses. Within List A-B there 
were three sets of four paired associates, 
with mean 6» values of .44 (Range -—.40- 
49), 21 (Range=.17-25), and 410 
(Range = .08-.12). There were no intra- 
pair R-S associations and there were 9 
(M 6=.01) extrapair interitem associations 
out of a possible 528. List A-C was com- 
posed of free-association stimuli (different 
from the List A-B stimuli) paired with re- 
sponses not directly associated to A, but 
associated only through a mediating word, 
B. The B words were primary responses 
to the A words, and the C words were pri- 
mary responses to the B words. There were 
three sets of four paired-associates, with 
mean og, values of 20 (Range = .15-.26), 
07 (Range =.05-.09), and 02 (Range= 
01-03). The moas = 20 pairs were com- 
posed of associative links of high 6» (M= 
.50) and high 4. (M = 42) values, the Mogo 
= 02 pairs of low 40 (M —.11) and low 40 
(M=.19) values, and the mo, =.07 pairs 
were composed of low 4a» (M =.13) and 
high 6x (M —.55) values. There were no 
A-C or C-A associations, and of the 528 
possible extrapair interitem associations, 
there were 23 (M 0— 02). Within the 
four pairs of each of the three pair types in 
each list there was a noun-noun and an ad- 
jective-adejective pair. In the other two 
pairs of each pair type, one response was 
a noun and the other an adjective, and the 
parts of speech of the stimuli for these re- 
sponses were matched as nearly as possible 
across pair types. 

Procedure and Ss—The procedure for 
Exp. IV was the same as the earlier experi- 
ments except that the presentation rate was 
increased to 3.26:3.26 sec. (intertrial inter- 
yal =6.52 sec.) to minimize errors and 
omissions—which were assigned a latency 
of 327 sec. Thirty-two Ss were tested per 
list. The data of one additional S$ were dis- 
carded because E inadvertently terminated 
the experiment prior to the eighth trial. 
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Fic. 4. Mean item response latencies per 
trial for each pair type of the two lists of 
Exp. IV. 


Results 


Figure 4 presents the mean item re- 
sponse latencies per trial for each pair 
type of the two lists. It may be seen 
that the response latencies were 
shorter for List A-B than for List 
A-C. Within List A-B, latencies over 
trials for Om = .44 pairs were longer 
than 6,,—.21 and 6,,—.10 pairs, 
with the latter two pairs types result- 
ing in identical means. Within List 
A-C, latencies for moes =.07 pairs 
were longest, next longest for mw, = 
02, and shortest for m», = .20 pairs. 

An analysis of variance in which 
the main effect of lists was a be- 
tween-Ss factor, and the main effects 
of trials and pair type were intra-S 
factors, indicated that all three main 
effects were significant beyond the .01 
level, F (1, 62) — 45.65; F (7, 434) 
= 149.73; F (2, 124) = 2841. The 
two-way interactions of Lists X Trials, 
Lists X Pair Type, and Trials x Pair 
Type were all significant beyond the 
:001 level, F (7, 434) = 23.06; F (2, 
124) —53.15; F (14, 868) —291, 
and the triple interaction was signif- 
icant beyond .025, F (14, 868) = 2.25. 

On the basis of the model tested, it 
was expected that latencies over trials 


for the six pair types would increase 
in the following order: 6,,— 44, 
Oa» = .21 and Meas = .20 (not expected 
to differ), 65 — .10, moa = .07, and 
Msas = .02. However, the obtained 
order of increasing latencies was 6, = 
-10 and 6,,—.21 (identical), 6,, = 
44, mes, = .20, mes, = .02, and Mozo = 
.07. Comparisons between all pairs of 
pair types were significant beyond the 
.025 level when tested by appropriate 
t tests, except that of the ô,» = .10 and 
Oa» = .21 pairs. 

Significantly more errors occurred 
for List A-C (N = 289) than for List 
A-B (N=43), p< .001, Kolmogo- 
roy-Smirnov Two-Sample Test. 
Errors for the six pair types of the 
two lists generally increased in the 
same order as the latencies for the pair 
types. 

Experiment V was designed to 
clarify whether the mixed-list design 
employed in Exp. IV might be respon- 
sible for the failure to find that List 
A-B response latencies varied in- 
versely with FAS. A mixed list was 
constructed of high, moderate, and low 
FAS pairs whose response latencies 
were known clearly to vary inversely 
with FAS. If a mixed-list design has 
no intrinsic influence on the relation- 
ship between response latencies and 
FAS, then an inverse relationship be- 
tween FAS and response latencies 
would be expected as was the case in 
Exp. I, II, and III, all of which em- 
ployed unmixed-list designs in testing 
this relationship. 


EXPERIMENT V 
Method 


Lists—A single 12-pair mixed list was 
constructed on the basis of an item analy- 
sis of the latency data for the three lists 
of Exp. L Four pairs were selected from 
each list whose mean latencies over trials 
clearly varied inversely with their FAS. 
The mean FAS of the high strength pairs 
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was 62.5% (Range = 43-70%), for the 
moderate strength pairs, 23.1% (Range = 
21.3-24.6%), and for the low strength pairs, 
11.7% (10.8-13%) ; the mean item response 
latency per trial in Exp. I for these three 
pair types was 120 sec. 125 sec, and 1.36 
sec. respectively. There were 30 (M= 
1.7%) extrapair interitem associations out 
of a possible 528. 

Procedure and Ss—The procedure for 
Exp. V was the same as that of Exp. IV. 
Thirty-two Ss were tested. 


Results 


Figure 5 represents the mean item 
response latencies per trial for each 
pair type of the list, from which it may 
be seen that latencies over trials varied 
inversely with FAS. A Trials X Pair 
Type X Ss analysis of variance indi- 
cated that the pair type and trials 
effects and their interaction were all 
significant beyond the .01 level, F (2, 
62) = 29.29; F (7, 217) = 5634; F 
(14, 434) = 4.17. The interaction of 
Pair Type X Trials reflects the initial 
similarity in performance on moderate 
and low FAS pairs but divergence in 
performance for the two pair types on 
later trials, Performance on moderate 
FAS pairs in Exp. V on later trials 
did not attain that of high FAS pairs 
as in Exp. I, possibly because asymp- 
totic performance on high FAS pairs 
was reached earlier in Exp. I. The 
differences in the latencies for high vs. 
moderate FAS pairs, high vs. low 
FAS pairs, and moderate vs. low FAS 
pairs were all significant, f (31) = 
7.36, p«.001; t— 6275, p < 001, 
t=181, p<.05. The numbers 
errors for high, moderate, and low 
FAS pairs were 5, 31, and 26, respec- 
tively; statistical analyses were not 
performed because of excessive ties 
and too few errors. 


Discussion 


The results of Exp. I, II, UL and V 
clearly demonstrate that paired-associate 
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Fic. 5. Mean item response latencies per 
trial for the three pair types of Exp. V. 


response latencies of adult Ss vary in- 
versely with forward FAS. Moreover, 
the FAS effect appeared consistently 
across the eight trials tested (excepting 
high vs. moderate FAS pairs in Exp. I). 
These results were obtained despite the 
use of high word frequency stimuli 
which previous studies failed to demon- 
strate with trials to criterion and/or 
error measures (Haun, 1960; Jenkins, 
in press; Martin, 1964; Postman, 1962). 
In addition, unlike previous demonstra- 
tions of the FAS effect (Martin, 1964; 
Postman, 1962), Exp. I, III, and V, em- 
ployed primary responses for low as well 
as high FAS pairs. It should be noted 
that when only high FAS pairs contain 
primary responses, low FAS pairs are 
more likely to have less R-S FAS, to be 
less easily guessed than primary re- 
sponses, and to be more susceptible to 
interference from responses further up 
in associative hierarchies (Coleman, 
1964; Palermo, 1966). 

Relatively few errors occurred on most 
lists of the series of experiments, and 
their occurrence was largely confined to 
early trials. The percentages of errors 
occurring on the first two trials ranged 
from 66-100%/list. Since latency dif- 
ferences remained across trials, the la- 
tency measure does not simply reflect 
error differences. In addition, the in- 
fluence of assigned latencies on treat- 
ment means was minimal, since the la- 
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tency assigned to errors (1/100 sec. be- 
yond the anticipation interval) was not 
very great relative to the range of re- 
sponse latencies for correct anticipations. 
The present study, like  Peterson's 
(1965), clearly indicates that latencies 
decrease after errors cease. Unlike the 
earlier FAS studies with adults using 
high frequency stimuli, Exp. I, II, and 
III indicated that errors, though rela- 
tively slight, varied inversely with FAS. 
The error results in the present study 
may reflect such factors as the gramma- 
tical homogeneity of the paired asso- 
ciates, the strict control of R-S FAS 
and extrapair interitem associations, and 
the use of unmixed lists, which contain 
more pairs at a given FAS level relative 
to mixed lists of comparable length. 

The superior performance found in 
Exp. II for pairs in which the stimuli 
elicit high strength primary responses 
has been previously demonstrated with 
children (Wicklund et al., 1964; Wick- 
lund, 1964). Palermo (1966) has re- 
cently suggested that if a stimulus has 
many hierarchial responses of relatively 
similar FAS, these responses may be 
covertly aroused in learning and inter- 
fere with learning the actual responses 
paired with the stimuli. However, if a 
stimulus has one strong FAS response 
and all others are relatively weak, then 
presentation of the stimulus may result 
in interference from only this one inter- 
fering response in paired-associate learn- 
ing. When only one strong associate 
exists, S$ need only edit this strong and 
easily discriminable response as inap- 
propriate, whereas neither the editing 
nor discrimination task may be as easily 
accomplished when there are several ap- 
proximately equal, but lower strength 
responses. In conformity with this 
analysis, the mean FAS of the first five 
hierarchial responses in the norms for 
the high FAS primary condition in 
Exp. II was 55.595 (primary), 7.5%, 
5.4%, 4.3%, and 3.7%, respectively, and 
16.8% (primary), 12.5%, 8.5%, 5.7%, 
and 5% for the low FAS primary condi- 
tion. 

The results of Exp. III indicating that 
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bidirectional FAS can facilitate paired- 
associate learning have been also re- 
ported recently with children in terms of 
an error measure (Gallagher, 1966). 
The anomalous results of Exp. IV ap- 
parently lent little support for the model 
that was tested. Furthermore, contrary 
to the results of Exp. I, II, and III, even 
latencies for directly associated pairs 
(List A-B) did not vary inversely with 
FAS. Exp. V, however, indicated that 
response latencies did vary inversely 
with FAS using a mixed-list design and 
the same list length and presentation in- 
tervals as Exp. IV. The negative re- 
sults of List A-B, therefore, do not 
appear attributable to the use of a mixed- 
list design or to the longer list length 
and presentation intervals than in the 
earlier experiments. While the mixed 
list per se does not appear responsible 
for the results of List A-B, the list did 
contain fewer pairs representing each 
FAS level than in the unmixed lists of 
the earlier experiments. Fewer pairs at 
à given FAS level would promote the 
possibility that FAS effects were offset 
by internal item inconsistencies. The 
item analysis of Exp. I indicated that 
response latencies did overlap for some 
pairs of different FAS levels, and an 
item analysis of Exp. IV indicated that 
the discrepant results of Lists A-B and 
A-C were primarily a function of one or 
two pairs within a particular FAS level. 
For example, the inferior performance 
On Moas = .07 pairs in List A-C relative 
to Moas = .02 pairs was largely due to the 
two Meg. pairs WORK-sOFT and TROUBLE- 
Goop, which Ss reported to be “incon- 
grous” to associate. The potential sensi- 
tivity of the response latency measure to 
other factors in addition to FAS may 
prove useful in investigating residual in- 
fluences upon learning associated words, 
such as the semantic, logical, and gram- 
matical relationships existing between 
normative stimuli and their responses. 
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Ss judged the “sameness” of pairs of matrixes, each having several 
cells blackened at random. One group (Similarity search) classified 
matrix pairs as “same” if they had even one pair of blackened cells 
in corresponding locations. A second group (Identity search) classi- 
fied pairs as “same” that were identical with respect to all blackened 
cell locations. Decision times of the Similarity-search group were 
much longer than those of the Identity-search group. While the per- 
formance of the Similarity-search Ss indicated they were engaged in 
a serial processing of the stimulus array that of the Identity-search 


Ss in some conditions closely approximated gestalt processing. 


Human beings, like all pattern 
recognition devices, are frequently re- 
quired to decide whether two stimuli 
are the same or different. Although 
these judgments of “same” and “dif- 
ferent” constitute an important com- 
ponent of much of our behavior, the 
processes involved have only recently 
begun to receive direct treatment (e.g., 
Bindra, Williams, & Wise, 1965; 
Walk, 1966). Before constructing 
comprehensive theories of such judg- 
ments, it may be useful to explore the 
likelihood that all trials which term- 
inate in same responses may not repre- 
sent the operation of some invariant 
or unitary process. The length of time 
required for recognition of sameness 
can provide a sensitive index to the 
processes underlying that recognition 
(Sekuler, 1966). Research described 
in this report shows first, that humans 
can adopt any of several different 
modes of sameness recognition, and, 
second, that within certain limits im- 
posed by the complexity of visual pat- 
terns to be processed, Ss can often 
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operate more efficiently by processing 
patterns as wholes rather than in terms 
of constituent parts. 

That sameness recognition is not a 
unitary, invariant process becomes 
especially clear if it is considered how 
one might instruct a pattern recognizing 
machine to classify pairs of inputs as 
either same or different. There are 
two basic kinds of instructions one 
could give the machine. On one hand 
the machine might be required to 
classify in terms of all the discrimin- 
able characteristics of the input. It 
might compare two patterns along 
some exhaustive catalogue of details 
and, finding a mismatch for any of 
them, respond “different.” This strat- 
egy is a search for identity between 
patterns and would, e. g., permit a 
machine to declare an italic a and a 
boldface a different. On the other 
hand, one might want to train the 
recognition machine to make its classi- 
fication on the basis of a small number 
of critical details, ignoring those which 
are unimportant for the classification. 
In training a system to recognize and 
classify as same letters a written by a 
number of different people, one must 
train it to disregard certain noncri- 
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terial differences among inputs, even 
though such differences were highly 
discriminable. This second kind of 
recognition process would constitute a 
search for similarity between patterns. 
The authors have explored the impli- 
cations of these two kinds of pattern 
recognition strategies and the distinc- 
tive varieties of sameness which they 
represent by asking human observers 
to adopt such strategies in a visual pat- 
tern recognition experiment. 


METHOD 


The Ss viewed pairs of matrixes arranged 
side by side. Each of the matrixes was 
four cells X four cells, In both members of 
a pair the same number of cells (1, 2, or 
4) were blackened at random. Various 
numbers of cells in corresponding locations 
in both matrixes were blackened, eg, with 
four cells blackened in the right hand matrix 
either 0, 1, 2, 3, or 4 of the corresponding 
cells in the other matrix were also black- 
ened. Hereafter, corresponding cells filled 
in both matrixes will be said to be in reg- 
ister. 

Two groups of six undergraduate Ss each 
were used. The Ss in the “Tdentity-search” 
group were told to classify two matrixes as 
the same if for every cell filled in one matrix 
the corresponding cell was filled in the other. 
The Ss in the "Similarity-search" group 
were instructed to call two matrixes same 
if they had at least one filled cell in cor- 
responding locations. Members of both 
groups were told to call matrixes different 
if their respective criteria for same were 
not met. 

Stimuli were presented with a 16-mm. 
film strip projector equipped with an electro- 
mechanical shutter. The Ss viewed the ma- 
trixes on a rear projection screen mounted 
in one channel of a tachistoscope viewing 
box. The other channel provided a constant 
background of about 1.5 ftl. The matrixes, 
viewed over a distance of 59 cm, each sub- 
tended a visual angle of 8° and were separ- 
ated from one another by 4° of visual angle. 
Viewing was binocular. 

For each group four film strips of 300 
frames were prepared. In any strip the 
stimuli were so arranged that same and dif- 
ferent stimuli occurred with approximately 
equal frequency and in random order. _Pro- 
gramming equipment controlled the inter- 
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trial interval (3 sec.) and the advance of 
the film strip. The  clectromechanical 
shutter opened when Ss pushed a button 
and closed when they responded “same” or 
“different” on a spring loaded toggle switch. 
A clock (Standard Electric) measured the 
interval between the presentation of the ma- 
trixes and S’s response. This interval and 
the “same” or “different” response on each 
trial was recorded by E. Each S was tested 
individually in 12 sessions, with three repli- 
cations of every film strip. The order of 
film strips was randomized within blocks of 
4 sessions. In order to counterbalance the 
potentially differential difficulty of throwing 
the response switch to one side or the 
other, half of the Ss in each group used 
leftward switch throw to signal “same,” 
rightward throw to signal “different.” The 
other half of each group had the reverse 
assignments of response to switch positions. 


RESULTS 


Figure 1 shows for each group and 
number of cells filled, the relationship 
between choice times and the number 
of cells in corresponding locations in 
the two matrixes. These data include 
only trials on which the correct re- 
sponse was made. Separate analyses 
of variance were performed on the 
mean choice times for Ss in each group 
and condition of number of cells filled. 
Tn all of the analyses Replication was a 


CHOICE TIME iN sec. 
pen 
EE 
: 


CELLS IN REGISTER 


oU Zu Exi 
$i 


Fic. 1. Mean choice times for each group 
and replication. (The results for conditions 
of one, two, and four cells filled in each 
matrix are shown in panels A, B, and iG 
respectively. In each case the number of 
filled cells that occupy corresponding loca- 
tions in both matrixes is shown on the 


abscissa.) 
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significant source of variance. Each 
panel in Fig. 1 shows choice times de- 
crease systematically over the three 
replications. With only one cell filled 
(Fig. 1, Panel A) the choice times of 
either group are unaffected by the 
number of cells in register (F < 1.0 
for both). The correct response for 
each group varied, of course, depend- 
ing upon the number of cells in cor- 
respondence. For example, with one 
cell filled and none in correspondence 
the correct response for Similarity- 
search Ss was "different"; with one 
cell in correspondence the correct re- 
sponse was "same." The fact that 
mean choice times do not differ with 
number of cells in correspondence sug- 
gests that, at least, in the single case 
where only one cell is filled, there is 
no basic asymmetry in times required 
to respond "different" or "same" (cf. 
Bindra, Williams, & Wise, 1965). 

The results are somewhat modified 
when two cells in each matrix are filled 
(Fig. 1, Panel B). For the Simi- 
larity-search group there is a signif- 
icant effect, F (2, 10) = 32.80, p< 
-01, of number of cells in register: 
Choice times decrease with increasing 
numbers of cells in register. There is 
no significant effect, F (2, 10) — 1.18, 
P > .25, however, of number of cells 
in register for the Identity-search 
group. Choice times for these latter 
Ss are equally fast with 0, 1, or 2 cells 
in register. 

With four cells in each matrix filled 
(Fig. 1, Panel C), there is for both 
groups a significant effect of the num- 
ber of cells in register. But the char- 
acter of this relationship is quite dif- 
ferent for the two groups. The Simi- 
larity-search group's choice times 
decrease, F (4, 20) = 25.32, p < 01, 
with the number of filled cells in reg- 
ister. Choice times of the Identity- 
search group increase, F (4, 20) — 


9.80, p<.01, with the same inde- 
pendent variable. In addition to the 
difference between the signs of these 
relationships there is an obvious dif- 
ference in absolute magnitude. Lines 
were fitted to each function shown in 
Panel C by the method of least squares. 
For the Similarity-search group the 
slopes of the best fitting lines were 
—.21, —.18, and —.18 for each of 
the replications while the correspond- 
ing values for the Identity-search 
group were +.02, +.01, and +.02. 
"These differences in slopes between the 
two groups confirms the apparent dif- 
ferences in Fig. 1C: the effect of the 
number of cells in register is about an 
order of magnitude greater for the 
Similarity-search than it is for the 
Identity-search. 

One additional noteworthy relation- 
ship can be seen in Fig. l. Within 
any given replication of any condition 
it is true that all of the choice times 
for the Identity-search group are lower 
than the choice times for the Simi- 
larity-search group. Regardless of the 
number of cells filled or the number of 
such cells in register, Identity-search 
Ss are performing more quickly than 
are their Similarity-search counter- 
parts. Some previous experiments 
(e. g., Nickerson, 1966a) have shown 
that in some circumstances S's achieve 
faster choice times at the expense of 
increased error rates. To determine 
whether the overall superior speed for 
the Identity-search group was achieved 
at the cost of increased errors, analyses 
of variance were done which permitted 
direct comparisons of the error rates 
of the two groups of Ss. A sepa- 
rate analysis of variance was per- 
formed on the error rate data for 
each condition of number of cells filled 
1, 2, and 4, Entries in the analysis 
were percentage of errors made by 
each S within a given replication. The 
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TABLE 1 


Correct RESPONSE ALTERNATIVE, NUMBER OF TRIALS, M AND SD 
or NUMBER WRONG FOR EAcH GROUP, AND 
MATRIX CONFIGURATION 


Number of Cells Filled 


In Register 1 


K 
e 
x 
[3 
e 


Similarity Search 
Correct Response* D S D 
Number of ‘Trials? 582 | 576 | 579 

Mon Responses 


28.7 | 19.7 | 49.1 

SD 21.5 | 9.3 | 23.2 
Identity Search 

Correct Response D s D 

Number Trials 585 | 585 | 285 
Wrong Response 

M 70.3 | 53.0 | 30.0 

SD 54.0 | 149 | 7.4 


a“D" = different, ''S" = same. 
b Number of trials for each S in each replication. 


results of the analyses may be sum- 
marized as follows. In none of the 
three was the main effect of the iden- 
tity-search vs. Similiarity-search a sig- 
nificant source of variance (all ~’s > 
10). In each analysis the number of 
cells in register proved to be a signif- 
icant source of variance (all p’s < .05). 
The interaction between search group 
and number of cells in register was sig- 
nificant (p < .01) with two and four 
cells filled but not with only one cell 
filled (p.50). The interaction be- 
tween number of cells in register and 
replications was significant (p < .05) 
for the analyses of both the one cell 
and two cell filled conditions but was 
was not a significant source of variance 
in the analysis of the four cell filled 
data (52.10). The triple interac- 
tion, between search group, number of 
cells in register, and replications, was 
significant only in the case of two cells 
filled (p < .05). 

Table 1 gives the mean error rates 
for each group and matrix configura- 
tion. 


DISCUSSION 


The differences in the patterns of 
choice times for the two experimental 
groups show that they went about their 
respective pattern recognition tasks in 
quite different ways. For both two- and 
four-cell filled conditions, the choice times 
for the Similarity-search Ss decrease 
with increasing number of cells in register 
between matrixes. This relationship be- 
tween choice times and number of cells in 
register would be expected if the proc- 
esses leading to S’s response included a 
major self-terminating, serial search com- 
ponent (Egeth, 1966). These results for 
the Similarity-search group can be com- 
pared to expectations derived from the 
following simple model. As the number 
of filled cells in correspondence between 
matrixes increases so does the likelihood 
that, for any randomly selected filled cell 
in one matrix the corresponding cell in 
the other matrix will also be filled. Since 
the discovery of one pair of filled cells in 
correspondence between matrixes satisfies 
the sameness criterion of the Similarity- 
search, the mean number of "cellwise" 
comparisons required to reach this cri- 
terion will decrease with increasing num- 
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TABLE 2 
EXPECTED VALUE oF NUMBER oF CELLWISE 
Comparisons REQUIRED BY SIMILARITY- 
SEARCH Ss to CLASSIFY MATRIXES 


Number of | Number of Filled Cells in Correspondence 
Cells Filled 
in Each 


Matrix of 


Pair 0 1 2 3 4 
1 1.00 | 1.00 | — E: e 
2 2.00 | 1.50 | 1 5 
4 4.00 | 2.50 | 1.67 


1.25 | 1.00 


ber of cells in correspondence. One can 
calculate the expected value of the number 
of "cellwise" comparisons that would be 
required for Ss in each condition, These 
expected values, given in Table 2, can 
be developed on the assumption that Ss 
are using a serial self-terminating search 
strategy. To indicate how such values 
are derived consider the case of a Similar- 
ity-search S$ confronted with a pair of 
matrixes each having two cells filled but 
with only one of the pairs of filled cells 
in corresponding locations. Sampling ran- 
domly and without replacement from the 
two filled cells in one matrix, the prob- 
ability that the corresponding cell will also 
be filled in the other matrix is .50, This 
means that on one-half of the trials a 
match will occur in the first “cellwise” 
comparison and the search terminated 
with a response of same. If a match had 
not been found, it assumed that S would 
resample, and on this second comparison, 
the two corresponding cells compared 
would with certainty both be filled. Thus 
the expected value of the number of com- 
parisons required in this condition is 
S(1) +.5(2)=1.5. While it would be 
nice to be able to describe the behavior of 
the Similarity-search group in such sim- 
ple terms as is implied by the entries in 
Table 2, it is clear that a more com- 
plicated formulation will be required, For 
example consider what happens when the 
data shown in Fig. 1 for the Similarity- 
search Ss are replotted against another, 
altered abscissa. If the Similarity-search 
had proceeded, as Table 2 implies, in a 
series of “cellwise” comparisons of ap- 
proximately constant duration the choice 
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time functions should be linear when re- 
plotted against the appropriate expected 
value of Table 2. Plotting choice times 
against these expected values yields func- 
tions considerably further from linearity 
than those shown in Fig. 1. This sug- 
gests that Ss did not conform perfectly to 
the details of the logic which underlies 
the expected values derived for Table 2, 
Such deviations from the model could 
arise from failure to meet any one on 
more of its assumptions. Data on the 
ability of Ss, in channel capacity experi- 
ments, to recognize spatial position in 
stimulus contexts like our matrixes (see 
Garner, 1962, pp. 53-97) suggests that 
given enough time Ss should not have 
had difficulty in sampling without replace- 
ment. That is, Ss should have been able 
to distinguish perfectly cells already sam- 
pled and those not yet sampled. With a 
considerably greater number of cells, both 
filled and unfilled, it is obvious that Ss, 
required to operate beyond channel capac- 
ity for recognition of spatial position, 
would tend to confuse cells already sam- 
pled and those not yet sampled. Under 
these circumstances the sampling for cell- 
wise comparison could not be sampling 
without replacement, 

There is another factor which probably 
contributes to the failure of the “cellwise” 
Processing model. Egeth (1966) has dis- 
cusssed the contribution of irrelevant in- 
formation to performance in a task not too 
unlike our own.  Egeth's data argue 
strongly that Ss required to categorize 
stimuli on certain dimensions and not on 
the basis of others are very likely to be in- 
fluenced in some measure by the presence 
of the latter, irrelevant stimuli. In the 
present experiment the unfilled cells in 
each matrix may constitute just such a 
set of irrelevant stimuli. The simple model 
outlined above therefore, may also be defi- 
cient because it neglects to consider the 
time required to locate filled cells in the 
matrix. This time to locate filled cells 
would clearly be some function of any 
time required to decide against further 
consideration of any of the unfilled cells. 

The more interesting’ results in this ex- 
periment however are those from the 
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other, Identity-search, group. The striking 
difference between the performance of 
the two groups that can be seen in Fig. 1 
leads one to wonder what possible search 
strategy Ss in the Identity-search group 
could have employed. With two-cells 
filled in both matrixes (Fig. 1, Panel B) 
the choice times of the Identity-search 
group are constant over conditions with 0, 
1, or 2 filled cells in correspondence. 
This suggests that these Ss were process- 
ing the matrixes as gestalts, comparing 
the whole of one matrix with the whole 
ofthe other, Their performance resembles 
what would be expected on the basis of 
some template matching process and is 
especially interesting because their choice 
times are always as short or shorter than 
the corresponding times for the Similar- 
ity-search group. This means that despite 
the apparent greater complexity of tem- 
plate matching over cellwise comparisons, 
Ss with little or no practice in the task 
can process relatively complicated visual 
patterns as wholes in less time than is re- 
quired to process any of the constituent 
parts of those wholes. This superiority 
in processing time may again reflect the 
fact that Ss, using some kind of cellwise 
comparisons, must spend some part of the 
processing time on irrelevant stimuli, i.e., 
the unfilled cells. 

There does seem to be rather severe 
limits on the complexity of patterns that 
$8 can process as wholes. With four cells 
filled (Fig. 1, Panel C) the analysis of 
variance showed a small, but significant 
effect of the number of cells in register 
upon the choice times of the Identity- 
search Ss. With four cells filled, Ss in 
this group were not able, as they were 
with only two cells filled, to process the 
matrixes as wholes. Since the slopes of 
the choice time functions for the two 
groups differ by an order of magnitude, it 
is unlikely that Ss in the Identity-search 
group used the same pattern recognition 
process as their Similarity-search counter- 
parts, Any one of a number of “hybrid” 
pattern recognition strategies (i.e, com- 
binations of serial processing and tem- 
plate matching) could accommodate the 
data from the four-cell filled condition in 


the Identity-search. In the absence of 
additional, more severely constraining 
data further speculation on the character- 
istics of such a hybrid process is not very. 
useful, 

The complexity of the relationship be- 
tween error rate and choice time in this 
experiment can be seen by comparing 
corresponding values for each in Table 1 
and Fig. 1. Consider, e.g., the relation- 
ships between error rate and choice time 
for the case of four cells filled. As Table 
1 shows, for the Similarity-search group 
there is a tendecy for error rates to de- 
crease with increasing number of cells in 
register. But, for the Identity-search 
group the error rate increases with the 
number of cells in register. Comparing 
these trends to the curves in Panel C of 
Fig. 1 it can be seen that in the case of 
the Similarity-search group the decrease 
in choice times with increasing cells in 
register is accompanied by a parallel 
decrease in error rate while for the 
Identity-search group there is a quite dif- 
ferent relationship. For these latter Ss 
the slight increase in choice times with 
number of cells in register is accom- 
panied by an increase in error rate. This 
picture is further complicated by the data 
for two cells filled in which the curvi- 
linear relationship between error rate and 
number of cells in register (Table 1) does 
not seem to be systematically reflected in 
the choice time data of Fig. 1B, The com- 
plexity of these relationships makes it 
abundantly clear that the problem of treat- 
ing incorrect responses in choice time 
studies is far from general solution (cf. 
Egeth & Smith, 1967). 

Some investigators have reported a sys- 
tematic difference in the times required 
to judge stimuli the same as opposed to 
judging stimuli different (Bindra, Wil- 
liams, & Wise, 1965). The differences 
between the performance of our two search 
groups are not related in any obvious way 
to differences between times required for 
same and different judgments. Consider, 
e.g. the data for the two cell filled condi- 
tion (Fig. 1B). With either 0 or 1 
cell in register the correct response for 
Ss in the Identity-search group was 
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"different" For the Similarity-search 
group "different" was the correct re- 
sponse to the O cells in register condition. 
Despite this commonality of response to 
these several conditions, Fig. 1B shows 
that the times required by the Identity- 
search group are substantially less than 
those for the Similarity-search group. 
The same argument applies to the case of 
two cells filled and two cells in register. 
For both groups the correct response to 
the stimulus was "same" but again the 
times required by the Identity-search Ss, 
within each replication, are less than 
those for the Similarity-search Ss, These 
data and the other relevant comparisons 
that could be made in Fig. 1A and 1C 
indicate that the differences between Sim- 
ilarity-search and Identity-search are not 
merely some function of differences be- 
tween "same" and "different" judgment 
times. 

The major point then, is that when 
there are not too many cells filled in each 
matrix, Ss of the Identity-search group 
could process the patterns as wholes. 
This resembles the finding of Neisser, 
Novick, and Lazar (1963) that, after con- 
siderable practice, 5s could search a list 
for any one of 10 alphanumeric target 
characters as quickly as they could for 
only one of them. Nickerson (1966b) 
recently proposed that Neisser's result 
could only be obtained when two experi- 
mental preconditions held. The first of 
these was sufficient practice with the task, 
(Neisser's Ss required more than 20 days 
practice to develop their terminal be- 
havior.) The second condition was “. . . 
the use of a paradigm which requires 
activation of the same set of feature-de- 
tecting operations trial after trial [1966b, 
p. 767]." The results in the present ex- 
periment argue against the necessity of 
Nickerson's suggested prerequisites, Iden- 
tity-search Ss showed behavior like Neis- 
ser’s Ss even within the first replication 
of the experiment, ie., within Sessions 
1-4. Moreover, the fact that cells in the 
matrixes were filled at random ruled out 
any appreciable trial-to-trial similarity 
among patterns shown to Ss. 

The authors’ results then make it quite 


clear that Ss can and do adopt distinctive 
modes of pattern recognition. When the 
complexity of the patterns permits, Ss can 
process the patterns as wholes with a con- 
siderable saving in processing time. This 
reduced time does not seem necessarily to 
be purchased at the cost of increased 
error rate. While this experiment seems 
to suggest a surprisingly severe limit to 
the complexity of patterns that can be 
processed as wholes, this limit may be a 
product of the random patterns the au- 
thors used. Far more complicated config- 
urations may prove amenable to gestalt 
processing provided such configurations 
conform to the stimulus requirements of 
feature analyzers responsible for the ex- 
traction of certain important stimulus 
characteristics such as visual orientation 
and position (Sekuler & Pantle, 1967). 
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The Ss recalled lists of 12, 16, 20, and 24 common words on a single 
trial. 4 types of lists were used. C lists consisted of unrelated words, 
while E, M, and D lists contained 4 highly related words in addition 
to unrelated words. The 4 highly related words appeared as a cluster 
at the end of E lists, in the middle of M lists, and were distributed 
throughout each list in D lists. Recall of words was highest for 
M lists, followed by E, D, and C, in this order. The number of 
recalled functional units, however, was identical for M and C, and 
lower in E than in M and C. The findings suggested that (a) highly 
related words are retrieved from secondary memory as a single func- 
tional unit, (b) unitization of related words in primary memory occurs 


only to a small extent, and (c) the number of retrieved functional 


units is independent of the size of the units. 


Models of free recall 


postulating two types of memory store and transfer of information 
from one store to the other appear to be inconsistent with the data. 


A more appropriate view seems to 
and secondary memory represent di 


A typical S, who is presented with 
a list of familiar verbal items, such as 
common words, and immediately after- 
ward is asked to recall the items in any 
order they occur to him, cannot recall 
all items if the list length exceeds a 
critical value. But if S recalls the list 
again, he can frequently reproduce 
items he did not recall in the first 
output phase, even in absence of an 
interpolated input phase (Tulving, 
1967). The recall in the mth output 
phase of a certain number of “new” 
words—words not recalled in the 
n-Ith output phase—is usually accom- 
panied by the failure to recall an equi- 
valent number of “old” words—words 
recalled in the n-th output phase. 

This finding suggests that the fail- 
ure to recall some of the presented 
items in a given output phase does not 
necessarily reflect the failure of getting 


. ^ This research was supported by the Na- 

tional Research Council of Canada Grant 
No. APT-39 and the National Science 
Foundation Grant No. GB 3710. 


be one according to which primary 
fferent types of retrieval mechanism. 


appropriate information about list 
items into the memory store, nor does 
it necessarily reflect the failure of 
maintaining this information in the 
store. Nonrecall of some items rather 
seems to reflect the limited capacity 
of the retrieval mechanism to find 
access to the information available in 
the store. It is as if in any given out- 
put phase the retrieval mechanism has 
access to a limited number of units of 
stored material Access to one unit 
seems to reduce the probability of 
access to other units. In other words, 
it is as if free recall of discrete verbal 
units is limited because the retrieval 
mechanism has a limited capacity. 
If the capacity of the retrieval mech- 
anism is limited to a fixed number of 
units of material, how does one inter- 
pret systematic variations in the num- 
ber of items recalled when factors such 
as list length, rate of presentation, and 
strength of interitem associations (e.g., 
Deese, 1959, 1960; Murdock, 1960) 
are varied? One possibility is to 
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assume that these variables affect the 
size of the units that are retrieved in- 
dependently of one another. Two or 
more nominal list-items may be 
grouped together into a “chunk,” or 
"unitized" into a higher-order func- 
tional memory unit (Miller, 1956a, 
1956b; Tulving, 1966b, 1968) on the 
basis of their associative relations, se- 
mantic similarity, or other factors. 
Even if the number of accessible units 
remains constant, the overall recall, 
measured in terms of the number of 
recalled nominal list-items, does co- 
vary with the size of the functional 
units. 

The present experiment was de- 
signed to further explore the relation 
between the size of functional units 
and the number of recalled functional 
units under the conditions of a single- 
trial free-recall (FR) experiment. 
The Ss were shown lists of common 
words and asked to recall the words 
in each list immediately after the pre- 
sentation of the list. The critical ex- 
perimental manipulation had to do 
with the nature of the words in the 
lists. In addition to control lists, con- 
sisting of words randomly drawn from 
a larger pool, there were other lists 
each of which included a group of 
strongly related words, such as 
FATHER, MOTHER, BROTHER, SISTER, Or 
COW, HORSE, DOG, CAT. It was ex- 
pected that S would unitize each group 
of such strongly related words into a 
higher-order functional unit, partic- 
ularly if all the related words occurred 
in contiguous input positions in the 
list. Unitization of randomly selected 
words was expected to be more dif- 
ficult and result in much smaller func- 
tional units. It was hoped that the 
comparison of recall of nominal units, 
ie. individual words, as well as func- 
tional units, ie. groups of strongly 
related words, across different kinds 
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of lists, would provide some evidence 
relevant to the hypothesis that the 
number of recalled functional units is 
independent of the size of these units, 


METHOD 


Design.—There were two treatment vari- 
ables in this experiment: (a) length of the 
list—12, 16, 20, and 24 words, and (b) type 
of the list—C, D, M, and E, as described 
below. The íour types of list and the 
four levels of list length were combined fac- 
torially to yield 16 different treatment con- 
ditions. In addition, an eight-word C list 
was used, making a total of 17 treatment 
conditions. Each condition can be desig- 
nated in terms of the nature and the length 
of the list, e.g, C-16, D-24, E-12, etc. 

Each S was tested once under each of 
the 17 conditions, with the order of the 
treatment conditions for each S being deter- 
mined by means of a table of random num- 
bers. 

Lists—All lists consisted of common 
English nouns, or words that could be used 
as nouns, drawn from a basic pool of 296 
such words. The pool contained 12 sets 
of 4 highly related words (R words) 
plus 248 "unrelated" words (U words). A 
set of R words consisted of either four 
instances of an exhaustive conceptual cate- 
gory (Cohen, 1963), such as NORTH, SOUTH, 
EAST, WEST, or of four high-frequency mem- 
bers of a nonexhaustive category (Cohen, 
Bousfield, & Whitmarsh, 1959) such as ARM, 
HAND, LEG, Foot. The U words were in- 
cluded in the pool without any regard to 
their meaning. It is only in this sense that 
they are referred to as unrelated. The 
Thorndike-Lorge (1944) General Count 
frequencies were matched for R and U 
words. Approximately 75% of all words 
had frequencies of 50 words per million or 
over, the remaining 25% of the words 
ranged in frequency 4-49 per million. 

Sets of R words and individual U words 
were drawn randomly without replacement 
from the basic pool to make up lists of 
the required length and of different types 
separately and independently for each S. 
The four types of list were: 


1. C (Control) lists, each consisting only 
of U words, the number of words in each 
list being determined by list length (L). 

2. D (Distributed) lists, each consisting 
of four R words from one of the 12 sets, 
plus L-4 U words. The R words were dis- 
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tributed throughout the list. They occupied 
serial positions of 3, 6, 9, and 11 in 12-word 
lists; 3, 7, 12, and 15 in 16-word lists; 
3, 8, 15 and 19 in 20-word lists; and 3, 9, 
18, and 23 in 24-word lists. 

3, M (Middle) lists, each consisting of 
four R words from 1 of the 12 sets, 
plus L-4 U words. The R words were 
"blocked" in the middle of the list, occupying 
serial positions of 7, 8, 9, and 10, counting 
from the end of the list, ie., serial positions 
L-6, L-7, L-8, and L-9, with L=12, 16, 
20, or 24. 

4. E (End) lists, each consisting of four 
blocked R words plus L-4 U words, as in 
M lists, except that all R words occupied 
the last four input positions in each list. 
Thus, in 12-word lists, R words were pre- 
sented in serial positions 9, 10, 11, and 12; 
in 16 word lists they occurred in positions 
13, 14, 15, and 16, etc. 

Subjects.—Sixty students of both sexes 
taking psychology courses at the University 
of Toronto in the summer of 1966 served 
as Ss in this experiment. Participation as 
Ss in psychological experiments was @ 
course requirement for these students. 


Procedure.—Words were presented to Ss 
by means of flash cards. Each word was 
hand-printed on a 3 X 5 in. white index card. 
The E turned over the cards in a prepared 
deck constituting the list, at the rate of 
1.5 sec, per card. The S read each word 
aloud as it was presented and at the end 
of the list—signalled by the appearance of 
a blank card—recalled the words orally at 
a self-paced rate. The recall was recorded 
on magnetic tape which was later used to 
verify the written record of S's recall made 
by E at the time of recall. Sufficient num- 
ber of blank cards were placed at the end 
of each deck to prevent S from estimating 
the length of a particular list from the thick- 
ness of the deck. 

The amount of time given for recall 
varied with list length. A maximum of 3 
sec. per word in the input list was allowed 
for recall, but when S indicated that he 
was unable to recall any more words from 
the list E proceeded to present the next 
list. Thus, interlist interval varied from 
list to list, depending on the length of the 
list presented, and from S to S. A new list 
was presented approximately 10 sec. after 
the conclusion of the preceding 

The instructions to Ss were to recall as 
many words from each list as they could, 
in the order in which the words 
to them, The instructions did not include 
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Fic. 1.. Mean number of words recalled as 
a function of total presentation time/list. 


any information about the nature of com- 
position of lists, the length of lists, or the 
number of lists to be learned. 


RESULTS 


Overall recall data—As the first 
step in the analysis of the data, the 
mean number of words recalled was 
calculated for each of the 17 lists. 
These means are plotted as a function 
of total presentation time (T) per list 
for C, M, and E lists in Fig. 1. Since 
words were presented at the rate of 
1.5 sec. each, T of 12 sec. corresponds 
to 8-word lists, T of 18 sec. corre- 
sponds to 12-word lists, etc. The ver- 
tical lines drawn through all data-points 
indicate the 95% confidence intervals 
of the corresponding population means. 

Figure 1 shows that the mean num- 
ber of words recalled was a monoton- 
ically increasing and approximately 
linear function of total presentation 
time (and hence of list length) for 
these three types of list. It also shows 
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that recall was highest for M lists and 
lowest for C lists. 

The straight lines shown in Fig. 1 
were fitted to the data by the method 
of least squares. The equations of 
these functions were as follows: M 
lists, R = .060T + 7.12; E lists, R= 
.069T + 5.42; and C lists, R = .065T 
+ 4.53, where R refers to the mean 
number of words recalled and T repre- 
sents the total presentation time in sec- 
onds. These equations indicate that the 
slopes of the three functions were prac- 
tically identical and that the three 
types of list differed only in absolute 
levels of recall, a conclusion also ap- 
parent from the three functions de- 
picted in Fig. 1. 

The mean recall data for D lists 
yielded a more irregular picture. The 
means, with standard deviations given 
in parentheses, corresponding to list 
lengths of 12, 16, 20, and 24, were 
677 (1.57), 6.45 (1.82), 7.25 (179), 
and 7.52 (2.17). Thus, mean recall 
for the D-12 list was higher than for 
the D-16 list, although the difference 
was not significant: ¢ (59) — 1.17, 
p.05. 

The inversion between 12-word and 
16-word D lists in the function relat- 
ing mean recall to list length is prob- 
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Fic. 2. Proportion of words recalled as a 
function of input position. (The data points 
on the far left correspond to the last word 
in each list.) 
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ably attributable to the fact that in the 


12-word lists two of the R words. 


occurred in the last four input serial 
positions (Positions 9 and 11), while 
in the 16-word lists only one of the 
R words occurred in the last four posi- 
tions (Position 15). The relevance of 
input position to probability of recall, 
not only in D lists but in others as 
well, will become apparent in the fur- 
ther analysis of the data. 

Serial position curves—The next 
analysis of the data had to do with the 
relation between probability of recall 
and the serial position of words in 
input lists.  Serial-position curves 
were calculated for each of the 17 lists. 
The scope of the paper, however, per- 
mits the presentation of only those fea- 
tures of serial-position curves that are 
directly relevant to the main purpose 
of the experiment. 

Inspection of serial-position curves 
derived from C, M, and E lists re- 
vealed that the relation between prob- 
ability of recall and serial position at 
and near the end of the list (the 
recency effect) was largely independ- 
ent of list length within a given type 
of list, although it varied between lists. 
(Some relevant data in support of this 
conclusion will be presented later in 
this paper.) Consequently, the recall 
data for the last 10 input items in each 
list were pooled over all levels of list 
length within each of the three types 
of lists (C, M, and E). Eight-word 
C lists were not included in the analy- 
sis. 

Figure 2 depicts a summary of these 
data. The abscissa of the graph in 
Fig. 2 represents serial positions of 
words in input lists, the leftmost point 
(L) corresponding to the last word 
in the list, the next (L-1) correspond- 
ing to the penultimate word, and so on 
to L-9. The ordinate shows the pro- 
portion of words recalled, with the 
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proportion of 1.00 corresponding to 
the recall of 240 words (60 Ssx4 
Lists). 

The terminal part of the serial-posi- 
tion curves depicted in Fig. 2 shows 
the typical recency effect for C lists: 
a very high level of recall of the last 
input word, a steeply descending curve 
over the last four words, and a gradual 
leveling off of the curve thereafter. 
The recency effect for M lists is rather 
similar in shape to that of C lists over 
the last six input positions, although 
R words in Input Positions L-6-L-9 
in M lists were recalled at a much 
higher level. 

The somewhat lower level of the 
recency effect for M lists than for C 
lists is accounted for by those Ss who 
started recall of M lists with one of 
the R words. This happened on 30 
out of 240 (4 Lists x 60 Ss) trials. 
The mean number of words recalled 
from Input Positions L-3-L for 240 
trials in case of C lists was 3.12, and 
for 210 trials on which S did not first 
recall an R word in case of M lists 
the mean was also 3.12. 

In E lists the last four input words 
were always R words. With the pos- 
sible exception of the R word in Posi- 
tion L-3, these words were recalled 
almost perfectly. "Thus, the recall of 
the words from the last four input 
positions is greater in E lists than in 
either C or M lists. It is important 
to note, however, that the recency 
effect in E lists does not appear to ex- 
tend over a greater range of input 
serial positions than it does in C or 
M lists. The differences in the recall 
of words from Input Positions L-4- 
L-7 between M and C lists appear 
negligible. Theoretical implications of 
this finding will be discussed later in 
this paper. 

The serial-position curves for D lists 
agreed quite well with those of C lists, 
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Fic. 3. Serial position curves for C and 
D lists of length 24. 


with the exception that the R words 
in D lists were always recalled more 
frequently than the U words in cor- 
responding serial positions in C lists. 
Since the R words in D lists of dif- 
ferent lengths did not occupy corre- 
sponding serial positions, the data can- 
not be summarized in the same way 
as was done for M and E lists. In- 
stead, a single example is presented in 
Fig. 3, which shows the full serial- 
position curves for Lists C-24 and 
D-24. 

The probability of recall of R words 
in the D list varied inversely with the 
order of appearance of these words 
in the list, the first of the four being 
recalled at the lowest level and the last 
at the highest level. The same rela- 
tion held also in D lists of lengths 16 
and 20, but not in List D-12. 

Recall of R and U words from pri- 
mary and secondary memory.—The 
next step in the analysis of the data in- 
volved cross classification of recalled 
words in terms of their type, R and 
U, and in terms of their input position, 
terminal words, (Input Positions L-3- 
L) and preterminal words (Input 
Positions 1-L-4). The words in ter- 
minal positions are referred to as 
words recalled from primary memory 
(PM), in keeping with Waugh and 
Norman’s (1965) terminology, and to 
the recall of these words as PM recall. 
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The words in preterminal positions are 
referred to as words recalled from 
Secondary memory (SM) and to the 
recall of these words as SM recall. 
This usage of the terms PM and 
SM seems to be justified in that a 
large majority of terminal words were 
recalled early in the output sequence. 
Of all the terminal words Ss recalled 
in the whole experiment, 87% were 
recalled in Output Positions 1-4, this 
percentage being highest for C lists 
(89%) and lowest for D lists (84%). 

The cross classification of recalled 
words in terms of the type of words, 
R and U, and the type of recall, PM 
and SM, defines four additive com- 
ponents of recall: R-PM (read R 
words recalled from PM), U-PM, R- 
SM, and U-SM. Only recall of D 
lists provided estimates for all four 
components, however. In recall of 
M lists, the R-PM component was 
missing, and in recall of both C and 
E lists, two components did not occur 


LIST LENGTH 


Fic. 4. Mean number of recalled words 
of four different types as a function of 
list length. 
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because of the composition of those 
lists. 

The results of the analysis of the 
recall data in terms of these four pos- 
sible components of recall are shown 
in Fig. 4. Each data-point represents 
the mean number of words of a given 
class recalled by 60 Ss. The data for 
D-12 lists are omitted from Fig. 4, 
because, as stated earlier, the composi- 
tion of that list differed from that of 
other D lists. In the D-12 lists the 
last four input positions included two 
R words, while in all other D lists 
they included only one R word. 

Figure 4 reveals a rather interesting 
feature of the data. In each type of 
list there is one and only one com- 
ponent of recall that varies monotoni- 
cally and approximately linearly with 
list length. In all four types of the list 
that component is the same, namely 
U words recalled from secondary 
memory. The three other compon- 
ents—R-PM, U-PM, and R-SM—ap- 
pear to remain relatively invariant 
with changes in list length. Thus, PM 
recall is independent of list length, re- 
gardless of whether the PM words are 
R words or U words, and so is the 
SM recall of R words. 

Recall of functional units—The 
final analysis of the data was under- 
taken in the light of the assumption 
that when an S recalls one or more 
highly related words, such as the R 
words used in this experiment, he in 
fact recalls a single functional unit. 
The number of functional units in- 
volved in the recall of U words, of 
course, cannot be estimated, but since 
unitization of U words probably oc- 
curred to the same extent in lists of all 
types, this uncertainty should not in- 
validate comparisons across list types. 

Thus, Ss recall protocols were re- 
scored in terms of the number of func- 
tional units recalled. The S was given 
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credit for the recall of a functional 
unit when he recalled at least one R 
word from a given list or one U word. 
For example, an S who recalled three 
R words and five U words from a list 
received credit for the recall of six 
functional units. To the extent that 
unitization among U words took place, 
this method overestimates the number 
of recalled functional units, but this 
possible error presumably affects all 
list types more or less equally, And 
our main interest lay in the compari- 
son of the number of functional units 
for different types of list. 

This analysis was not applied to D 
lists, because the assumption about 
unitization of all R words in a list 
did not seem justifiable. While R 
words from E and M lists were invari- 
ably recalled as a group, R words 
from D lists were recalled in noncon- 
tiguous output positions frequently 
enough to cast doubt on the assump- 
tion that they were always processed 
as members of the same higher-order 
functional unit. 

For the other three types of lists— 
C, M, and E—the data on the recall 
of functional units as defined above 
were pooled over lists of length 12-24. 
The mean number of functional units 
recalled was 6.30 for C lists, 6.25 for 
M lists, and 4.43 for E lists. Thus, 
the mean number of functional units 
is virtually identical for C and M 
lists, and considerably lower in E lists 
than in C and M lists. The # test 
comparing the means of E and C lists 
yielded a t (59) of 11, p < 01. 


Discussion 


Contiguously presented related words 
seem to operate as a single functional 
unit of memory in free recall if they 
occur in the midddle of the input list, 
that is, if they are retrieved from secon- 
dary memory (Waugh & Norman, 1965) 
or from the long-term store (Atkinson 
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& Shifrin, 1967). The number of 
words by which recall from M lists ex- 
ceeded the recall from C lists rather pre- 
cisely matched the number of related 
words that a typical S recalled in addi- 
tion to the first related word he recalled. 
When all related words recalled by an 
S from a given M list were considered 
as constituting a memory unit equivalent 
to that represented by a single unrelated 
word recalled by S, the number of such 
units recalled was identical for M and 
C lists. Thus, the number of functional 
units of memory that can be retrieved 
under the conditions of free recall seems 
to be independent of the size of the units. 
Recall of constituent elements of a 
higher-order functional unit appears to 
occur without any cost to the capacity 
to retrieve independent units as such. 
The finding of independence between 
size of the functional unit and the num- 
ber of functional units that can be re- 
called complements the earlier finding 
reported by Tulving and Pearlstone 
(1966). In that experiment, the num- 
ber of functional units (accessible con- 
ceptual categories of words) was greater 
for lists of 48 words than for lists of 
24 words, and greater under the condi- 
tions of cued than under the conditions 
of noncued recall, but the size of recalled 
units was invariant with these two vari- 
ables. Variations in the number of 
higher-order functional units probably 
reflect the consequences of subjective 
organization of these units into still 
higher-order units, in the manner of 
hierarchical organization described by 
Mandler . (1967), although for the time 
being this suggestion still represents 
only a conjecture. The important point, 
however, is that recall within a higher- 
order unit is independent of the number 
of such units that are recalled. 
Unitization of the kind observed in 
M lists did not occur to the same extent 
in E lists in which the related words 
occupied four terminal input positions, 
The Ss memorizing E lists recalled fewer 
words in addition to the group of four 
related words than they recalled in addi- 
tion to a single U word in C lists. 
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Furthermore, the range of input positions 
over which the recency effect was ob- 
served in E lists was no more extended 
than it was in C lists (Fig. 2). Some 
unitization of R words in E lists probably 
did occur, as indicated by the superior 
recall of E lists when compared with C 
lists, but the extent of this unitization 
was small, Only if it is assumed that the 
recall of four R words from E lists was 
approximately equivalent to the recall 
of three functional units, would the num- 
ber of functional units recalled from E 
lists approximate the number of func- 
tional units recalled from C lists: 6.43 
and 6.30 for E and C lists, respectively, 
with the data averaged over lists of 
length 12, 16, 20, and 24. Thus, it ap- 
pears that related words in terminal 
input positions are handled more like 
individual words than members of a 
higher-order unit. 

Some unitization of related words also 
occurred in, and apparently facilitated 
recall of, D lists, as shown by the higher 
recall of words from these lists in com- 
parison with C lists, but again the ex- 
tent of this facilitation was less than that 
Observed in the case of M lists. Whether 
this difference is attributable to the pre- 
sentation of the related words in D lists 
in noncontiguous input positions, to the 
fact that some of the related words oc- 
curred in preterminal positions and some 
in terminal positions, or to both factors, 
cannot be determined from the existing 
data. 

The present results do not seem to be 
readily reconcilable with models of free 
recall which distinguish between two 
kinds of memory as representing sepa- 
rate memory stores (eg., Atkinson & 
Shiffrin, 1967; Glanzer & Cunitz, 1966: 
Glanzer & Meinzer, 1967 ; Waugh & Nor- 
man, 1965). If items can be stored in 
secondary memory (SM) or long-term 
store (LTS) only through their transfer 
from primary memory (PM) or short- 
term store (STS), and if only some 
items are so transferred, how is the 
high level of recall of related words in 
M lists and their unitization in SM or 
LTS to be explained? In answering 
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this question one must assume either | 
that related words become unitized al- 
ready in PM or STS, or that they are 
not unitized in PM or STS and become 
unitized in SM or LTS. Adoption of 
the former position would create the 
problems of explaining why recall of 
words was less in E lists than in M lists 
and why recall of units was. lower in E 
lists than in C lists. It would also re- 
quire the explanation of why the recency 
effect was not extended over a larger 
number of terminal input positions in 
E lists than in C lists and M lists, 
Adoption of the latter position, on: 
the other hand, makes it necessary to 
explain how related words not unitized. 
in PM or STS are, almost without 
exception, transferred into, and how they 
then become unitized in, SM or LTS. 
In coping with this problem, one might 
perhaps argue that the probability of re- 
call of a unit is proportional to the total 
presentation time for that unit (Waugh, 
1967) and that the related words were d 
recalled at a higher level of probability 
because the unit of which they were 
members was presented for a time four 
times longer than that of a single unre- 
lated word. The simple proportional 
relation between presentation time and 
probability of recall, however, .did not 
hold under the present conditions. 
Because of these apparent difficulties 
with models postulating two ' separate 
stores, and at least until such time that 
the difficulties are cleared up, the authors 
prefer to identify PM and SM with differ- 
ent types of retrieval mechanism rather 
than with different types of store (Tul-. 
ving, 1966a, 1968). It is assumed that 
every stimulus word in the list, or, more 
precisely, information about the fact that 
the word occurred in the list, is placed 
into a unitary store in a coded form. 
Coding refers to storage of additional 
information with each to-be-remembered 
word at the time of input (Tulving, 
1966a, 1968; Tulving & Osler, 1968). 
This additional information serves as’ a 
source of retrieval cues that provide 
access to the information about the to- 
be-remembered words. Certain types of 
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retrieval cues can be manipulated experi- 
mentally (Earhard, 1967; Tulving & 
Osler, 1968; Tulving &  Pearlstone, 
1966), but some kinds of retrieval cues 
must be operating even if E is not aware 
what they are and even if he has no 
direct control over them. The explana- 
tion of differential accessibility of re- 
lated and unrelated words, and of words 
from different parts of the input list, 
therefore, must be sought in differential 
effectiveness of different kinds of re- 
trieval cues. 

The terminal list items, for instance, 


. may be stored with information about 


their distinctive input positions or their 
distinctive time tags (Yntema & Trask, 
1963), and this additional information 
may serve as a retrieval cue. The S 
knows already before the presentation of 
a list that some items will occur in the 
last few’ input positions and hence this 
type of retrieval cue itself is always 
accessible to him. Retrieval of pre- 
terminal items, on the other hand, may 
be mediated by LisT as a retrieval cue. 
List refers to a given collection of items, 
differentiated from other possible lists 
in terms of its own temporal date or 
demarcation. If the items in the list, 
at the time of their presentation, can be 
readily classified by S into two distinc- 
tive categories, such as “related words" 
(eg. color names—BLUE, GREEN, YEL- 
Low, RED) and "other words,” LIST can 
serve as a general retrieval cue for 
related words and other words which 
then in turn serve as more specific re- 
trieval cues for words’ associated with 
the two categories, in keeping with the 
characteristics of “hierarchical organiza- 
tion (Mandler, 1967). Since it is known 
that probability of recall of an item 
associated with a given retrieval cue 
varies inversely with the number of items 
associated with the same cue (Earhard, 
1967; Tulving & Pearlstone, 1966), and 
since the potency of the retrieval cue 
varies directly with the strength of pre- 
experimental association between it and 
its associated words, the high probability 
of recall of related as compared with un- 
related words is to be expected. 


Although, at the level of experimental 
operations, S’s task in an FR experi- 
ment can be viewed as one of recall of 
individual items, retrieval of stored in- 
formation must be conceptualized, at the 
theoretical level, as always involving 
mediation by retrieval cues. The present 
speculations as to the nature of retrieval 
cues in free recall may be woefully in- 
adequate, but they are amenable to ex- 
perimental tests. At the very least, it 
seems clear that the identification of 
functional units of memory and the re- 
trieval cues which provide access to these 
units in the memory store under the 
conditions of free recall constitutes a 
pressing theoretical problem. In the 
long run, nothing much can be gained 
by postulating a homunculus searching 
through one or more types of memory 
store for desired mnemonic information. 
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Previous studies have found in a decision-making task that Ss are 
less than optimally responsive to nonsymmetrical, biased payoffs. The 
hypothesis that Ss attribute a utility to being correct was compared 
here with the hypothesis that they maintain a constant critical odds 
regardless of the degree of bias in the payoffs. 3 groups of 10 Ss 
each made decisions under symmetrical payoffs and under payoffs 
with 2? of bias. 2 groups used a nonsequential procedure, making 
decisions only after the presentation of all items of information. 
The 3rd group revised decisions after each item of information. The 
utility hypothesis received mild support; the constant critical-odds 
hypothesis was rejected. 2 strong sequential effects were observed 
that accounted for 81% of nonoptimal responses under unbiased 


payoffs for the 3rd group. 


In a study of the optimality of be- 
havior in a simple decision-making 
task, Pitz and Downing (1967) ob- 
served largely optimal behavior under 
conditions of symmetric or unbiased 
payoffs. A choice between two hy- 
pothetical “states of nature" was based 
upon stimulus information generated 
by one of the hypothetical states. With 
nonsymmetrical payoff matrixes gener- 
ating an a priori bias in favor of one 
response, Pitz and Downing observed 
that the optimality of behavior was 
considerably reduced. Their Ss were 
apparently unwilling to make responses 
that, while having a lower probability 
of being correct, nevertheless had a 
larger expected value. A suggested 
explanation was that Ss attribute a 
Sizeable utility to being correct over 
and above the monetary payoffs in- 
volved in the decision, which would 
lead to a less than optimal biasing of 
response tendencies. A study of a per- 
ceptual decision task by Ulehla (1966) 
produced similar results, in that the 
biasing of Ss’ responses was less than 
optimal. Ulehla also suggested the 
possibility of a utility for being cor- 
rect that would account for this de- 
parture from optimality. 


The present study is a reexamina- 
tion of the Pitz and Downing findings 
and a test of the utility hypothesis. 
Performance was examined under two 
nonsymmetric payoff matrixes of dif- 
fering degrees of bias. If Ss possess 
a constant utility for being correct, it 
should be possible to estimate this util- 
ity separately for both sets of payoffs. 
Identical estimates under both condi- 
tions would be evidence in favor of the 
hypothesis, whereas widely divergent 
estimates would imply that some other 
explanation should be sought. 

In order to maximize the expected 
value of decisions, one may compute 
the odds, Q, in favor of any one hy- 
pothesis by means of Bayes' theorem. 
For a two-hypothesis task, in which 
H, and H, are the two hypotheses, a 
cutoff point or critical odds, Q*, is 
determined by characteristics of the 
payoff matrix. That is, È 


(42) — V(1,2) 
"yüin-von Ld 


where V (i,j) is the payoff for making 
response i when hypothesis j is correct. 
Whenever © is greater than Q*, S 
should choose H,, and when Q is less 
than Q *, S should choose H,. When 


Q* 
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each hypothetical state is equally likely, 
Q is equal to the likelihood ratio, a 
ratio of probabilities that is a sufficient 
summary of the relevant stimulus in- 
formation (see Pitz & Downing, 
1967). 

For each S one may plot the pro- 
portion of H, choices for each value 
of the likelihood ratio describing the 
stimulus information. The likelihood 
ratio for which the proportion of re- 
sponses is .5 may be taken as an 
estimate of the subjective critical odds, 
referred to here as A, the point at 
which S is indifferent between 
choosing H, and H,. From the value 
of A it is possible to estimate the hy- 
pothetical utility for being correct in 
the following way: If to the values 
V (1, 1) and V (2, 2) an unknown util- 
ity U is added, A may be assumed to 
be given by the following equation, 


y = (V2) + V) = V(12) 
~ (VG) + 0) = Vay 


It is clear from the form of this equa- 
tion that if U is positive, A is closer 
to 1 than is Q *. Equation 2 may be 
solved for U to give 


[2] 


U= (4) [(Y(22) — v(1,2)) 


EXXCV (2) — V (Q2:1))- SES 


One could then use the value of U 
computed from Equation 3 for one 
set of payoffs to predict performance 
under another set by means of Equa- 
tion 2. A problem with the use of 
Equation 3, however, is that estimates 
of A may occasionally be larger than 
1 when Q* is less than 1, or vice 
versa. In this case there will be a 
large negative value of U; hence, a 
comparison of mean values of U would 
be impossible. This awkward result is 
avoided, without changing the basis 
for the analysis, by carrying out analy- 
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ses with predicted and estimated values 
of À. 

An alternative hypothesis is that Ss 
ignore the nature of the payoffs, except 
for noting the direction of the bias. 
This would imply a constant cutoff or 
Critical odds rather than a constant 
value of U. When using Equation 3 
for one set of payoffs, then Equation 2 
to predict A under a different set of 
payoffs, a constant A hypothesis implies 
that such predictions would show a 
systematic error. The extent of this 
error may be found by using the same 
value of à for computing U by means 
of Equation 3 as that value which is to 
to be predicted by Equation 2. The 
error in prediction implied by a con- 
stant critical odds is an overprediction 
of A when A < 1, and underprediction 
when A> 1. 

The task employed by Pitz and 
Downing was a nonsequential decision 
task in which all information was pre- 
sented to Ss prior to their decision. 
However, using a sequential decision- 
making procedure, it is possible to 
evaluate the stability of U and A at 
different stages of the decision-making 
process. In a sequential task, Ss are 
presented with events in a series, and 
revise their decisions following each 
informative event. Both the utility 
and constant-critical-odds hypotheses 
would predict similar estimates of U, 
and hence of A, at each stage. 

The importance of studying sequen- 
tial and other types of dynamic de- 
cision-making has been emphasized by 
Edwards (1962). In the present 
study an attempt was made to compare 
performance in the nonsequential case 
with performance when Ss had an op- 
portunity to revise their decisions as 
information was accumulated. There 
are in the literature several studies of 
the subjective revision of opinion (e. g. 
Phillips & Edwards, 1966), but these 
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studies typically have used as a de- 
pendent variable probability estimates 
or confidence judgments. A simple 
binary choice provides less informa- 
tion, but has advantages as a. depend- 
ent variable over a continuous esti- 
mation procedure. The task is more 
akin to typical decisions, and, per- 
haps for this reason, it is likely that 
Ss understand the task better than they 
understand probability estimation. 
Hence behavior is less likely to be 
affected by irrelevant task character- 
istics. 


METHOD 


Subjects —Three groups of 10 Ss each 
were employed. All were male volunteers 
from an introductoy psychology course, who 
were told that they would earn from $4 to 
$8, depending upon their performance, for 
serving for approximately 3 hr. 

Procedure—A two-alternative decision 
task was employed in which the hypothetical 
states were represented by two bags, each 
containing 100 chips. Bag 1 contained 60 
red and 40 blue chips; Bag 2 contained 60 
blue and 40 red chips. On each trial, one 
bag was chosen at random by E and 5 
chips were drawn from the chosen bag with 
replacement after each draw. The results 
of the five draws were presented, either 
sequentially or simultaneously ; and Ss were 
required to decide whether the chosen bag 
was Bag 1 or Bag 2. 

Stimulus sequences were preprogrammed 
in accordance with the probabilities specified 
by the task, rather than using actual bags 
and chips. For each trial a hypothesis was 
chosen at random, and red and blue chips 
were randomly selected according to the 
probabilities specified by the chosen bag. 
No attempt was made to maintain the fre- 
quencies of events within any degree of 
approximation to expected values, but since 
a large number of trials was used, obtained 
frequencies were all close to expected fre- 
quencies. The sequences were programmed 
on punched paper tape, and information was 
presented to Ss by means of two counters 
(Industrial Electronics Series 10,000 read- 
outs), one counting red chips and the other 
counting blue chips. 

Group 1 Ss were presented with all rele- 
vant information simultaneously. At the 
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TABLE 1 
Pavorr MATRIXES USED, WITH VALUES 
IN CENTS 
Nonsequential Sequential 
KeA Groups Group 
Bagi | Bag2 | Bagi | Bag2 
Mo 
Bag 1 2 0 0.5 0 
Bag 2 0 2 0 0.5 
1 
Bag 1 3 —1 1 —0.33 
Bag 2 -1 0 | —0.33 0 
2 
Bag 1 4 -1 1 —0.25 
Bag 2 -4 0 | -—1 0 


Note.—Negative values indicate losses. 


start of each trial the counters registered 
the results of all five draws from the chosen 
bag. The Ss were allowed as much time 
as they wished to make a decision, and fol- 
lowing each decision the results of another 
five draws were presented. 

Group 2 Ss were presented with informa- 
tion sequentially, in that the results of the 
five draws were registered on the counters 
one at a time at intervals of .7 sec. Other- 
wise the procedure was equivalent to that 
used with Group 1. A comparison of the 
performances of Group 1 Ss and Group 2 
Ss provided a measure of the effects of se- 
quential versus nonsequential presentation 
of information. 

Group 3 Ss were presented with informa- 
tion sequentially, as was the case with 
Group 2. However, Ss were required to 
make an initial decision following the first 
item of information and to revise this de- 
cision following each of the four other items. 
No time limits were imposed upon S’s 
responses, and information was presented .7 
sec. after each decision. In comparison with 
Group 2, Group 3 provided information on 
the effects of sequential decision-making 
unconfounded by differences in the presenta- 
tion of information. 

A total of 360 trials was presented in six 
sequences of 60 trials each, with brief rest 
periods between sequences. For each pair 
of sequences, or 120 trials, a sepecified pay- 
off matrix was employed which was dis- 
played and explained to S. The three pay- 
off matrixes are shown in Table 1, in which 
positive numbers indicate gains and nega- 
tive numbers indicate losses. Mo was a 
symmetric matrix, in the sense that each 
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response was rewarded equally when cor- 
rect and punished equally when incorrect, 
Mi was a nonsymmetric matrix biased in 
favor of Bag 1; Ms was even more biased 
in favor of Bag 1. Since Ss in Group 3 
made more total decisions than did Ss in 
other groups, payoffs were made smaller 
for Group 3, so as to make average winnings 
per hour roughly equal for all groups. The 
degrees of bias, however, were the same 
for all groups. 

If r is the number of red chips drawn 
Írom the chosen bag and b is the number of 
blue chips, the likelihood ratio, L, following 
any number of draws, is given by Equation 
4. It is clear from this equation that L 
depends solely upon the difference between 
the number of red and blue chips sampled. 


LPO|H) foy 
L-pj (a) t 


For Mo the critical odds, Q*, was 1. For 
M: Q*—25 and for M, Q*=.125 
Since it was assumed that each hypothesis 


+—b>0 for Mo, when r—b>—4 for M, 
and when r—b>—6 for M; 

All Ss gave their responses verbally, and 
at the end of a trial they were told whether 
they were correct or incorrect. Decisions 
during the first 120 trials were made with 
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SAMPLED 


a function of r for Groups 1 and 2 and for 
(Proportions for Group 3 were computed from the final decisions on each trial). 


the symmetric payoff matrix M»; however, 
the order in which M, and M, were used, 
following Mo, was randomized. The order 
of presentation of the six stimulus sequences, 
and hence the Pairing of sequences with pay- 
offs, was also randomized independently for 
each S. 


RESULTS 


Inspection of the data indicated no 
differences between Groups 1 and 2, a 
result confirmed by an analysis of the 
proportion of Bag-1 responses for each 
group. The proportion of Bag-1 re- 
sponses for each value of r, where r 
is the number of red chips in each 
sample, is shown in Fig. 1 for Groups 
l and 2 combined and for Group 3. 
Proportions for Group 3 were taken 
from the final decision on each trial. 
The results are in complete agreement 
with the findings of Pitz and Downing 
(1967), in that there was a high degree 
of optimal responding as defined above 
under unbiased payoff conditions, Mo, 
and a biasing of responses that was in 
the correct direction, but much less 
than optimal, for M, and M, Con- 
sistent differences between M, and M, 
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argue against a constant A hypothesis; 
in other words, Ss were responsive to 
the degree of bias in the payoffs. The 
fact that the effects of biased payoffs 
were noticeably less for Group 3 than 
for Groups 1 and 2 appears to be en- 
tirely a result of smaller absolute pay- 
offs for Group 3, a finding which will 
be discussed below. Under payoffs 
M,, there were no significant differ- 
ences among groups. 

Payoff effects.—Using Equation 3, 
estimates of the utility U were ob- 
tained for all groups under payoff 
condition, M,. In Group 3, Ss made 
five decisions during each trial, and 
estimates of U were made separately 
for the last four decisions, following 
2, 3, 4, or 5 items of information. In 
each case, the method used to estimate 
the cutoff point A, or the subjective 
critical odds, was that outlined above. 
For each S, the proportion of Bag-1 
responses was obtained for each value 
of r, and plotted against the likelihood 
ratio that was given by Equation 4. 
It was assumed that A was the value of 
the likelihood ratio for which the pro- 
portion was .5, implying indifference 
between the two alternatives; and this 
was found by linear interpolation be- 
tween the two proportions which 
straddled .5. 

The test of the utility hypothesis was 
to use estimates of U obtained under 
condition M, to predict values of A 
under M, by means of Equation 2. 
The results are shown in Fig. 2. Esti- 
mates of A for Ss in Group 3 were 
generally closer to 1 than for Ss in the 
other two groups, a result of the 
smaller bias in response tendencies for 
Group 3 under biased payoffs. This is 
to be expected from Equation 2 since 
the absolute values in the payoff ma- 
trixes were smaller for Group 3 than 
for the nonsequential groups, and the 
Same utility for being correct would 


have more effect upon A for Group 3. 
That is, if U is constant, A will be closer 
to 1, the smaller the other values in 
Equation 2. An intraclass correlation 
coefficient (Winer, 1962, pp. 124-132) 
was used to evaluate the consistency or 
reliability of a single value of A; the 
correlation was .27 when measured 
across different payoff conditions. 
Using a similar method to estimate 
the reliability of estimates for Ss in 
Group 3 following differing numbers 
of informative events, the intraclass 
correlation for a single estimate of A 
was .29. Neither of these reliabilities 
is very large. 

Using Equations 2 and 3, it is pos- 
sible to illustrate in Fig. 2 the predic- 
tions from a constant-critical-odds hy- 
pothesis. The curve is the result of 
using the value of A for M, together 
with payoffs M,, to predict A. In other 
words, A should be unchanged by vary- 
ing the payoffs. Although the results 
for some Ss were better described by 
this hypothesis, the utility hypothesis 
is a better description of the mean re- 
sults for all Ss. In fact, for Groups 1 
and 2, in 16 cases out of 20 the A 


© Group 1S3 
© Group 2 Ss 


O Group 3 55 


— Utility Hypothesis 
— — Constant À Hypothesis, Groups | ond 2 
—+— constant À Hypothesis, Group 3 


CRITICAL ODDS PREDICTED FROM M, 


E! 6 8 1.0 
SUBJECTIVE CRITICAL ODDS, M, 


Fic. 2. Scatter plot of predicted and ob- 
tained values of A for payoff matrix Mi. 
(The data for one S in Group 3 is not 
plotted; predicted and obtained values for 
this S were 1.77 and 1.16, respectively.) 


254 


TABLE 2 


PRoportIoNns or BAc-1 RESPONSES IN 
Group 3 FOLLOWING THE "TH 
EvENT, GIVEN THE PRECED- 

ING STIMULUS AND 
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RESPONSE 
Nae Blue Chip Red Chip 
of Red 
Chis | Bat | mag2 | Bagt Bag2 
n=2 
0 24 .01 — — 
1 .58 34 .86 -73 
2 — VE 97 68 
n=3 
0 56 .01 — — 
1 42 10 .79 30 
2 81 .39 95 75 
3 = — 99 74 
n=4 
0 50 01 => — 
1 36 .03 .60 19 
2 57 .20 .89 69 
3 .92 42 .99 12 
4 — — .99 34 
n=5 
0 33 .02 — — 
1 .39 .03 70 09 
2 44 .07 18 40 
3 19 34 .92 79 
+ 93 .00 97 53 
5 = — 1.00 75 


estimated by Equation 2 was more ex- 
treme for M, than for M,, as would be 
predicted by the utility hypothesis. 
These Ss show up in Fig. 2 as points 
further from 1 than predicted by the 
dashed line. This fact leads to rejec- 
tion of the constant A hypothesis at 
the .01 level by a one-tailed binomial 
test. By a similar test, it is not pos- 
sible to reject the utility hypothesis. 
Again, it appears that Ss did respond 
to the degree of bias in the payoffs. 
For Ss in Group 3, neither hypothesis 


could be rejected, although for Group 
3 there is less difference in predictions 
from the two hypotheses. Differen 
in the order in which M, and M, were 
used were quite unrelated to differ- 
ences in estimates of U or A. 

Sequential effects.—The term “se- 
quential effect" has commonly been 
used to refer to two different aspects 
of behavior. On the one hand, de- 
cisions made on one trial may be 
affected by the decisions and outcomes 
of previous trials. On the other hand, 
for Ss in a sequential task, revisions - 
of decisions within a trial are affected 
by preceding responses and stimulus 
events. In the Pitz and Downing 
study, sequential effects of the first 
kind were reported, the strongest being 
a negative recency effect that occurred - 
following incorrect decisions by S, but. 
which was not evident following cor- 
rect decisions. In the present study 
similar effects were observed, but they 
were small and by no means univer- 
sally significant. 

Some important sequential effects 
of the second variety are indicated in 
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PROPORTION OF OPTIMAL RESPONSES 


Ue 1 2 3 4 5 
DIFFERENCE BETWEEN RED 
AND BLUE CHIPS 
Fic. 3. Proportion of optimal responses 
by Group 3 Ss as a function of the differ- 
ence between red and blue chips sampled 
at different values of n. - 
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Table 2. The table breaks down the 
proportion of Bag-1 responses, aver- 
aged across Ss in Group 3 and across 
all trials and payoff conditions. Since 
by Equation 4 the likelihood ratio was 
constant for given values of r and m, 
proportions within a given row were 
obtained under equivalent objective 
conditions. The four columns contain 
response proportions following the 
occurrence of a blue chip or a red chip, 
and following a previous choice on that 
trial of Pag 1 or Bag 2. Two strong 
and statistically significant effects may 
be observed in the table. There was 
a tendency for Ss to match their re- 
sponses to the previous stimulus 
events, independent of objective likeli- 
hood ratios, and to repeat their pre- 
vious responses. Entries in Table 2 
match this pattern without exception, 
and very rarely did proportions for 
individual Ss fail to show these tend- 
encies. On any one occasion, of 
course, these two tendencies must 
either conflict or reinforce each other, 
and their independent effects can only 
be separated statistically. 

Both of these tendencies may pro- 
duce nonoptimal responses, responses 
which are inappropriate in view of 
the likelihood ratio. In fact, some 
81% of nonoptimal responses under 
unbiased payoffs were a result of one 
or both of the effects. One might ex- 
pect, therefore, that the level of opti- 
mality of responding would be greater 
early in a sequence than it was later, 
there being less opportunity for se- 
quential effects to occur early. Figure 
3 contains information supporting this 
assumption, showing a greater opti- 
mality of responding early in a se- 
quence of information for payoffs M, 
only. From these data, there are four 
independent comparisons that may be 
made to test the null hypothesis that 
the probability of optimal responses 
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being more frequent for smaller n is 
5. The number of Ss in Group 3 who 
exhibited each frequency, from zero 
through four, of differences in the pre- 
dicted direction, was compared with 
the expected number under the null 
hypothesis. A x? test of goodness of 
fit gave x? (4) = 21.06, p < 01. For 
comparison purposes, Fig. 3 shows the 
data for Groups 1 and 2 under M,. 
Although differences in optimality 
for Group 3 following different n were 
significant for the unbiased payoffs, the 
effect disappeared for M, and M, 
The data in Table 2, however, were 
consistent across payoffs. 


DISCUSSION 


The major findings of the Pitz and 
Downing study (1967) were replicated. 
Performance under symmetrical payoffs 
was quite optimal, while biasing of re- 
sponses under nonsymmetrical payoffs 
was considerably less than optimal. The 
hypothesis that Ss possess a utility for 
being correct that is responsible for the 
failure to respond appropriately to payoffs 
received mild support. It is clear from 
the variability around the line of unit 
slope in Fig. 2 that other variables must 
be involved, variables which are difficult 
to identify from a single binary decision. 
The intraclass correlation coefficient be- 
tween payoff conditions for Groups 1 and 
2 was very similar to the correlation ob- 
tained from Group 3 within trials. It is 
likely, therefore, that the variance which 
is unexplained by the utility hypothesis 
must be accounted for by relatively 
transient within-trial effects. The vari- 
ability itself may be of interest, since it 
implies that in a signal detection experi- 
ment, eg, Ss’ criterion odds may be un- 
stable within conditions. Most of the 
data indicated that, in spite of the vari- 
ability, Ss are responsive not only to the 
direction of bias in the payoffs, but also 
to its magnitude. 

Other data which exhibit a less than 
optimal biasing of response tendencies 
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may be found in a signal detection experi- 
ment recently reported by Galanter and 
Holman (1967). They accounted for per- 
formance by suggesting that the utility of 
money is a power function. In support of 
this hypothesis, and contradictory to the 
hypothesis of a constant utility for being 
correct, they found that the multiplica- 
tion of the values in a payoff matrix by a 
constant produced no shift in perform- 
ance. However, in order to account for 
the performance of their two Ss, it would 
be necessary to assume utility functions 
with exponents of less than .5. No other 
study of utility has observed such a 
markedly nonlinear utility function, at 
least for monetary values less than 
$1.00. Such a scale would imply, e.g., 
that 4¢ is less than twice as valuable as 
le, a conclusion that is intuitively unap- 
pealing. A further argument against the 
use of a power function to explain Galan- 
ter and Holman's data is that the exponent 
would apparently have to be a function 
of the degree of bias shown by the payoffs, 
being larger for their more highly biased 
payoffs.? 

The data of Table 2 may account in 
large part for nonoptimal responding, at 
least for the sequential group, under un- 
biased payoff conditions. In a recent 
study of probability estimation, Peterson 
and DuCharme (1967) observed what 
they referred to as a primacy effect: In 
revising their probability estimates, Ss 
were apparently more influenced by events 
early in a sequence than by later events. 
However, Table 2 implies that Ss were 
influenced more by recent events than by 
earlier events, in that the proportion of 
Bag-1 responses following a red chip was 
larger than the proportion following a 
blue chip, ceteris paribus. While these 
data may appear to conflict with the re- 
sults of Peterson and DuCharme, Table 2 
also indicates that Ss had a strong tend- 
ency to repeat their previous responses, 
independently of the stimulus event. It 
might well be that the Peterson and Du- 


2 The estimates of power function expon- 
ents were obtained from the data reported 
by Galanter and Holman (1967) in their 
Fig. 1 and 2. 
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Charme primacy effect reflects a tendency 
to maintain one's previous response, re- 
gardless of the information, rather than to 
react more strongly to early events. 

The existence of sequential effects of 
any form in decision making implies that 
Ss’ performance is a function not only of 
the previously presented information, but 
of trial-to-trial dependencies. Any attempt 
to describe decision making within a 
Bayesian framework must take account of 
such effects. The existence of sequential 
effects in a signal detection task has 
been recognized, and has been the sub- 
ject of theoretical treatment by, e.g., At- 
kinson (1963) and Schoeffler (1965). Be- 
tween-decision sequential dependencies re- 
semble those observed in binary choice 
experiments but have less effect upon re- 
sponses in a decision-making task. This 
finding itself is important, since it sug- 
gests that Ss do pay attention to relevant 
information when it is available. The 
matching of responses to recent stimulus 
events might be handled by a modification 
of Bayes' theorem which incorporates a 
decay of early events.? This would as- 
sume that Ss place too little importance 
upon early events in a sequence. On the 
other hand, the repetition of previous re- 
sponses is very reminiscent of the effect 
of commitment upon resistance to opinion 
change (Brehm & Cohen, 1962), and is 
difficult to handle within a Bayesian model. 

It is, unfortunately, impossible to deter- 
mine to what extent the sequential effects 
observed with Group 3 were the result of 
the sequential presentation of information, 
the overt revision of decisions, or both. 
The only significant difference between 
groups in the presnt study was clearly 
the result of the smaller absolute payoffs 
used with Group 3, but large individual 
differences made the power of between- 
group significance tests very low. It ap- 
pears that for further investigation of the 
source of sequential effects, an independent 
groups design is inappropriate. 


3 This has been suggested by J. D. Dodson 
in PRC Report R-194 (1961) of the Plan- 
ning Research Corporation Los Angeles, 
California. 
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RESPONSE PRETRAINING AND SUBSEQUENT 
PAIRED-ASSOCIATE LEARNING? 
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3 experiments, involving 232 Ss, examined the effect of pretraining 
Ss on items which were subsequently used as responses (R) in paired 
associates. Paired-associates learning involved force availability of the 
Rs and unlimited time for response. Experiment I and II examined the 
effects of pretraining adjectives. Lists of both high- and low-intralist 
R similarity were employed. Unlike the Saltz (1961) study which em- 
ployed a 4-sec. anticipation interval, pretraining had no effect on sub- 
sequent paired associates. Experiment III found a significant effect 
, of pretraining nonsense syllables, p > .05, despite the forced availability 
technique. The relation of the data to R availability and R differentia- 


tion theories was discussed. 


The present set of experiments is 
concerned with the mechanisms re- 
lating response pretraining to sub- 
sequent ^ paired-associates ^ learning 
(PAL). 

Underwood, Runquist, and Schulz 
(1959) showed that having Ss learn 
a set of adjectives, in a free recall 
situation, facilitated subsequent PAL 
in which these adjectives were used as 
the response (R) terms. They inter- 
preted their results as supporting a two- 
stage learning theory in which Ss must 
first learn the R terms (i.e., must make 
the Rs available) and then learn to as- 
sociate the Rs to their appropriate stim- 
uli. From this point of view, response 
pretraining eliminated the need for the 
availability phase in the PAL task, 
leaving only the associative phase. 

Saltz (1961) interpreted the data 
as suggesting that R pretraining pro- 
duced differentiation of the R terms, 
reducing competition among these Rs. 
To test the availability versus differen- 
tiation position, Saltz used a forced 
availability technique during the PAL 
phase. All the R terms were typed 


1 The present research was supported by 
National Science Foundation Grant No. 
GB 2538. 
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on cards which were set before S. To 
force S to use the cards, a number was 
printed in front of each R and Ss 
were required to respond by giving 
the appropriate number. Number-re- 
sponse combinations were randomly 
changed from trial to trial. Despite 
the fact that the Rs were thus made 
available to all Ss during the PAL 
phase, Ss who received R pretraining 
learned the pairs more quickly than 
those who did not. The results were 
interpreted as consistant with a dif- 
ferentiation position and opposed to 
availability theory. 

However, Runquist and English 
(1964) have proposed an alternative 
interpretation of the Saltz data. They 
suggested that R pretraining might 
permit more rapid scanning of the 
cards for the correct R. Since Saltz 
used a 4-sec. anticipation interval, 
rather than unlimited time for re- 
sponding, faster scanning of the cards 
could result in an advantage for the 
pretrained Ss. They predicted that if 
availability were forced by means of 
the Saltz technique, but Ss were given 
unlimited time to respond, the differ- 
ence between pretrained and nonpre- 
trained Ss should disappear. Using 
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a 4-sec. anticipation interval, and 
essentially the same materials em- 
ployed by Saltz (1961), they repli- 
cated the Saltz results. However, 
with unlimited time for resonse, pre- 
trained Ss learned significantly more 
slowly than nonpretrained. These re- 
sults were contrary to both the dif- 
ferentiation and the availability posi- 
tion. Runquist and English suggested 
that these results may be due to char- 
acteristics of the Saltz forced availabil- 
ity technique. 

The present paper reports three ex- 
periments which attempted to clarify 
some of the issues involved in R pre- 
training. Experiments I and II ex- 
amined the effects of R pretraining of 
meaningful words on PAL under con- 
ditions of unlimited time for response. 
Experiment I examined this issue 
using the 10 pair lists previously em- 
ployed by Saltz (1961). With unlimited 
time for responding, the potential 
amount of familiarization with the R 
terms was much greater during the 
PAL phase of Exp. I than during the 
pretraining phase. This could atten- 
uate pretraining effects. To examine 
this possibility, Exp. II was designed 
to produce faster PAL by decreasing 
the number of items to be learned. 
Experiment III was concerned with 
the effects of R pretraining of nonsense 
syllables. Since differentiation of an 
R is assumed to be a function of fre- 
quency of use, pretraining should have 
a greater effect on differentiation of 
nonsense words than on real words. 


METHOD 


Subjects—The Ss were 232 students from 
the Introductory Psychology course at 
Wayne State University. Of these, 92 
served in Exp. 1, 68 in Exp. IL and 72 
in Exp. III. The Ss were assigned ran- 
domly to conditions in each experiment. 

Materials—Experiment I employed two 
paired-associates lists of 10 pairs each, 
CVCs as stimuli and adjectives as Rs. The 
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R items were those used by both Under- 
wood, Runquist, and Schulz (1959) and 
by Saltz (1961). In one list these Rs were 
similar words, in the other list they were 
dissimilar. Stimuli ranged from 45-53% 
in Glaze (1928) values. No two CVCs 
began with the same letter, and no two 
ended in the same letter. The Rs used for 
irrelevant pretraining were the same as 
those used by Underwood, Runquist, and 
Schulz (1959) for irrelevant pretraining. 
One irrelevant pretraining list consisted of 
similar (synonymous) adjectives, the other 
consisted of dissimilar adjectives. 

Experiment II employed two paired-asso- 
ciates lists of five pairs each, CVCs as stim- 
uli and adjectives as Rs. One list consisted 
of half the pairs from the similar list of 
Exp. I, the other consisted of half the pairs 
from the dissimilar list of Exp. I. The Rs 
of the similar list were: pleasant, carefree, 
laughing, happy, smiling. The Rs of the 
dissimilar list were: spicy, equal, modern, 
tiresome, fiery. The first five irrelevant 
Rs listed by Underwood, Runquist, and 
Schulz (1959) were used for irrelevant pre- 
training. 

Experiment III employed a six-pair list 
of two-letter nonsense syllables as Rs, with 
adjectives as stimuli, The pairs were: 
QUARTER-MJ, ramous-MQ, —EQUAL-GJ, 
open-GX, GrANT-KOQ, spicy-KX. The 
items used for irrelevant pretraining were: 
SV, SF, PG, PV, LG, LF. 

Procedure—In Exp. I, Phase 1 consisted 
of a free learning condition. The relevant 
pretraining group was shown 10 adjectives 
(the Rs of the subsequent PAL phase) for 
five alternate learning and recall trials. 
Items were presented at about a 1-sec. rate 
on index cards, followed by a 1-min, inter- 
val in which S was to say all the words he 
could on each trial; S was told not to 
attempt to learn the serial order since this 
would change. The same procedure was 
used for the irrelevant pretraining condi- 
tions, except that the Rs learned in Phase 1 
were different from those of the subsequent 
PAL phase. In the irrelevant pretraining 
conditions, Ss who learned Rs of high intra- 
fist similarity in PAL learned items of high 
intralist similarity in Phase 1; those who 
learned Rs of low intralist similarity in PAL 
learned items of low intralist similarity in 
Phase 1. 

In the PAL phase, a Stowe memory drum 
was modified so that E could, by means of 
a foot pedal, stop the drum and keep a 
stimulus in the window until S$ responded. 
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All 10 R words were present before S on an 
index card, words numbered 1-10 (the top 
word numbered “1,” the next word num- 
bered “2,” etc.) The Ss was instructed 
to locate the correct R-term on the card 
and respond by giving the number typed 
in front of the word. After Ss responded, 
the window on the drum opened, displaying 
the correct R for 2 sec. Pairs were pre- 
sented in three different orders on the 
drum. Five different cards containing the 
R items were used, the items in a different 
order on each card so that § could not 
learn which number belonged to a given 
word, but was forced to examine the list 
to find the correct R. The Ss were given 
15 learning trials unless they responded per- 
fectly on two consecutive trials prior to 
the fifteenth trial, 

Experiment II paralleled Exp. I with the 
following differences, In place of the anti- 
cipation procedure used in Exp. I, Ss were 
given alternate learning and test trials. On 
the learning trials, Ss were presented the 
S-R pair for 6 sec. On the test trials, the 
stimuli alone were presented until Ss chose 
the correct R from the card; the next stim- 
ulus was then presented with no feedback 
concerning the accuracy of response. Only 
10 learning test sequences were used. 

Response pretraining and PAL procedures 
of Exp. III were identical to those of Exp. 
II except for the use of different materials. 
However, immediately prior to R pre- 
training, Group 1 (half of the Ss of Exp. 
III) served as Ss in a study of free recall 
learning involving meaningful words. This 
prior experience was balanced across experi- 
mental conditions. 


RESULTS 


Experiments I and II, both using 
meaningful words as Rs were con- 
sistant in finding that R pretraining 
had. no significant effects in PAL 
under conditions of unlimited time for 
response. In Exp. I, mean errors 
during PAL of similar Rs were 66.8 
after relevant R pretraining and 64.0 
after irrelevant pretraining. The cor- 
responding means for dissimilar Rs 
were 36.1 and 37.3. Analysis of vari- 
ance indicated F (1, 88) <1 both for 


the main effects of relevant versus irre- 
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levant pretraining and for the inter- 
action between pretraining and simi- 
larity. The main effect due to simi- 
larity was significant, F (1, 88) = 
18.59, p < .01. 

In Exp. II, mean errors during 
PAL of similar Rs was 6.56 after re- 
levant pretraining and 6.06 after irrele- 
vant pretraining. The corresponding 
means for dissimilar Rs were 4.44 and 
4.63. Again analysis of variance in- 
dicated no significant main effects due 
to pretraining and no significant inter- 
action between pretraining and simi- 
larity, in both cases F (1, 64) < 1.00. 
Thus, despite the much more rapid 
learning in Exp. II, compared to Exp. 
I, pretraining of adjectives had no 
effect on subsequent PAL in Exp. II. 
While the mean errors were larger 
for dissimilar Rs, as in Exp. I, the 
main effect due to similarity failed to 
reach significance, F (1, 64) — 1.96. 

In both Exp. I and II, early blocks 
of PAL trials were analyzed for pre- 
training effects. No such effects were 
found. It appears that additional fa- 
miliarization with the Rs, occurring 
during PAL, cannot account for the 
failure to find facilitation due to pre- 
training in these data. 

In Exp. III, using nonsense syl- 
lables as Rs, relevant pretraining sig- 
nificantly facilitated PAL. For Group 
l Ss (who participated in a prior 
learning experiment) relevant R pre- 
training resulted in 12.65 errors dur- 
ing PAL, while irrelevant pretraining 
resulted in 16.2 errors. For the naive 
Ss (Group 2) the corresponding mean 
errors were 5.75 and 10.75. Analysis 
of variance indicated that the main 
effects for pretraining produced F (1, 
68) = 5.15, » «.05. The effect of 
the prior task was also significant, 
Group 1 learning more slowly than 
Group 2, F (1, 68) = 11.08, p < .01. 
The interaction between groups and 
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pretraining did not approach signif- 
icance, F (1, 68) « 1.00. 

Figure 1 shows that the superiority 
of relevant pretraining, in Exp. II, 
develops on the first trial. In Group 1 
this superiority persists Over all 10 
trials. In Group 2, the superiority of 
relevant pretraining disappears only 
when all Ss approach errorless per- 
formance. Thus there is no indication 
that additional familiarity with the R 
terms, developed during PAL, reduces 
the comparative effectiveness of rele- 
vant pretraining. 

DISCUSSION ; 

The present data clearly show that 
pretraining on nonsense items facilitates 
subsequent paired-associates learning in 
which these items are the R terms. This 
occurs even though Ss have unlimited 
time to respond, and a forced availability 
technique makes the nonsense items 
available to all $s, nonpretrained as well 
as pretrained. These data are consistent 
with the response differentiation position 
proposed by Saltz (1961). They appear 
to be contrary to the Underwood, Run- 
quist, and Schulz (1959) position that 
R  pretraining facilitates subsequent 
paired-associates learning by increasing 
the availability of the Rs. 

Turning to the data for meaningful Rs, 
however, Exp. I and II indicate no facili- 
tory effect on PAL due to R pretraining 
under conditions of forced availability 
and unlimited time for response. 

On the basis of research to date, two 
alternative interpretations of the mecha- 
nisms of pretraining appear reasonable. 
The first interpretation is that both dif- 
ferentiation and availability are relevant 
factors, Pretraining increases the differ- 
entiation of nonsense syllables, resulting 
in faster subsequent paired-associates 
learning. However, meaningful words 
are already well differentiated and so 
pretraining does not influence their sub- 
sequent learning as Rs in paired associ- 
ates. The fact that Underwood, Runquist, 
and Schulz (1959) obtained facilitation 
of paired associates with pretraining on 


Fic. 1. Mean errors per trial in Exp. IIT 
for relevant and irrelevant R pretraining 
conditions of Groups 1 and 2. A 


meaningful Rs, in the absence of forced 
availability, suggests that R availability 
occurs as a consequence of R pretraining. 
While the operation of R differentiation 
due to pretraining appears to be re- 
stricted to nonsense material, availability 
would presumably operate at all levels of 
meaningfulness. 

However, a second interpretation of the 
data is also possible. Until this point in 
the present paper we have been concerned 
with differentiation as a function of pre- 
vious experience with a word (ie, ac- 
quired differentiation). In addition, it 
seems reasonable to assume that differ- 
entiation of a word is a function of the 
intensity of the stimulus for the word. 
Thus a memorial trace would be less dif- 
ferentiated than a physically present 
word, and the differentiation of a physi- 
cally present word would increase with 
increased time during which the word is 
physically present. In such an event, un- 
limited time for examination of already 
well differentiated words might bring 
such words to asymptotic level of differ- 
entiation and attenuate any previous ef- 
fects of R training. Such a relationship 
would account for Saltz (1961) obtaining 
facilitation due to pretraining on mean- 
ingful words when the paired-associates 
test permitted 4 sec. for anticipation 
while the present studies failed to find a 
corresponding facilitation for meaning- 
ful words with unlimited time for re- 
sponse. Aspects of the Saltz (1961) data 
appear to be consistent with this interpre- 
tation. In that study, superiority for 
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pretrained, similar responses did not oc- 
cur on early trials, but developed later in 
learning. This data does not appear to 
be consistent with the Runquist and 
English (1964) position that pretraining 
merely facilitated scanning the availabil- 
ity cards in the 4-sec. interval permitted 
by Saltz (1961). If superiority produced 
by pretraining were due to facilitated 
Scanning one would normally expect the 
Superiority to appear early and then 
rapidly drop in magnitude. Future re- 
Search will be directed at testing the 
writer's interpretation of the interaction 
between time and pretraining. 

It should be pointed out that the dis- 
tinction between stimulus and response 
is relatively arbitrary in studies such as 
those here reported. An S is shown a 
stimulus and the entire set of appropriate 
items with which this stimulus might 
have been paired on the previous trials. 
In effect, $ is asked to match an item 
from one stimulus array with its appro- 
priate partner in a second stimulus array, 
This fact poses Special problems in using 
nonsense Rs. Pilot work on Exp. III 
utilized dissimilar CVCs as Rs. Since 
all Rs were physically present during the 
paired-associates test, Ss could (and did) 
tend to identify the correct CVC R by 
simply using its first letter, ignoring the 
Syllable as a whole, To obviate this, Exp. 
IIT Successfully used, as R terms, two- 
letter syllables in which neither the first 
nor second letter alone could be used 
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as a reliable cue for the correct associ- 
ative pairing, 

Finally, there are no obvious differ- 
ences in procedures which might account 
for the difference in results between the 
Runquist and English study and the 
Present data involving meaningful Rs, 
Considering the greater number of S in 
our study, the greater range of R char- 
acteristics (ie, lists of high intralist 
similarity as well as lists of low intralist 
similarity) and the consistency of our 
findings over lists of various lengths, the 
writers are relatively confident that R 
pretraining on meaningful words has no 
detrimental effect on paired-associates 
learning when this learning involves the 
Saltz forced availability technique and 
unlimited time for response, 
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A study of operant conditioning of vasoconstriction is reported. Fin- 
ger volume was recorded during a 5-min. base-line period, during a 
25-min. acquisition period and during a 10-min, extinction period. 
During the acquisition period each of 2 experimental groups (n= 
12) were reinforced at the peak amplitude of each criterion vaso- 
constriction. 2 matched control groups (n=12) received an equal 
number of reinforcements, but reinforcement was contingent upon 
vasomotor stability. A base-line control group (n=6) received no 
reinforcement, During extinction, 1 experimental group was rein- 
forced for vasomotor stability, while reinforcement was omitted for 
all other conditions. Experimental Ss showed a significant increase 
in number of vasoconstrictions during acquisition; control Ss did 
not. Conditioned vasoconstriction was independent of gross bodily 
movement, muscle tension in the forearm and finger, heart rate, and 
respiratory irregularities. The results were interpreted as evidence 
against theories which state that visceral responses can only be 


classicaly conditioned. 


The question of whether visceral re- 
sponses can be conditioned by operant 
techniques has for some time been a 
controversial matter on both theore- 
tical and empirical grounds. Early 
empirical attempts failed (e.g, 
Mowrer, 1938; Skinner, 1938), but 
during the past few years numerous 
Successful attempts have been reported 
(Crider, Shapiro, & Tursky, 1966; 
Fowler & Kimmel, 1962; Kimmel & 
Hill, 1960; Kimmel & Kimmel, 1963; 
Millr & Carmona, 1967; Miller & 
DiCara, 1967; Razran, 1961; Shearn, 
1962; Shearn & Clifford, 1964; 
Trowill, 1967; Van Twyver & Kim- 
mel, 1966). 

Since these recent reports appar- 
ently refute theoretical arguments that 
visceral responses cannot directly be 
instrumentally conditioned (Keller & 
Schoenfeld, 1950; Kimble, 1961, 1964; 
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Konorski & Miller, 1937; Mowrer, 
1947, 1950; Skinner, 1938; Solomon & 
Wynne, 1954), it is important to note 
that mediation of the visceral response 
via instrumentally conditioned skeletal 
responses appears to have been ruled 
out for both GSR (Van Twyer & 
Kimmel, 1966) and heart rate (Miller 
& Di Cara, 1967; Trowill, 1967). 
The status of a third visceral re- 
sponse, vasoconstriction, is less clear ; 
the results of a study of instrumental 
conditioning of vasomotor changes in 
the rabbit (Fromer, 1963) failed to 
provide strong evidence for instru- 
mental conditioning of this response, 
but data from human Ss (Snyder & 
Noble, 1965) suggest that this re- 
sponse can be instrumentally condi- 
tioned. In the latter study, however, 
mediation of the visceral response via 
certain skeletal responses could not be 
ruled out. In view of the theoretical 
and practical importance of the general 
problem and the large individual dif- 
ferences in patterns of visceral re- 
sponses (Lacey & Lacey, 1958, 1962), 
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the present study was designed to 
determine whether or not finger vaso- 
constriction could be instrumentally 
conditioned in the absence of mediat- 


ing skeletal responses. 


MeEtTHop 


Subjects—A total of 54 introductory psy- 
chology students were assigned to two ex- 
perimental conditions (n= 12), two matched 
conditions (n= 12), and a base-line control 
(»=6), There were equal numbers of 
males and females in each group. 

Apparatus—Right index finger volume 
was recorded from a Yellow Springs-Fels 
plethysmograph on one channel of an Offner 
Model R Dynagraph oscillographic recorder 
at 2.5 mm/sec speed with a Type 9806A 
ac/de Input Coupler (set for dc). The 
electronically controlled heater of the finger 
cup was modified to dc operation to elimi- 
nate 60-cycle interference to EMG record- 
ing, and cup water temperature was held 
to 35° C+ .5°, 

Right forearm extensor-flexor EMG was 
recorded on a second oscillograph channel 
using a Type 9852 EMG Integrator Input 
Coupler modified to .25-sec, rise and 1-sec. 
decay time constants, The EMG electrodes 
were two standard EKG arm electrodes 
(32 X 55 mm.) plus a third electrode for 
ground placed on the forearm near the 
elbow. Sanborn Redux Paste was applied 
to the electrodes, which were held in place 
lightly with standard rubber arm straps. 
Preliminary tests indicated that placement 
of the two active electrodes for maximal 
EMG from isometric extension and flexion 
of the index finger also gave a measure- 
able indication of any movement of muscle 
tension great enough to produce a finger 
Vasoconstriction or to affect vasomotor tone 
aes by finger pulse volume amplti- 
tude. 

Abdominal and thoracic respiration were 
Tecorded on the third and fourth oscillo- 
graph channels with Type 9806A ac/dc In- 
put Couplers (set for dc) from mercury-in- 
tubber strain gauges around the trunk at 

the xiphoid and underarm levels. Each 
gauge operated in a full bridge resistance 
circuit powered by a mercury cell, The 
gauges were made up according to Ackner's 
(1956) description except for the use of 
Pure silver electrodes which are much more 
stable with mercury than stainless steel, 
"The reinforcement light was a standard 
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pilot-light assembly turned on by E, 
corded on the marker channel of the oscillo 
graph, and turned off after 3 sec. by 
electronic timer. The light was locat 
on a small sloping panel on the front of 
S's chair, which also served as a rest 
the right arm and plethysmograph cup. - 

Procedure—The Ss were seated in 
sound-treated and electrostatically shiel 
room, and were read the following instruc. 
tions : 


This is an experiment in learning, We 
are attempting to condition an involu 
tary response over which you have no di. 
rect control. I will attach pickups to your 
finger, arm, and trunk to record various | 
activity. In no case will you receive an 
electric shock or other unpleasant stim- 
ulus. All you have to do is watch the 
signal light that indicates when you are — 
making the correct response. But don't - 
try to make any kind of voluntary re- 
Sponse. Just sit in a comfortable position, 
relax, breath quietly and evenly, and don't — 
make any unnecessary movements. This 
will take 40 min., so if you get uncom- 
fortable and have to move, go ahead and 
change position but don't keep moving — 
around any longer than necessary, It 
necessary for you to stay awake. If any — 
of the equipment slips or you become un- | 
comfortable you can call me over the - 
intercom, which is on all the time. Do 
you have any questions? ti 


In the instructions to Group B (base-line ^ 
control) reference to the signal light and T 
correct response was omitted. 1 

After placement of the electrodes, a 5-min, 
base-line period was followed by a 25-min, 
conditioning and a 10-min. extinction period. A 
There were no rest pauses, 9 

All Ss were treated alike during the base- 
line period, in which responses were re- 
corded but the reinforcement light was 
omitted. During conditioning, Ss in two 1 
experimental groups (EC and EN) which D 
were treated alike until extinction, received p 
reinforcement at the peak amplitude of any 
definite vasoconstriction not preceded by |. 
bodily movement; Ss in Group B were 
treated just as during the base-line period, 
while for Ss in Group C reinforcement was 
contingent upon 5-sec. vasomotor stability. 
Each S in the latter group received the same — 
number of reinforcements per 5-min. period. n 
as did a given S in Group EC or EN. and — 
in this sense these Ss were matched with P 
Ss in Groups EC and EN. 


OPERANT CONDITIONING OF VASOCONSTRICTION 


During extinction, the reinforcement light 
was omitted for all Ss except those in 
Group EC, which continued to receive re- 
inforcement at about the same rate as be- 
fore but now contingent upon 5-sec. vaso- 
motor stability. 

Scoring—Changes in finger volume were 
scored so as to exclude the respiratory com- 
ponent by measuring the vertical distance 
between the beginning and end of the change 
at corresponding points (phase) in the res- 
piratory cycle. In addition, CRs which 
occurred within 5 sec. after thoracic or ab- 
dominal respiratory irregularity were ex- 
cluded. The cardiac component of the 
plethysmogram was excluded by measuring 
along the troughs of the pulse waves of the 
plethysmogram. Very slight movements of 
the finger resulted in an abrupt change in 
both the plethysmogram and the EMG 
records and vasomotor changes occurring 
2-5 sec. after any such movement were also 
excluded from the main results. Vasocon- 
strictions which began between 2-5 sec. 
following the onset of the light (reinforce- 
ment) were also excluded. Finally, indexes 
of both tonic respiratory rate and heart 
rate were obtained. The former was ac- 
complished by counting the number of 
respiratory cycles during the first 1 min. 
during successive 5-min, periods that was 
free of gross respiratory irregularities. An 
approximation of heart rate was obtained 
from the plethysmogram. 

The minimum amplitude of vasoconstric- 
tions included in the main results was set 
individually for each S so as to give an 
operant level during the base-line period of 
approximately one vasoconstriction free 
from skeletal muscle artifacts in the plethys- 
mogram, respiration, or EMG records. This 
criterion which ranged .45 cu. mm.-3 cu. mm. 
with a mean of 1.52 cu. mm., or a range of 
3 mm.-21 mm. of movement on the oscil- 
lographic records. with a mean of 10.8 mm., 
permitted Ss to be equated for initial oper- 
ant level regardless of recording sensitivity, 
total finger volume within the plethysmo- 
graph cup, and other such variables asso- 
ciated with absolute volume measurements; 
it also eliminated very small and unreliable 
volume changes that may not have been 
vasomotor in nature. 


RzsuLTS AND CONCLUSION 


. The mean number of vasoconstric- 
tions that met the amplitude criterion 
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Fic. 1. Mean number of vasoconstrictions 


for successive 5-min. periods for experi- 
mental (EC and EN), matched control 
(C), and base-line control (B) conditions. 


are shown in Fig. 1 for successive 
5-min. periods. 

As described above, the amplitude 
criterion was arbitrarily set to provide 
the same initial base-line operant level, 
and the actual means obtained during 
the initial 5-min. base-line period did 
not differ significantly (p< .20). 
During the conditioning phase there 
was a marked increase in number of 
vasoconstrictions for both experi- 
mental conditions, a decrease for the 
matched controls (C) and no essential 
change for the base-line control (B). 
During the final two 5-min. acquisition 
periods, and during extinction, each S 
in Groups EC and EN gave more cri- 
terion vasoconstrictions than did his 
matched control Since there was no 
overlap between these distributions, 
statistical tests of the differences be- 
tween experimental and control groups 
did not seem appropriate. 

The data of Group C included a 
number of scores of zero and were not 
included in the analyses of variance 
described below. However, the overall 
difference between Groups B and C 
during the conditioning phase was sig- ' 
nificant, t (28) = 271, p < .02, but the 
apparent increase in performance of 
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Group C during extinction was not 
significant (p > .20). 

The results of an analysis of vari- 
ance of Groups EC and EN during 
conditioning showed a significant prac- 
tice effect, F (4, 80) = 1264, p< 
.001. All other main effects and inter- 
actions were nonsignificant (p > .20). 
Extended-trend analysis of variance 
(Grant, 1956) revealed that only the 
linear, F (1, 80) = 44.30, p < .001, 
and quadratic, F (1, 80) = 5.90, p< 
.05, components of the overall practice 
effect were significant. The linear 
component accounted for approxi- 
mately 88%, and the quadratic com- 
ponent for approximately 12%, of the 
variance. 

Similar analyses which included the 
final 5-min. conditioning period to- 
gether with the 10-min. extinction 
period also were performed. The 
tendency for Group EC performance 
to decline relative to that of Group EN 
during extinction proved to be reliable 
as evidenced by a significant linear 
component of the EC + EN x Extinc- 
tion interaction, F (1, 40) = 4.15, p< 
05. That linear component accounted 
for 96% of the variance. Further 
analyses showed that there was a sig- 
nificant downward linear trend for 
Group EC, F (1, 40) = 9.03, p < .01, 
but not for Group EN (F < 1.0). 


Fic. 2. Respiratory-related vasoconstric- 
tions of experimental (EC and EN), 
matched control (C), and base-line control 
(B) conditions. 
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The decline in performance of 
Group EC during extinction together 
with the decline in performance of 
Group C during the acquisition phase 
suggests that these conditions resulted 
in learning not to respond. This effect 
is small, however, compared to the 
effects of conditioning for Groups EC 
and EN during acquisition and, more- 
over, was not found in an earlier 
study (Snyder & Noble, 1965). Con- 
sequently, any conclusions at this 
point must be highly tentative. 

Respiratory irregularities —As stated 
earlier, vasoconstrictions associated 
with respiratory irregularity were ex- 
cluded from the main results. It is of 
interest, however, to note that, as 
shown in Fig. 2, vasoconstrictions 
which followed within 2-5 sec. a res- 
piratory irregularity increased for the 
experimental conditions, but not for 
the control conditions. The binomial 
sign test showed that differences 
among groups were not significant at 
the beginning of acquisition (p > .30), 
but that the difference between either 
experimental group and the control Ss 
was significant by the end of training 
(p < .05), indicating that respiratory 
irregularity also became conditioned. 
The tendency for the performance of 
Group EN to be greater than that of 
Group EC was not reliable (p > .30). 
The failure to find any decrease in 
mean number of vasoconstrictions for 
the experimental conditions during ex- 
tinction is not surprising in view of 
the fact that only those respiratory 
irregularities which were followed 
within 2-5 sec. by a criterion vaso- 
constriction were reinforced. This 
represents a partial reinforcement sit- 
uation, and resistance to extinction 
would be expected. 

EMG control.—No instances of re- 
flexive vasoconstrictions to phasic 
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EMG activity were scored because 
large bursts of EMG activity threw the 
plethysmograph off scale and single 
spikes had no effect upon vasomotor 
activity. Continuous, moderate EMG 
activity was found to correlate with 
obvious irregularity in the plethysmo- 
gram; in fact, the plethysmograph em- 
ployed was more sensitive to muscle 
artifact than the EMG recording. 
Respiratory rate and heart rate.— 
There were no significant differences 
among conditions either for mean 
respiratory rate or for mean heart rate. 
In conclusion, the results indicate that 
like GSR and heart rate, vasoconstric- 
tion can be instrumentally conditioned, 
and that this was independent of gross 
bodily movement, muscle tension, 
heart rate, respiratory irregularity, and 
minute finger movements. The results 
thus provide additional evidence which 
is contrary both to theories which state 
that ANS responses can be learned 
only by classical conditioning, and 
theories which assume that the learn- 
ing of somatic and visceral responses 
are mediated by functionally different 
neuroanatomical systems. 
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72 Ss first learned 12 CVC pairs presented in the same serial order 
onevery trial. Half of the Ss learned an LH list (low formal stimulus, 
S, term similarity, high formal response term similarity), and half 
learned an HL list, Then a 2nd transfer list was learned to evaluate 
the relative cue strength of S-term and serial position (SP) cues 
in different positions of the Ist list. For S-terms this was accom- 
plished by changing the order of the Ist-list pairs and for SP cues 
by changing only the order of the S-terms. The transfer results 
showed a greater utilization of S-terms with List LH but a greater 
utilization of SP cues with List HL. Under both similarity con- 
ditions, however, better performance was displayed for S-terms taken 
from the middle of the Ist list, while SP cues facilitated 2nd-list 
learning more in beginning and end positions, 


Paired-associate (PA) learning ex- 
periments usually vary the order of 
presentation of the pairs on every 
trial. Consequently, S must rely upon 
the elements of the stimulus (S) term 
to cue the correct response (R) term. 
However, when the pairs of the list 
are presented in the same serial order 
on every trial, S can also respond to 
whatever cues are tied to serial posi- 
tion (SP), such as ordinal position, 
relative position, or preceding R-terms 
in the list. Thus, under a constant- 
order method of pair presentation, S 
is provided with a compound stimulus 
consisting of S-term and SP cues. In 
a study designed to investigate the 
relative importance of these two types 
of cues in learning Newman and 
Salt; (1962) had Ss first study a list 
of pairs in serial order. Then a recall 
test was given in which the serial order 
of the S-terms was the same or differ- 
ent from that used during study trials. 


1This investigation was supported by 
Public Health Service Research Grant MH- 
11179-02 from the National Institute of 
Mental Health. The author wishes to thank 
Slater E. Newman for his suggestions for 
improvement of the manuscript. 


Under the different-order condition, 
SP could no longer directly serve as 
a cue. The results showed no decre- 
ment from changing the order, thereby 
clearly indicating that S-term cues are 
sufficient to produce the correct re- 
sponse. Similar results have been re- 
ported by Carluccio and Crowder 
(1966), Martin and Saltz (1963), and 
Newman and Forsyth (1965). 

The failure to find a decrement from 
changing the order on testing does not 
by itself, however, argue against the 
notion that Ss also used positional cues 
in learning. Indeed, Newman and 
Saltz (1962) suggested that under the 
different-order condition the S-term 
cue may reinstate the cue for SP via 
mediation. More direct evidence for 
the utilization of SP cues is shown by 
the Martin and Saltz (1963) study. 
After PA learning, in which test-order 
had no significant effect, Martin and 
Saltz found significantly faster serial 
learning of R-terms when these ap- 
peared in same as opposed to a dif- 
ferent order from the PA list, thereby 
demonstrating that something had 
been learned about the order of the 
list. Rubin and Brown (1967) re- 
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ported a similar serial-learning facilita- 
tion following  constant-order PA 
learning. 

From the available evidence it would 
appear that both S-term and SP cues 
are used in constant-order PA learn- 
ing. However, at present it is not 
clear how these cues are utilized in dif- 
ferent positions of the list. To answer 
this question, the present study first 
required the learning of paired asso- 
ciates in a constant serial order. Then 
a transfer test was used to determine 
separately the cue strength of S-term 
and SP cues in different parts of the 
list. This was accomplished for S 
terms by changing the order of the 
pairs and for SP cues by changing only 
the order of the S terms. By examin- 
ing performance on R terms as a func- 
tion of the position each occupied in 
the first list, it should be possible to 
evaluate the role of S-term and SP 
cues in learning. 

Several considerations suggest that 
the two types of cues will be used dif- 
ferently in different positions of the 
list. First, research in serial learning 
(e.g. Battig, Brown, & Schild, 1964; 
Ebenholtz, 1966; Young, Hakes, & 
Hicks, 1967) indicates that SP cues 
facilitate learning more in beginning 
and end than in middle positions. 
Second, there is reason to believe that 
S-R associations should be stronger 
in middle than in end positions of the 
list. Specifically, it appears that posi- 
tion discrimination is more difficult in 
the middle than ends of the list (Eben- 
holtz, 1963). Thus, in the more dif- 
ficult middle part of the list, S may 
tend to rely more heavily on the S- 
term to. cue the response. Further 
support for this notion is provided by 
Brown and Rubin (1967) who showed 
more facilitation for middle- than end- 
position items in a serial list when 
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these served as stimuli in a subsequent 
PA task. 

For increased generality, it was also 
deemed desirable to use more than one 
list in the present study. Several in- 
vestigators (eg. Battig, Brown, & 
Nelson, 1963; Newman, 1966; New- 
man & Saltz, 1962) have postulated 
that high S-term similarity should aug- 
ment the use of SP cues in constant- 
order PA learning. In the extreme 
case of complete identity among S- 
terms, S can only respond to cues de- 
rived from SP. To the extent that SP 
provides a basis for the learning of R- 
terms in serial order, then low similar- 
ity among R-terms should also increase 
the likelihood of responding to SP: 
serial learning proceeds faster with 
low than high interitem similarity 
(Underwood & Richardson, 1956). 
For these reasons two lists were con- 
structed. The first (List LH) was 
designed to favor S-term utilization 
and was comprised of S-terms of low 
formal similarity and R-terms of high 
formal similarity. The second was 
constructed to favor the utilization of 
SP cues and was an HL list. 


METHOD 


First-list learning.—Each of 72 paid Ss, 
all summer-session students at Kansas State 
University, first learned a 12-pair list pre- 
sented in the same serial order on every 
trial. Half of the Ss learned an LH list and 
half an HL list. The LH list, comprised of 
Archer (1960) 31-81% CVC pairs, was 
exactly the LH list used in a previous 
study by Rubin and Brown (1967) and de- 
scribed in detail therein. The HL list was 
obtained by reversing the S- and R-terms of 
the LH list Assignment of Ss to each 
list was based upon a prearranged order 
which assured that the groups were filled 
at the same rate. " 

Transfer—After reaching a criterion of 
one errorless trial on the first list, Ss 
within both the LH- and HL-list condition 
were subdivided into four matched groups 
on the basis of first-list performance (trials 
to criterion). Each group then learned in 
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a constant serial order a different second 
PA list, constructed from the syllables of 
the first list. For Group SP (n=8), each 
R-term appeared in the same position as it 
did in the first list but with a different 
first-list S-term, ie, the S-terms were 
unsystematically re-paired with R-terms, with 
the restriction that no S-term immediately 
precede or follow the same S-term as it 
did in the first list. For Group S (n=12), 
each R-term appeared with its first-list 
S-term but in a different position; each pair 
being separated by at least two positions 
from a previously adjacent pair in the first 
list. To control for difficulty of learning 
in different positions of the second list, 
one-third of the Ss (four) in Group S 
learned each of three different orders of the 
second list. Each order was constructed so 
that one of the three pairs from each quarter 
of the first list appeared in the beginning 
third of the second list, another in the mid- 
dle third, and the third pair in the end 
third of the second list. When pooled across 
all three list-orders, the pairs from each 
quarter of the first list thus appeared 
pal often in each third of the second 
ist, 

The two remaining groups of eight Ss each 
constituted controls. One (Scrambled R) 
received the S-terms in the same positions 
as in the first list and the R-terms in dif- 
ferent positions, following the procedure de- 
scribed for Group SP. The second 
(Scrambled S-R) received both S- and R- 
terms in different positions both with respect 
to the first list and to one another in the 
second list. The same S-R pairings were 
used for the Scrambled R, S-R, and SP 
groups. Whereas both control groups in- 
volved re-pairing of R-terms, the Scrambled 
S-R group also repaired S-terms with SP. 
The use of these groups was intended not 
only to provide a basis for comparison with 
Groups S and SP, but also to determine if 
any benefit accrues from preserving the 
Stimulus structure of the first list. 

Procedure.—Upon entering the experi- 
mental room, S was seated in front of a 
translucent projection screen which blocked 
his view of E and the apparatus. He then 
Was read usual instructions for PA learning, 
With no mention of the constant serial order. 
Learning proceeded under the recall (study- 
test) method, which began first with the 
Successive presentation of the 12 pairs, fol- 
lowed in the same order by the individual 
Presentation of the S-terms for recall. The 

Was required to spell his responses. 
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The same procedure was used for both 
first- and second-list learning, except that 
all Ss were told before learning of the 
second list how the list was derived. That 
is, Ss in Group S were told that the pairs 
would be presented in a different order, and 
in Group SP that only the S-terms would 
be in a changed order. This was done so 
that S could respond immediately on the 
basis of the appropriate (nonre-paired) cue. 
The Ss in the Scrambled R and S-R groups 
were similarly informed about the con- 
struction of their respective transfer lists. 

All materials were prepared on 2X2 in. 
slides and presented through an automati- 
cally operated slide projector. A 4-sec. rate 
of presentation was used throughout each 
trial, with a 10-sec. interval between study- 
test series and trials. Learning of the sec- 
ond list for all groups was carried out to a 
criterion of one errorless trial. 


RESULTS 


First-list learning —The number of 
total errors (omissions and failures to 
respond correctly), errors before and 
after the first correct response to each 
pair, and trials to criterion, were tab- 
ulated separately for each S under the 
LH- and HL-list condition. All four 
measures showed better performance 
for the HL than LH list. However, 
analysis of variance revealed only the 
difference in after errors and trials to 
criterion, respectively, between List 
HL (7.4 and 8.5) and List LH. (12.5 
and 10.4) to be significant, Fs (1, 70) 
= 5.30, at or beyond the .05 level, the 
criterion for all analyses to be re- 
ported. Because of matching, com- 
parability of the various transfer 
groups in terms of initial learning abil- 
ity was assured within each List con- 
dition. The mean number of trials to 
criterion for the four groups under the 
HL-list condition ranged 8.3-8.9 and 
under the LH-list condition 9.8-11.1. 
Neither the main effect of Groups nor 
the Groups X Lists interaction was sig- 
nificant (Fs <1 for all four perform- 


ance measures). 
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TABLE 1 sults are presented in Table 1. 
lrscestacs oF Savina Scosrs be seen, second-list learning 
On Necowp List 


cues 
mained paired with their first-list Re 
terms than when repaired with a dif. 
ferent R-term, The overall 


the combined Scrambled R and 


groups (11.1) proved to be highly sig 
nificant, F (1, 64) vs 39.26. pe. 


than Group SP for the LH list, while 
Group SP was superior with the 
list, leading to a significant LH- 


Further analysis revealed nonsignil- 
icant differences in performance be 
li tween (a) Groups S and SP; (b 
(1, Scrambled R and S-R groups; i 

' (e) List LH (35.5) and HL (323) 
all Fs <1. All remaining interactions 
involving Lists were also nonsignif- 
icant. Analysis of saving scores based 
on total errors revealed results essen - 
tially the same as those obtained using 
trials to criterion. 
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fra dist. in- 
volving Lists and Serial Position were 
insignificant, the transfer curves in 
Fig. 1 are based on data pooled across 
Lists. As shown, Group SP made 
relatively more second-list errors on 
items from middle than end 

ol the first list, while Group S dis 
played better performance in the first 
and middle positions. The difference 
in curvature proved to be reliable as 
evidenced by both a significant quad- 
atic, F (1, 36) 9789, and cubic, 
F (1, 36) = 5.30, component of the 
Position x S-SP interaction, — Fur- 
ther analysis showed the SP transfer 
curve to possess both a 

linear, F (1, 14) = 295, and 

ratice component, F (1, 14) m 9,56, 
and the S transfer curve a significant 
cubic component, F (1, 22) & 471. 


to first-list. position. 


Discussion 


The occurrence. of significantly 
tecond-tist performance by Groups S and 
SP dearly shows that both Sterm and 
poritional cues develop functional stimulus 
pet in constant-order PA learning. 

thermore, there were fewer 
errors for S-terms taken from 
for SP cues taken from ends of 
Net, Only in the first. position 
term performance equivalent to 
played in middle positions, To 
Vent the latter finding might reflect 
tion of the cue for SP is uncertain, 

The transfer results for Storms 
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quential learning tasks may be a general 
phenomenon. 

In agreement with original expectation, 
intralist similarity was shown to influ- 
ence significantly the overall extent to 
which S-term and positional cues are used 
in learning; however, type of list (LH- 
HL) had little, if any, effect upon rela- 
tive utilization of cues in different posi- 
tions of the list. Although it cannot be de- 
termined in the present study whether the 
observed effects of similarity are due to 
S-term similarity, R-term similarity, or 
to the particular combinations used, it 
seems clear that intralist similarity repre- 
sents a potent variable for future studies. 
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INTERCOMPONENT ASSOCIATION FORMATION DURING 


PAIRED-ASSOCIATE TRAINING WITH 
COMPOUND STIMULI* 


THEODORE E. STEINER AN» ROBERT SOBEL 


University of Illinois 


2 studies were run to test whether Ss form intercomponent associa- 
tions during paired-associate learning with compound stimuli, In 
both experiments the stimuli were word-color pairs and Ss were 
trained to a learning criterion of 1 perfect trial. The 1st study was 
conducted to test Ss’ ability to correctly match the components of 
training compounds. The results indicated that Ss matched signif- 
icantly better than the chance level. The 2nd study was designed to 
investigate 2 possible ways in which correct matching might occur. 
Ss might match components because of some intercomponent associa- 
tion or because both components had been paired with the same 
response (response matching). Special matching tests were de- 
signed to separate the above mentioned ways matching might occur, 
The results indicated that intercomponent associations had a signif- 
icant effect, but response matching did not, The overall results of 
both experiments were consistent with the hypothesis that Ss form 
intercomponent associations during paired-associate training with 


compound stimuli. 


Two independent lines of research on 
paired-associate learning using com- 
pound stimuli have provided results 
compatible with the notion that inter- 
component associations are formed 
during training. 

In a paper on stimulus selection, 
Underwood, Ham, and Ekstrand 
(1962) suggested that Ss may have 
formed associations between the color 
and trigram components of color-tri- 
gram stimuli. Spear, Ekstrand, and 
Underwood (1964) obtained results, 
using a verbal discrimination technique, 
which were interpreted by the authors 
as supporting the hypothesis that inter- 
component associations are formed dur- 
ing learning. James and Greeno 
(1966), in a study of the effects of 
overtraining on stimulus selection, in- 
icated that one alternative explanation 
of their findings was that intercompo- 


1 This research was supported by a grant 
to the senior author from the University of 
Illinois Research Board. 


nent associations were formed during 
overtraining. 

In several articles on pattern-vs.- 
component discrimination — learning 
(Estes & Hopkins, 1961; Friedman, 
1966) it was suggested that during 
training with compound stimuli, Ss 
initially respond to a compound as a 
collection of components. But as train- 
ing progresses, Ss begin to respond to 
the entire compound as a unit. One 
interpretation of the previous state- 
ment, not made by the above experi- 
menters, is that Ss form associations 
between the components of compound 
stimuli. 

The present experiments were de- 
signed to provide a relatively direct 
test of the existence of intercomponent 
associations. In both studies, follow- 
ing training on a list of compound stim- 
uli to a criterion of one perfect trial, 
some form of a matching procedure was 
used to test for intercomponent asso- 
ciations. Requiring Ss to try to match 
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stimulus components of training com- 
pounds provides a method of testing for 
intercomponent associations without 
additional training or transfer trials. 

In Exp. I, the ability of Ss to match 
components of stimulus compounds 
was tested. 

TTwo ways in which Ss might (cor- 
rectly) match components of a com- 
pound stimulus, in addition to guessing, 
were considered. First, S may have 
formed  intercomponent associations 
(S-S) and match on the basis of these 
associations. Second, it is a possibility 
that an S may match two components 
if the same response has been learned 
to each (Spear et al, 1964). The 
second method will be called response 
matching (S-R-S). The flexibility of 
the matching procedure was used in 
Exp. II to assess the contribution to 
matching performance of intercompo- 
nent associations and response match- 


ing. 
EXPERIMENT I 
Method 


Subjects—The Ss were 48 introductory 
students at the University of Illinois. Par- 
ticipation in the experiment fulfilled a course 
requirement. 

Materials—The training stimuli were 
seven word-color compounds with the word 
appearing on a solid color background. The 
words, except for HAT replacing pre, were 
the words used by Underwood, Ham, and 
Ekstrand (1962). The colors used were 
orange, yellow, red, pink, blue, green, and 
purple. The words and colors were ran- 
domly paired to form a single list. The 
responses, also randomly assigned, were the 
digits 2-8. _ 

For the matching test, a column of all 
seven words appeared on the left of a sheet 
of paper and on the right appeared a column 
of all seven color patches and the appropriate 
color names. Three randomly determined 
orderings of the words and colors within 
the columns were used. 

Design.—Two groups of 24 Ss each were 
trained on the training list to a criterion of 
one perfect trial. Groups 1 and 2 received 
2 sec. and 4 sec, per trial, respectively, to 
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study each S-R pair during training, 
lowing the criterion trial, Ss were 
the matching test. 

Apparatus and Procedure—The trai 
stimuli were presented using a hand: 
trolled slide projector. A Hunter 
was used for timing. 

The training list was presented usin 
RT procedure, which consists of pre: 
alternately study trials for all items i 
list, and then test trials for all items in 
the list. The R cycle was either 2 or 
sec/item depending on the group. The t 
cycle was S paced. After an S met the 


list, S was given the matching sheet 
instructed to draw lines connecting the v 
and colors which had appeared ti 
during training. There was no time 
on the matching task. 


Results 


The mean total numbers of errors 
to the learning criterion for Group 1 
and Group 2, 5.54 and 3.26, re 
tively, were shown to be significa: 
different by an analysis of variance, 
F (1, 46) = 4.14, p < .05. 1 

The data of major interest were thi 
matching data. A 2X 3 analysis of 
variance of the total number of corr 
matches/S indicated that the two 
ing levels, 2 sec. and 4 sec., dur: 
training and the Timing x Matchi 
Order interaction had no signifi 
effect, F (1, 42) < 1 and F (2, 42) 
1.5. However there was a signifi 
effect due to the matching orders, F (2, 
42) = = 3.25, p < .05. Because both the 3 
main effect of the timing during train 
ing and the interaction of Timing X- 
Matching Order were nonsignificar 
the data for the two training grou, 
were pooled for further analysis. 

The probability distribution, am 
therefore the mean, of the number E) 
correct matches by chance alone can 
be calculated by a well known formula _ 
(Parzen, 1960). The mean number o 0 
matches expected by chance, in 
case, is 1.001. Using the error term 
from the analysis of variance of the 
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matching data, the observed average 
number of correct matches, 4.4, was 
found to be significantly different from 
chance performance, F (1, 42)— 
138.64, p < 01. However, the anal- 
ysis of variance of the matching data 
indicated an effect due to the matching 
orders. The mean numbers of correct 
matches for the three orders were 5.44, 
3.84, and 3.84. The average of the 
two poorer orders, 3.84, was still sig- 
nificantly above chance, F (1, 42) = 
97.94, p < .01. 


Discussion 


The results of the first experiment in- 
dicate that Ss can match components of 
training compounds significantly better 
than would be expected by chance, There- 
fore, some explanation of the observed 
matching performance is necessary. 


EXPERIMENT IT 


Experiment I provided evidence that 
Ss could match word and color com- 
ponents of word-color compounds fol- 
lowing training on the compounds, 
However, the observed matching could 
arise from two sources. The Ss may 
have formed associations between the 
word and color components (S-S). 
Also, Ss may have matched components 
that were paired with the same re- 
sponse (S-R-S) during training. The 
present study was designed to attempt 
to separate the possible contributions 
of intercomponent associations (S-S) 
and response matching (S-R-S). 


Method 


Subjects—An additional 48 Ss were ob- 
tained for this study from the same source 
as in Exp. I. 

Materials—The training stimuli were 
eight word-color compounds with the words 
appearing on a solid-color background. The 
words used were those employed by Under- 
wood et al. (1962) plus mar. The words 
and colors were randomly paired to form 
a single list. The digits 2-5 were randomly 
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assigned as responses. The S-R list used 
was the following: NEW brown—2; DE 
orange—2; nev yellow—3; cot pink—3; DAY 
red—4; HAT blue—4; Gas purple—S; BAD 
green—5. In the training list, two compound 
stimuli were paired with each response so 
that matching tests could be designed to 
isolate matching due to intercomponent asso- 
ciations and matching due to response 
matching (S-R-S). 

The matching test stimuli were sheets of 
paper with a word on the left and a column 
of three color patches with the appropriate 
color names on the right. Each S received 
two samples of each of four types of match- 
ing test. For the purposes of randomiza- 
tion, the fourth type of matching test was in- 
cluded as a filler. In the fourth type, a word 
was paired with three incorrect colors, so 
that the frequency of each test type was 
equal, although all eight words were tested. 
The word and color assignments used in the 
matching tests were eight randomly chosen 
assignments from the set of possible assign- 
ments with the following restrictions, Each 
color was correct for at most one matching 
test per S. Each word was used equally 
often, across Ss, in the matching test Types 
1, 2, and 3. In addition, for each S the 
position of the correct colors were independ- 
ently randomized so that they appeared 
equally often in each of the three positions. 
The balancing of the correct position within 
each S was possible with four matching 
types because Type 4 had no correct colors. 
The order of the matching tests was ran- 
domized for each S. 

Design.—Each S was run, using the RT 
procedure, to a criterion of one perfect 
trial Following criterion, the S' was shifted 
to the matching tests. Each S received two 
instances of each of four types of matching 
tests. 

In the Type 1 matching tests, an S was 
presented with a word; the color paired 
with the word in training; the other color 
paired with the same response as the test 
word, and a third color. For an example, 
see Table 1, Column 2. Intercomponent 
association tendencies (S-S) should contrib- 
ute to matching the word and the correct 
color, which was the color paired with the 
word during training. If an S response 
matches (S-R-S), then either of the colors 
paired with the same response during train- 
ing may be matched, presumably with equal 
probability. Any guessing tendencies should 
cause matching of any of the three colors 
with equal probability. Columns 1 and 2 of 


278 


TABLE 1 
OUTLINE AND RESULTS or MATCHING TESTS 


Match- Hypothetical lobana 


i Example of X 
Type | Matching Test | peesponse | Propor: 
BED 
yellow c(1) | S-S; S-R-S; | .615 
1 pink e SRS; g -250 
green c(3) |g 135 
BED 
yellow c(1) | S-S; S-R-S; .698 
g 
2 red  c(2 
brown eo {2g 302 
BED 
imk EU S-R-S; g .365 
3 ue c 
orange c(3 ( 2g ax 


Table 1 provide, in summary form, an out- 
line of the hypothetical response tendencies 
contributing to the matching of the various 
colors for the three meaningful matching. 
tests. Intercomponent associations are sym- 
bolized by S-S, response tendencies by S-R- 
S, and guessing by g. Type 1 matching 
tests are outlined in Row 1. A measure of 
intercomponent association can be obtained 
by observing the difference between the cor- 
rect matches, c(1) in Table 1, Row 1, and 
the frequency of matching to c(2), the color 
which was not paired with the test word but 
was paired with the same response as the 
test word. In the example, “yellow” func- 
tioned as c(1), "pink" as c(2), and "green" 
as c(3). 

The Type 2 matching test was designed to 
provide a pure test of matching. In Type 2 
matching tests the color paired with the 
test word was presented along with two 
colors which were paired with two other 
responses in training. An example is found 
in Table 1, Column 2, Row 2. Any tendency 
to respond either by intercomponent asso- 
ciations or by response matching should lead 
to a correct match, c(1), in the Type 2 
tests. See Table 1, Row 2 for schematic 
outline. 

The third type of matching test was to 
provide a pure test of response matching 
(S-R-S). The S was required to match a 
word with either the color, c(1), which was 
paired to the same response as the word but 
not paired with the word in training, or one 
of two other colors. The two other colors 
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were not paired with either the word or with 
the same response during training. An ex- 
ample appears in Table 1, Column 2, Row 3, 
Any response matching (S-R-S) tendencies 
should favor matching c(1). Guessing 
should provide equal matching for all three 
colors. 

The Type 4 matching test was a word 
paired with three colors which had neither 
been paired with the word nor with the 
same response as the word during training. 
The fourth type was included for randomiza- 
tion and balancing purposes, as was pre- 
viously indicated. 

Apparatus and procedure.—The apparatus 
was the same for this experiment as in 
Exp. L The procedure was essentially the 
same except for two differences: Ss were 
all run with 2 sec. study time/item on each 
trial, and the matching test instructions were 
slightly modified to fit the new test format. 


Results 


There was no indication, by analysis 
of variance of total errors/S, that the 
two orders of the training list differed 
in difficulty, F (1, 46) < 1. 

A summary of the matching per- 
formance to the three types of match- 
ing tests appears in Column 3 of Table 
l. The data for each matching test 
type is based on 96 cases, because each 
S had two examples of each type. For 
statistical analysis, three difference 
scores were obtained for each S in the 
following ways. For Type 1, the dif- 
ference score for each S was the num- 
ber of c(1) matches minus the number 
of c(2) matches in the two instances 
of the Type 1 matching test. The dif- 
ference score provides a measure of the 
S-S tendency. By Table 1, it can be 
seen that the S-S tendency to c(1) is 
the only difference between the theoret- 
ical matching tendencies to c(1) and 
c(2). Under the null hypothesis of no 
S-S tendency, the average difference 
would be expected to equal zero. For 
test Type 2, the difference score for 
each S was the number of c(l) 
matches minus one half the number of 
c(2) or c(3) matches. As can be seen 
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from Table 1, the Type 2 difference 
score provides an estimate of matching 
corrected for guessing. Under the null 
hypothesis of chance matching, the 
Type 2 difference score would have an 
expected average equal to zero. The 
difference scores for the Type 3 match- 
ing tests were obtained in the same 
fashion as the Type 2 difference scores 
and provides a measure of the response 
matching (S-R-S) tendency corrected 
for guessing. See Table 1, Row 3. 
The Type 3 difference score would 
have an expected average equal to zero 
if there were no S-R-S tendency. 

Analysis of variance of the difference 
scores indicated no significant training 
order effect or Training Order X Match- 
ing Type interaction, F (7, 20) < 1 and 
F (14, 80) < 1. The effect on the dif- 
ference scores due to matching type 
was significant, F (2, 80) = 7.06, p < 
.01. Of major interest was whether 
the difference scores for the various 
matching types significantly differed 
from zero. For matching Types 1, 2, 
and 3, the average difference scores 
were .729, 1.093, and .125, respectively. 
The difference scores for matching 
tests Types 1 and 2 significantly dif- 
fered from zero, F (1, 80) = 15.75, 
p < Oland F (1, 80) = 348, p < 0l. 
The average difference score for the 
Type 3 matching test did not signif- 
icantly differ from zero, F (1, 80) < 
1, Of interest, but not statistically in- 
dependent of the above analysis, was 
whether the number of c(1) responses 
differ between the Type 1 and Type 2 
tests. The difference was not signif- 
icant, ¢ (47) = —.66. 


Discussion 


The pattern of results seems fairly 
clear. The Type 2 matching test result 
provides evidence that Ss can match com- 
ponents of compound stimuli, under the 
present conditions, and confirms the Exp. 
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I result. The data from the Type 1 
matching test provides evidence that Ss 
matched on the basis of intercomponent 
associations. The analysis of the differ- 
ence scores from the Type 3 matching 
tests indicated that there was no evidence 
that response matching (S-R-S) had a 
significant effect on matching. A differ- 
ence in the number of correct matches, 
c(1), to Types 1 and 2 would be expected 
because S-R-S tendencies might be ex- 
pected to put Ss in a response competition 
situation in Test Type 1. The lack of a 
significant difference between the fre- 
quency of correct matches for Types 1 
and 2 is a second bit of evidence that re- 
sponse matching (S-R-S) did not signif- 
icantly contribute to matching. It is easy 
to formulate reasonable mathematical 
models which would predict that the re- 
sponse matching tendency (S-R-S) might 
be present, but would not manifest itself 
in a competition situation with the correct 
color, as in the Type 1 matching tests. 
But the fact that the effects of response 
matching in Type 3 were negligible would 
be surprising if S-R-S tendencies do con- 
tribute at all, under the present stimulus 
and training conditions, to component 
matching. 

In both experiments Ss were able to 
match components of training compounds 
significantly better than would be expected 
by chance. In Exp. II, there was no evi- 
dence of response matching (S-R-S). It 
appears that Ss have formed intercom- 
ponent associations during training on 
compound stimuli. 

Both the results of this experiment and 
the findings of Spear et al. (1964) pro- 
vide evidence of the existence of inter- 
component associations, but under differ- 
ent conditions. Spear et al. provide evi- 
dence that intercomponent associations 
are formed during verbal discrimination 
training, but no separate assessment was 
made of intercomponent associations de- 
veloped during paired-associate training 
and response-mediation effects. The pres- 
ent study was designed specifically to sep- 
arate the effects of intercomponent as- 
sociations produced by paired-associate 
training and response mediation. 
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The present results have some addi- 
tional implications. In many experiments 
on stimulus selection, following training 
on compound stimuli, Ss are given recall 
tests with single components (Estes & 
Hopkins, 1961; James & Greeno, 1966; 
Underwood et al, 1962). If intercom- 
ponent associations are formed during 
training, the performance to one com- 
ponent when tested alone may be affected 
by the level of learning to the other com- 
ponents of the same compound. The 
above process might be expected to min- 
imize recall performance differences be- 
tween selected and unselected components. 

Also, it has been suggested (Estes 
et al, 1961; Friedman, 1966) that the 
stimulus unit changes over the course of 
training on a pattern-vs.-component dis- 
crimination from a set of components into 
a pattern. The present evidence for the 
existence of intercomponent associations 
is suggestive of the possible fruitfulness 
of an analysis of the role of intercom- 
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ponent associations in the hypothesized 
change of stimulus unit, 
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CONTOUR INTERACTIONS IN VISUAL MASKING* 


KEVIN HOULIHAN ax» ROBERT W. SEKULER 
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Gratings of various orientations were used to mask a briefly exposed 
vertical test bar. The amount of masking decreased as the grating 
was rotated over a range of 42° from vertical While the total 
amount of masking varied with exposure duration of the grating (5, 
50, or 500 msec.), the shape of the functions relating masking to grat- 
ing orientation was unchanged. The decrease in masking with increas- 
ing difference in the orientations of grating and test bar may be useful 
as a measure of the sensitivity functions of mechanisms responsible for 


the perception of contours. 


Microelectrode studies of the visual 
systems of lower animals have revealed 
neural cells which are "tuned" to spe- 
cific stimulus configurations or events 
(eg. Hubel & Wiesel, 1962). For 
example, there are some units that re- 
spond with great vigor to a border or 
edge of a particular orientation while 
remaining relatively unresponsive to 
the same border after rotation through 
some angle. The possibility that our 
own visual systems may contain simi- 
lar mechanisms is a most exciting one 
for the analysis of visual perception 
and has already produced some human 
psychophysical experiments designed 
to explore this proposition (Andrews, 
1965; Campbell & Kulikowski, 1966; 
Sekuler & Pantle, 1967). The func- 
tional properties of such mechanisms 
may provide important insights into. 
Our perceptual capacities, particularly 
those that are related to the extraction 
of stimulus invariants from the chang- 
ing pattern of proximal stimulation. 

One area of concern is the psycho- 
physical characteristics of orientation- 
sensitive mechanisms, particularly the 

1 Supported by grants from the National 
Institute of Neurological Diseases and 
Blindness and from the Northwestern Uni- 
versity Research Committee. This article 
is based on the Master's thesis of the first 


author which was directed by the second 
author. 


range of orientations over which they 
respond. Visual masking is a tech- 
nique that permits the examination of 
these response ranges. If viewing con- 
tours with one orientation interferes 
with the detection of a contour having 
some other orientation, the extent of 
such interference is a measure of the 
degree to which perception of both 
these orientations is mediated by com- 
mon mechanisms. For example, Sek- 
uler (1965) measured the interference 
that briefly exposed gratings of vari- 
ous orientations exerted upon detec- 
tion of both horizontal and vertical 
test bars. His data suggest, at least 
in his backward masking situation, 
that mechanisms involved in the per- 
ception of contour orientations differ- 
ing by 45° or more are largely inde- 
pendent of one another. 

Some evidence suggests that with 
increasing exposure duration the range 
of orientations to which contour analy- 
zers are responsive may diminish, i.e., 
their tuning around their preferred 
orientation becomes sharper. An- 
drews (1965) had Ss judge the ap- 
parent orientation of a briefly pre- 
sented small, dark line. Although the 
line was always maintained in the same 
orientation, Ss reported that on'suc- 
cessive exposures it seemed to change 
orientation, varying by as much as 30° 
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around its true orientation. With in- 
creasing exposure durations the ap- 
parent changes in orientation dimin- 
ished. Relating these observations to 
the neurophysiological work of Hubel 
and Wiesel (1962), Andrews argued 
that the variation in apparent slope of 
the line with short exposures reflects 
the latency of inhibitory processes in 
the contour-sensitive mechanisms. Evi- 
dence from studies of auditory mask- 
ing also suggests that with time there 
develops a similar sharpening of tun- 
ing. Using a forward masking pro- 
cedure, Elliot (1967) found that when 
a masking tone was presented for very 
brief durations, the frequency range 
of test tones it would mask was greater 
than that which could be masked by a 
more prolonged presentation of the 
same masker. 

These studies suggest that with in- 
creasing stimulus duration mechanisms 
responsive to a particular dimension 
of stimulation become more selective 
in their responsivity. If perception of 
the orientation of a grating were medi- 
ated by mechanisms whose tuning 
sharpened with time, one could expect 
that briefly exposed gratings would 
mask a broader range of test bar 
orientations. The experiment de- 
scribed in this paper measured the 
masking of a vertical test bar by grat- 
ings of various orientations. The hy- 
pothesis of primary interest was that 
the function relating amount of mask- 
ing to the difference in orientation be- 
tween test and masking stimuli would 
steepen with increasing duration of the 
masker. 


METHOD 


Stimuli were presented in Maxwellian 
view using a standard three-channel optical 
system with glow modulator tube sources. 
A steel biting board covered with dental 
impression compound was mounted on a 
heavy duty rack and pinion to steady S's 
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head. A 2-mm. artificial pupil was used 
in front of the right viewing eye; the left 
eye was occluded throughout. When a con- 
toured masking stimulus was called for, a 
glass-mounted photographic grating could 
be inserted in a collimated portion of the 
beam in one channel of the optical system. 
Gratings could be inserted, withdrawn, and 
reinstated with a negligible change in posi- 
tion. Seen in Maxwellian view, each grat- 
ing pattern consisted of alternating dark 
and bright bars, each subtending 8' visual 
angle. The test stimulus was a small bright 
bar (1*X8' visual angle) and was pre- 
sented with long axis vertical in the center 
of the field. Both masking and test stimuli 
were optically superposed by cube-type beam 
splitters on a 4.3 ftl. adapting field of 16? 
diameter. The mean luminance of the 
masking field with grating in place was 
17.2 ftl, while the unattenuated luminance 
of the test channel was 1.5 ftl. A small 
dark fixation point was present about 4° 
to the left of center of the adapting field. 

Thresholds for the detection of the test 
bar were measured using a blockwise track- 
ing procedure (Sekuler, 1965). On each 
trial the masking grating was presented 
for the required duration and, after an inter- 
stimulus interval of 10 msec., the test bar 
could be presented for 10 msec. On a ran- 
domly selected half of all trials the glow 
modulator in the test bar channel was not 
pulsed, causing the test bar to be omitted 
from the usual masking sequence. After 
each trial Ss judged whether the test bar 
had or had not been presented. The num- 
ber of correct responses in each block of 
12 trials controlled the test-bar luminance 
for the succeeding block of 12 trials. When 
S made more than nine correct responses 
in a block, the luminance for the next block 
was increased by .1 density units; if fewer 
than nine correct responses, the luminance 
was decreased by the same amount. This 
permitted S’s responses to maintain the 
test-bar luminance at approximately the level 
required for 75% correct performance. 
Luminance was changed by adding or sub- 
tracting neutral density filters in the test- 
bar channel. 

In addition to measuring the masking of 
the test bar by variously oriented gratings, 
thresholds were also measured under one 
control condition. In the control condition 
no grating was inserted in the masking 
channel; instead appropriate neutral density 
filters were used to produce a homogeneous 
masker of luminance equal to the luminance 
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of the grating masker. Masking produced 
by such a homogeneous field is due only 
to rapid light adaptation and if a grating 
produced more masking than such a homo- 
geneous field, this additional masking can 
be attributed to the involvement of contour- 
sensitive mechanisms. 

Hereafter, alpha will designate the angle 
between the orientation of the test and 
masking contours. The experiment had two 
independent variables, duration of the mask- 
ing stimulus and alpha. The three masker 
durations (5, 50, and 500 msec.) were 
combined factorially with five values of 
alpha (0°, 6°, 18°, 30°, and 42°) as well 
as with the homogeneous masking control. 
A block randomized design was used in 
which masking duration was varied between 
sessions while alpha varied within sessions. 
The S KH was studied in three complete 
replications of the experiment, S RS in two. 

In every condition the threshold was mea- 
sured in 10 blocks of 12 trials each, The 
primary data of the experiment were mean 
values of neutral density filters in the test- 
bar channel during the threshold tracking. 
Only filter values used following the first 
reversal of direction of filter changes 
(whether increasing or decreasing density) 
were considered in these calculations. 


RESULTS 


Data from the two Ss were judged 
similar enough so they could be com- 
bined, the mean thresholds for both Ss 
and all conditions are shown in Fig. 1. 
The points plotted are the mean neu- 
tral density filters required to keep 
the test bar at threshold. 

An analysis of variance was per- 
formed on the data from each S, sep- 
arately. The effect of alpha was sig- 
nificant (p < .01) for both RS, F (4, 
15) — 5.17, and for KH, F (4, 30) — 
428. At all three masking durations 
sensitivity increases as a function of 
alpha, or in other terms, the amount 
of masking decreases as alpha in- 
creases. At alpha = 42° sensitivity is 
approximately the same as with the 
homogeneous control masker. 

The hypothesis relating the slopes 
of the functions shown in Fig. 1 to 
masking duration received no support. 
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Fic. 1. Log relative sensitivity as a 
function of the difference in orientation be- 
tween vertical test bar and the masking 
grating. (Values obtained with the homo- 
geneous control masking field are shown to 
the right of the abscissa. Parameter is 
masking field duration.) 


In both analyses of variance the inter- 
action between masking duration and 
alpha was not statistically significant 
(p.25). The analyses of variance 
yielded significant (p < 01) effects 
of masking duration for both RS, F 
(2, 15) = 33.30, and KH, F 
(2, 30) = 5.86. As Fig. 1 shows, the 
5-msec. masking duration produced the 
smallest amount of masking of the 
three durations studied. The two lon- 
ger durations, 50 and 500 msec., each 
produced about the same amount of 
masking. 

Table 1 shows the standard devia- 
tions of the measurements. 


TABLE 1 
SD ror Data SHOWN IN Fic. 1 


Masking Pattern 


0° 6° | 18° | 30° | 42° | Control 


.|.24|.27 | 17 | -22 | -20 | .23 
Hen “11 | .08 | 10] .11 | 17 | .10 
.34 | 40 | .40 | .45 | .37 
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Discussion 


Several similarities may be noted be- 
tween the results of this experiment and 
those reported earlier by Sekuler (1965). 
In the present experiment little difference 
was found between masking with alpha = 
42° and masking by a homogeneous field of 
equivalent luminance. A similar relation- 
ship was obtained in the earlier studies 
which used a backward rather than a 
forward masking procedure, a dark test 
bar rather than a light one, and duration 
rather than luminance thresholds. The 
amount of masking levels off somewhere 
between alpha = 45° and alpha = 60°, and 
is constant thereafter all the way to 90°. 
Insofar as our results are relevant to 
the mechanisms responsible for the as- 
sessment of visual orientation, they sug- 
gest that such mechanisms are restricted 
in their sensitivity to a range of some 45° 
of orientations, presumably on either side 
of their most preferred orientation. If 
one can extrapolate from the functions of 
Fig. 1 to the sensitivity functions of 
orientation analyzers, one would guess 
that such analyzers are only 50% as sensi- 
tive to orientations 6°-18° away from 
their preferred orientations as they are 
to their preferred, i.e., about .3-log unit of 
the maximum masking represented in Fig. 
1 was dissipated by the time alpha = 6° 
or alpha = 18° was reached. This figure 
is in good agreement with estimates made 
from the data of Sekuler (1965), though 
this study used a different masking pro- 
cedure. 

An analogue to the present experiment 
may be found in the work of Stiles 
(1959) and others on the mechanisms 
of color vision. In the color vision work, 
an increment threshold is measured for a 
small test patch when superimposed upon 
a continuously present background. By 
determining the extent to which the in- 
crement threshold for one wavelength is 
raised by a background of some other 
wavelength it is possible to determine the 
spectral sensitivity of whatever mechanism 
is responsive to that particular test wave- 
length. By repeating this procedure for 
a large number of combinations of test 
and adapting wavelengths Stiles was able 
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to describe the spectral tuning of several 
different color vision mechanisms, The 
spectral tuning of such mechanisms has 
recently been shown to be unchanged by 
the temporal relationships between adapt- 
ing and test stimuli. 

Alpern and Rushton (1965) have shown 
that the spectral tuning of Stiles' (1959) 
color mechanisms is unchanged when the 
test flash is followed, after a 50-msec. de- 
lay, by the adapting flash. The tuning of 
orientation mechanisms seem to exhibit 
the same sort of independence of test- and 
adapting-stimulus temporal sequence. 
Essentially the same results are obtained 
when test precedes adapting field (Se- 
kuler, 1965) or when the adapting field 
precedes the test (the present experi- 
ment). 

The relative amount of masking gener- 
ated by 5-, 50-, and 500-msec. flashes in 
this experiment needs some comment, 
Biersdorf (1955) has shown that the crit- 
ical duration, measured under conditions 
similar to our own, was about 60-70 
msec. Beyond these durations the reci- 
procity between luminance and duration 
which is described by Bloch's law fails to 
hold. Bloch's law implies that the bright- 
ness of our masking stimuli, for durations 
less than the critical interval, will be de- 
termined by the product of luminance and 
duration, Beyond the critical duration 
brightness of the masking flash should be 
determined by its luminance alone. Onley 
and Boynton (1962) have shown that in 
conditions where brightness can be varied 
separately from luminance, the amout of 
masking is a function of the masking flash 
brightness rather than luminance. Given 
the critical duration for our particular 
conditions one would expect that flashes 
of 50 and 500 msec. would be of approxi- 
mately equal brightness. Consequently, 
it is not surprising that thresholds meas- 
ured following either 50- or 500-msec. 
masking flashes of constant luminance are 
quite similar (see Fig. 1). Another 
aspect of the present data is not consistent 
with the work of Onley and Boynton. 
Given a critical duration somewhat in 
excess of 50 msec., Bloch's law leads us to 
expect that the brightness of the 5-msec. 
masking flash would be approximately 
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one-tenth that of the 50-msec. flash. The 
dependence of masking upon brightness 
that Onley and Boynton demonstrated im- 
plies that the masking produced by the 5- 
msec, flash should be only one-tenth that 
produced by the 50-msec. flash. This 
means that there should be about 1-log 
unit separation between the sensitivity 
functions shown in Fig. 1 for the 5- and 
50-msec. flash conditions. The authors 
see no obvious compelling explanation for 
the fact that there is on the average only 
one-half of the predicted separation be- 
tween the two sensitivity functions, 

In summary, then, although no evidence 
was found of progressive sharpening of 
tuning of contour analyzers with increas- 
ing exposure of the masking pattern, it 
did seem that over a variety of masking 
conditions the interaction of contours as a 
function of their orientational differences 
was remarkably constant. 
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The eyelid responses of 160 Ss were differentially conditioned to 
simple arithmetic problems presented visually. Each problem CS was 
different, and the differential cue was whether the problem was cor- 
rectly or incorrectly solved. In addition 20 Ss were reinforced with 
the air puff UCS for both right and wrong problems. The “truth 
value” of the problems served as a good differential stimulus, and the 
conditioned discrimination transferred differentially to the words 
“right” or “wrong” which were associated with the reinforced class 
of problems. Explicit verbalization of the appropriate word, “right” 
or “wrong,” during the differential training had no effect on the trans- 
fer to the word as a CS. Verbalization reduced the number of Ss 
showing the voluntary (V) form of the eyelid CR, and for those 
showing the conditioning (C) form, conditioned discrimination was 
significantly better when the wrong problems were reinforced than 
when the right problems were reinforced with the air puff. This 
finding was related to the congruency of the slightly aversive UCS 
following wrong problems rather than right problems. When both 
right and wrong problems were reinforced more CRs were evoked by 
the right problems, confirming Soviet results with true and false 
sentences. 


The aim of the present investigation 
was to study the phenomena of classical 
and differential conditioning when the 
differential stimulus was the “truth 
value” or the correctness or incorrect- 
ness of arithmetic problems. 

Pavlov (1927) may be translated as 
saying, “A (word) is as real a con- 
ditioned stimulus for man as all the 
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other stimuli in common with animals, 
but at the same time more all inclusive 
than any other stimuli . . . Owing to 
the entire preceding life of the human 
adult a (word) is connected with all 
the external and internal stimuli com- 
ing to the cerebral hemispheres, signals 
all of them, replaces all of them and 
can, therefore, evoke all the actions and 
reactions of the organism which these 
stimuli produce [p. 405]." Words are 
indeed superb differential stimuli in 
eyelid conditioning, and the transfer of 
a differential CR from one pair of rein- 
forced and nonreinforced words to an- 
other pair is immediate (Grant, Levy, 
"Thompson, Hickok, & Bunde, 1967). 
In developing Pavlov's ideas, Soviet in- 
vestigators have produced an extensive 
literature on the interactions of lan- 
guage stimuli and responses, the second 
signalling system, with the first signal- 
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ling system, the more primitive affer- 
ent-efferent system common to man and 
the other more highly evolved animals 
(eg., Hartman, 1963; Ivanov-Smolen- 
sky, 1956; Razran, 1961). Particularly 
relevant to this investigation are studies 
by Faddeéva, Fuflygina, Gerasimchuk, 
and Naroditskaia (Ivanoy-Smolensky, 
1956, pp. 122-176), several of which 
are admirably reviewed by Hartman 
(1965). 

Generally speaking, Pavlov (1927) 
maintained that through reinforcement 
the CS comes to signal the onset of 
the UCS. If the CS is meaningful in 
its own right, however, a world of new 
phenomena and relationships becomes 
possible. When language or another 
symbol system, e.g., numbers, provides 
the conditioned stimuli, the properties 
and interrelations of the symbol system 
permit that the CS do much more than 
signal the UCS onset. For example, 
words have at least grammatical, syn- 
tactic, semantic, and associative rela- 
tionships. A sentence may convey in- 
formation, express a command, be 
relevant or irrelevant to the experi- 
ment, have truth value, rightness or 
wrongness, etc. Any of these features 
of the “meaningful” CS may be rele- 
vant to differential conditioning and to 
the transfer of differential conditioning. 

The present experiment, dealing with 
the truth value of the CS, was inspired 
largely by El’kin’s (1957) surprising 
finding that the truth or falsity of a 
brief statement was an important factor 
in its effectiveness as a CS. Condi- 
tioned eyelid responses were readily 
obtained with young children when the 
CS sentences were true, but when the 
CS sentences were false the condition- 
ing was very poor. This remarkable 
result, if confirmed, suggests some of 
the complex and subtle phenomena that 
may arise when meaningful conditioned 
stimuli are employed. 

Simple arithmetic problems were 


chosen to be the differential stimuli in 
this experiment for two reasons: first, 
their truth value was unambiguous to 
college Ss who were reasonably skilled 
in the formalities of simple addition and 
subtraction; and second, students have 
learned that the correctness or incor- 
rectness of arithmetic is not a trivial 
matter. Differential conditioning was 
employed in all but one procedure, be- 
cause learning a conditioned discrim- 
ination requires that S read and eval- 
uate the correctness of each problem 
stimulus. Further, differential condi- 
tioning permits examination of both 
excitatory and inhibitory phenomena in 
the conditioning process. An unusual 
feature of the experiment was that the 
nominal CS differed on every trial, i.e., 
no problem appeared twice, so that in 
a sense the differential CS was truth 
value as each stimulus was an instance 
of the concept of right or wrong. Clas- 
sical conditioning to concepts has been 
reported by Branca (1957) and Lang, 
Geer, and Hnatiow (1963) ; and Brot- 
sky (1967) has recently reported GSR 
conditioning to verbal concepts of the 
Underwood variety (Underwood & 
Richardson, 1956). Otherwise, this 
kind of concept conditioning experi- 
ment seems to have been executed only 
by Soviet researchers, e.g., Faddeéva 
and Naroditskaia (Ivanov-Smolensky, 
1956, pp. 156-176) who used the bulb- 
squeezing motor method of instru- 
mental training with verbal reinforce- 
ment and who were interested primarily 
in the dynamic transfer of conditioning 
from specific picture instances of a 
concept to the word that named the 


concept class. 


METHOD 


Apparatus —The S wore a plastic head- 
piece, originally designed to support a weld- 
er’s mask, On the right side of the head- 
band was mounted the microtorque potentio- 
meter, which was used to pick up the eyelid 
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response, and an air jet directed at the right 
cornea. The eyelid movements were ampli- 
fied and differentiated, and both the re- 
sponse and time derivative were recorded by 
a Texas Instrument 2-channel Oscillo/riter. 

The S sat in a double soundproof room 
within an outer room where the rest of the 
simulating and recording equipment was 
arranged. He sat in a comfortable ophthal- 
mological examination chair and faced the 
interior of a wooden rectangular enclosure 
that was painted flat white. At the rear 
of the enclosure, approximately 120 cm. 
from S was a 35 X45-cm. ground glass 
screen upon which stimuli could be pro- 
jected from the rear by means of a 16-mm. 
strip film projector. 

The conditioned stimuli were all simple 
addition and subtraction problems of the 
form: 6+3=9, or 7—1-—4, Half the 
problems were right and half were wrong, 
and a different problem was presented on 
each trial. These stimuli appeared as white 
numerals on a dark background, Each nu- 
meral was 25 mm. wide and 32 mm. high, 
and the total width of a problem was ap- 
proximately 170 mm., so that no eye move- 
ments were required to read the total prob- 
lem. Luminance of the numerals was 40 
mL. and that of the background of the 
ground glass screen was 4.5 mL. The sur- 
rounding illumination in the white box was 
approximately .15 mL. The duration of 
each CS was 1,900 msec. The UCS, a puff 
of nitrogen to the cornea of 200 msec. dura- 
tion, and pressure at the source of 160 mm. 
of mercury, was sufficient to evoke a sharp 
eyelid reflex. When given, the UCS was 
delivered 1,700 msec. after the onset of the 
CS. The 1,700-msec. ISI was selected on 
the basis of a pilot study of verbal RT 
to. tachistoscopically presented problems 
(North, Grant, & Fleming, 1967). Here 
the mean verbal RT to the various types of 
problems was about 1 sec. so that 1,700 
msec, permitted an adequate interval between 
classification of the problem as right or 
wrong and the onset of the UCS so as to 
produce optimal differential conditioning 
(Hartman & Grant, 1962). Intertrial inter- 
vals varied from 15 to 25 sec. with a mean 
of 20 sec. and were programmed by Western 
Union Tape transmitter. 

Procedure.—For all groups involving dis- 
crimination training, the experiment was di- 
vided into three phases. The first phase 
consisted of 60 differentially reinforced trials 
on right and wrong problems. Reinforced 
and unreinforced problems were randomly 


interspersed with the restriction that no 
more than three consecutive right or wrong 
problems could appear in the sequence, Fol- 
lowing this there was a 60-trial reversal 
phase where the reinforcement contingencies 
were reversed for half of the Ss and where 
the initial reinforcement contingencies were 
continued for the other half. Finally, all 
Ss were given 10 generalization trials, 5 to 
the word "right," and 5 to the word "wrong," 
presented in a random sequence. None of 
the generalization trials was reinforced. 

Experimental design—In the initial ex- 
periment one pair of groups was initially 
reinforced on right problems and another 
pair of groups was initially reinforced on 
wrong problems. One group of each pair 
continued the initial discrimination for the 
second phase of the experiment, and the 
other group in each pair had the reinforce- 
ment contingency reversed. Subsequently 
these four groups were replicated with the 
addition of instructions to classify the prob- 
lem verbally as "right" or "wrong" imme- 
diately upon presentation of the CS. In 
addition, these groups were to read aloud 
any word that appeared on the screen. Thus 
the total experimental design was trifac- 
torial, the three orthogonal factors being: 
(a) rightness or wrongness of the initially 
reinforced stimulus, (b) continuation or re- 
versal of the initial reinforcement contin- 
gency, and (c) verbalization or nonverbal- 
ization of the rightness or wrongness of the 
problem. 

Finally, a ninth group (RW) of Ss was 
given the same sequences of 120 right and 
wrong problems with all trials reinforced. 
These Ss received the verbalization instruc- 
tions. This aspect of their task was, of 
course, irrelevant to the reinforcement, but 
verbalization would presumably improve 
conditioning with the otherwise poor CS- 
UCS interval (Kimble, 1962). 

Subjects—The Ss were 90 men and 90 
women students from introductory psychol- 
ogy classes at the University of Wisconsin. 
Ten men and 10 women served in each 
group. In addition to the usual neutral in- 
structions employed in the Wisconsin labora- 
tory, these groups were informed that there 
would be a series of simple addition and 
subtraction problems flashed on the screen, 
and that the answers to some of the prob- 
lems might be wrong. The Ss who were to 
verbalize were instructed further, “When 
a problem appears on the screen, you are to 
say as quickly and as accurately as possible 
whether the problem is ‘right’ or ‘wrong. 


ARITHMETIC CORRECTNESS IN DIFFERENTIAL CONDITIONING 289 


In addition, if any word should appear on 
the screen please state that word." 


RESULTS 


All eyelid responses greater than 1 
mm. in magnitude which occurred from 
200 to 1,700 msec. after CS onset were 
scored as CRs. The Hartman and 
Ross (1961) criterion was used for 
classifying Ss in terms of eyelid re- 
sponse form as either voluntary-form 
responders (Vs) or conditioners (Cs). 
Overt verbalization of the rightness or 
wrongness of the stimulus problems 
served to reduce markedly the rate of 
voluntary-form responding. Only 9 of 
the 80 Ss in the verbalizing groups as 
compared with 33 of the 80 Ss in the 
nonverbalizing groups were classified 
as Vs, x? (1) = 17.07,  « 01. Sim- 
ilarly only 4 of the 20 Ss in the RW 
group were V-form responders. In re- 
ducing V-form responding Ss’ verbal- 
ization seemed to serve the same func- 
tion as Spence’s masking task (Spence, 
Homzie, & Rutledge, 1964) and Ross’ 
movie distraction activity (Ross, Koski, 
& Yaeger, 1964). It is perhaps note- 
worthy that in the present experiment 
S was making an instrumental response 
of explicitly verbalizing the very cog- 
nitive feature of the stimulus that 
served as the relevant discriminal cue 
for the differential eyelid CR, and 
that this instrumental verbal response 
tended to be accompanied not by a 
voluntary-form eyelid CR but by the 
classical form. Because V-form re- 
sponders were plentiful only in the non- 
verbal groups, it was possible to com- 
pare the performance of V-form and C- 
form responders in these groups but 
not in the verbal groups. 

Acquisition—The data on acquisi- 
tion of conditioned discrimination are 
presented in Fig. 1 where the percent- 
ages of CRs to the positive and negative 
stimuli are plotted for successive 10- 
trial blocks during differential training. 


VERBAL GROUPS (Cs) 
——R+W- (N*37) 
T-—W*R- (i34) 


PER CENT CRs 


FINVENEEI LX 
TEN-TRIAL BLOCKS 

Fic. 1. Acquisition of the initial condi- 
tioned discrimination. (Percentages of CRs 
are plotted separately for the reinforced, +, 
and nonreinforced, —, stimuli for successive 
10-trial blocks of five reinforced and five 
nonreinforced stimuli. The nonverbalizing 
groups are separated by form of the CR 
into Cs, who show the gradual eyelid closure, 
and Vs, who show the abrupt voluntary 
response form according to the objective cri- 
terion of Hartman and Ross, 1961.) 


The data of the Cs in the verbalizing 
groups appear at the top in Panel A, 
the data of the Cs in the nonverbal 
groups appear in Panel B, and the Vs 
of the nonverbalizing groups appear in 
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Panel C. The data of two groups are 
plotted in each panel. Thus in Panel 
A the dashed upper curve gives the 
percentages of CRs to the wrong 
problems when they were reinforced 
(W+). The responses of the same 
group to the nonreinforced stimulus 
(R—) are shown by the dashed line 
connecting the open circles. Similarly, 
the solid line connecting the solid circles 
graphs the data of Ss for whom the 
right problems were reinforced (R+), 
and the solid line connecting the open 
circles gives the data of the same Ss on 
trials where the wrong problems were 
unreinforced (W—). The other panels 
are similarly coded. 

First of all, Fig. 1 shows that con- 
ditioned discrimination occurs under all 
procedures. The difference between 
number of responses to CS+ minus the 
number of responses to CS— on Trials 
40-60 was highly significant; the over- 
all F (1, 156) = 150.18, error mean 
square = 5.37. The data thus show 
that the concept of rightness or wrong- 
ness, the truth value of the arithmetic 
problems, was a satisfactory differential 
stimulus even though the nominal CS 
differed on every single trial. The 
actual terminal levels of responding are 
similar for the Cs and Vs reported by 
Hickok, Grant, and North (1965) ex- 
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cept that the level of responding to 
CS— is elevated in the present investi- 
gation. a 
It is evident from Panels A and B 
that Ss giving the C form of the CR. 
show better discrimination when wrong 
problems are reinforced and right prob- 
lems are unreinforced than when right 
problems are reinforced and wrong 
problems are not. The discrimination 
between R+ and W-— is particularly 
poor for the nonverbalizing Cs in Panel 
B. The superiority of the W+ R= 
discrimination as opposed to the R+ 
W- discrimination can be tested by 
subtracting the number of CRs to CS— 
from the number of CRs to CS+ on. 
Trials 40-60. This difference between — 
W+ R— and R+ W- discriminations 
was significant, F (1, 108) =: 4.96, p < ` 
.05, error mean square = 4.68. With — 
the V-form responders in nonverbal- 
izing groups in Panel C there appears 
to be no difference between the dis- ` 
crimination when right problems or 
wrong problems were reinforced, and 
this was borne out by statistical tests. 
In Fig. 2 are presented the latencies — 
of eyelid CRs to CS+ for the C-form 
responders of the  nonverbalizing — 
groups, the verbalizing groups, and 
the ninth group, RW. The percentage — 
of CRs falling within each 100 msec. 
interval from 200 to 1,700 msec. is } 
plotted separately for each group. The - 
latencies of the CRs are much longer 
than those ordinarily found in eyelid 
conditioning (Boneau, 1958), but it 
must be remembered that the effective 
differential cue, rightness or wrongness 
of the problem, did not become avail- — 
able to S until almost 1 sec. after the — 
onset of the stimulus. This means that — 
many of the eyelid CRs occurred before 
an explicit verbalization of "right" or 
"wrong" could have been made by S. 
Figure 2 shows that explicit ver- — 
balization tends to postpone the onset — 
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of the CR even though the upper two 
panels of Fig. 1 show that this ver- 
balization does not improve discrimina- 
tion nor does it reduce responding to 
CSa 

The latencies of the incorrect re- 
sponses to CS— did not differ from 
group to group. Generally speaking, 
their latency distributions resembled 
those of the nonverbalizing groups, i.e., 
the latencies of the CRs to CS— were 
somewhat shorter on the average than 
the latencies of CRs to CS+. The 
shorter latency of responses to CS— 
suggests a disinhibiting mechanism. 

Discrimination reversal.—The results 
on discrimination reversal will not be 
presented, because they did not differ 
from other results in the literature 
(Hickok, Grant, & North, 1967 ; Levy, 
Grant, & Clark, 1964), and none of 
the main effects nor interactions of the 
experimental variables were statistically 
significant. There was a highly sig- 
nificant difference between responding 
to CS+ and CS-, F (1, 152)— 
191.94, which simply indicates that the 
discrimination reversal was rapid and 
complete. 

Generalization test.—The percentage 
of eyelid CRs to the stimulus words, 
RIGHT and wrone, on the generaliza- 
tion test trials is given in Table 1. 
Generalization produced fewer CRs 
than the terminal portions of the curves 
in the training and reversal phases of 
the experiment. Significantly more 
CRs were given to the word corre- 
sponding to CS+ than to the word 
corresponding to CS—, F (1, 152) = 
20.47, error mean square = 73. Ver- 
balization of the discriminative cue dur- 
ing conditioning did not facilitate trans- 
fer nor did it appear to serve as an 
effective mediator. Neither did re- 
versal of the discrimination interfere 
with the generalization to the verbal 
stimulus. There was an unexplained 


TABLE 1 


PERCENTAGE OF EvELID CRs or Ss SHOWING 
THE C For oF THE EYELID CR TO THE GEN- 
ERALIZATION Test Wonps, “RIGHT” AND 
"WRONG" THAT CORRESPONDED TO 
THE RIGHT AND WRONG ARITH- 

METIC PROBLEMS USED IN 
THE TRAINING PHASE 
OF THE EXPERIMENT 


Percentage of CRs to: 
Grotp Word Cor- | Word Cor- 
responding] responding 
toCS+ | toCS— 
All Ss 23.2 134 
Verbalizing Ss 24.8 14.6 
Nonverbalizing Ss 20.9 11.5 
Continuous Differential 
Trainin; 25.5 16.1 
Reversed i iterential 
Trainin; 20.4 10.0 
Right Problems last 
inforced 28.7 16.7 
Wrong Problems last 
Reinforced 184 10.5 


Note.—N = 118. 


significant difference, t (116) = 244, 
between responding to the word "right" 
if correct problems had last been rein- 
forced as compared with responding to 
the word “wrong,” when wrong prob- 
lems had last been reinforced. 
Acquisition in the RW  group.—In 
the RW groups Ss were instructed to 
verbalize the rightness and wrongness 
of the 60 right problems and 60 wrong 
problems that were randomly inter- 
spersed, but all stimulus problems were 
followed by the UCS. The acquisition 
curves for CRs to right and wrong 
problems are graphed separately in Fig. 
3, percentage of CRs being plotted 
against successive 10-trial blocks of five 
right and five wrong problems each, 
Three points are noteworthy about 
the acquisition curves in Fig. 3. First, 
the curves are essentially linear; only 
the linear trend component, F (1, 19) 
= 1771, p € .01, error mean square = 
3.09, was significant. Second, the gen- 
eral level of acquisition is much de- 
pressed as compared with typical ac- 
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Fic. 3. Acquisition of the eyelid CR by 
Ss of the RW group who showed the C 
form of the eyelid CR. (Percentage of 
CRs is plotted separately for responses 
to the right and wrong problems for succes- 
sive 10-trial blocks of five right and five 
wrong problems.) 


quisition over 120 reinforced trials. 
This may be accounted for by the use 
of two conditioned stimuli or by the 
long ISI, although the verbalization of 
"right" or “wrong” should have re- 
duced the adverse effect of the poor 
ISI (Kimble, 1962, pp. 40-41). 

The most noteworthy fact, however, 
is that there were more CRs elicited by 
right problems than by wrong prob- 
lems, and this difference was highly 
significant, F (1, 19) = 1477, p < .01, 
error mean square = .65. Thus the 
El'kin result was obtained when right 
and wrong stimuli are randomly inter- 
spersed and all were reinforced. 


Discussion 


The correctness or incorrectness of 
arithmetic problems, their truth value, 
has been shown in this experiment to be 
a good differential conditioned stimulus. 
The acquisition curves to the positive and 
negative stimuli were essentially similar 
to those obtained in other experiments 
with simpler differential stimuli. Fur- 
thermore, there was a significant transfer 
from the arithmetic problems to the words 
“right” and “wrong” that corresponded 
with the CS-UCS contingencies used in 
the training. Thus the results obtained 
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by Naroditskaia and Faddeéva (Ivanoy- 
Smolensky, 1956) with the instrumental 
bulb-squeezing response are paralleled in 
classical conditioning. In the present ex- 
periment, the phenomenon would presum- 
ably be called elective generalization 
through a “lexical integrator” (Ivanoy- 
Smolensky, 1956, p. 164) rather than 
dynamic transmission as in the Narodit- 
skaia and Faddeéva experiments. Curi- 
ously, however, the amount of transfer 
was not increased by having S verbalize 
“tight” or “wrong” during the differential 
conditioning. 

The correctness of a problem was by no 
means equivalent to incorrectness of a 
problem, however. Right problems were 
better conditioned stimuli than wrong 
problems if both were reinforced, which 
agrees with the results reported by El'kin 
(1957). The nonequivalence of right and 
wrong problems was shown in a more 
complicated phenomenon of asymmetry in 
the conditioned discrimination for Ss 
showing the C form of the eyelid CR. 
Here better conditioned discrimination 
was found when wrong problems were 
reinforced and right problems were not 
(W+ R—). The poor R+ W-— dis- 
crimination was caused by lower respond- ' 
ing to CS+, a finding suggesting that 
El'kin's result may not occur with dif- 
ferential training. 

A central problem for any interpreta- 
tion is that it should account, on the one 
hand, for higher responding to continu- 
ously reinforced R+ when interspersed 
with continuously reinforced W-+, and 
must, on the other hand, also account for 
a lower response rate to R+ when it is 
interspersed with an unreinforced W—. 

It seems crucial to any explanation to 
propose that $ has thoroughly learned that 
the rightness or wrongness of arithmetic 
is important. Wrong arithmetic often has 
annoying or serious consequences, and 
viewing wrong arithmetic presumably has 
at least a slightly disturbing effect on S. 
Furthermore, since the problem is never 
corrected, S remains disturbed. In con- 
trast, correct arithmetic problems result 
in closure and presumably leave S in a 
more quiescent state. Interspersing wrong 
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problems among the right problems prob- 
ably preserves their disturbing character. 
Since Pavlov it has usually been observed: 
that better conditioning occurs after the 
initial orienting reflexes and arousal to the 
CS have been habituated. The authors 
conjecture that the arousal and orienting 
to W+ problems does not.habituate, but 
that the arousal and orienting to R4- prob- 
lems does tend to habituate when correct 
and incorrect problems in a mixed series 
are both followed by the same conse- 
quence, the UCS. Under these circum- 
stances the persisting arousal to wrong 
problems would be expected to impair con- 
ditioning to W+ as compared with condi- 
tioning to R+ stimuli, and this could 
account for the results of the RW group. 

In contrast to those of the RW group, 
the differential training procedures are 
more complicated, involve more com- 
plicated psychological processes, and re- 
veal more complicated phenomena. Dif- 
ferential training means that a different 
UCS contingency is associated with each 
of the differential conditioned stimuli, In 
this experiment the reinforcer is a slightly 
aversive corneal air puff. One differen- 
tial CS, W, is disturbing, has usually led 
to unfortunate consequences or punish- 
ment, etc, The other, R, is not disturb- 
ing, has usually led to reward, and rarely 
to punishment. Therefore, in the R+ 
W-— procedures, the CS-UCS contingency 
is inappropriate, or incongruous, or dis- 
sonant. The authors conjecture that the 
congruent CS-UCS contingency leads to 
closure, and that the incongruous CS- 
UCS contingency leads to a persisting 
disturbance that impairs the conditioning 
process. The congruent W+ R— differ- 
ential training would, in contrast, lead to 
better discrimination learning as was 
found in the present experiment. 

The foregoing complicated interpreta- 
tion was admittedly a post hoc formula- 
tion, It is here presented, however, be- 
cause it led to the prediction that a re- 
warding, nonaversive reinforcement pro- 
cedure would reverse the asymmetry of 
the conditioned discrimination found in 
this experiment. The authors have tested 
and confirmed this prediction (Fleming, 


Cerekwicki, & Grant, 1968), so that the 
post hoc interpretation has received suffi- 
cient support to justify presenting it as 
a proposed explanation of the present re- 
sults. Although this appears to be going 
far afield for an interpretation of a condi- 
tioning experiment, it seems impossible to 
ignore the long history of the modes of 
reinforcement associated with truth value 
of arithmetic, and indeed it may be fool- 
hardy to propose that this long history of 
reinforcement has no relevance to the 
particular contingencies presented to a 
college sophomore in one of his required 
psychology experiments. 
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“APPROPRIATENESS” 


OF THE  STIMULUS-REINFORCE- 


MENT CONTINGENCY IN INSTRUMENTAL DIFFERENTIAL 
CONDITIONING OF THE EYELID RESPONSE TO THE 


ARITHMETIC CONCEPTS OF “RIGHT” 
AND "WRONG": 
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Eyelid responses of 72 Ss to 60 simple solved addition and subtrac- 
tion problems, presented visually, were instrumentally conditioned with 
positive reinforcement. Each problem was different and could be 
correctly (R) or incorrectly (W) solved in the stimulus display. 
A within-Ss comparison revealed that significantly more eyelid CRs 
were evoked by the R problems than by the W problems, but there 
was no difference in a between-Ss comparison, where each S re- 
ceived only R or only W problems. With intermittent positive and 
negative reinforcement there was no difference in conditioning to R 
and W problems. 2 additional groups of 18 Ss each were differen- 
tially conditioned to 30 R and 30 randomly interspersed W problems. 
In the R+ W— group, responses to R problems were positively rein- 
forced and responses to W problems were negatively reinforced; in 
the W+ R— group, these contingencies were reversed. Better dif- 
ferential responding was obtained in the R+ W— group, supporting 
the preexperimental hypothesis that CS-reinforcement contingencies 
that were congruent with Ss' past experience would be more effec- 
tive than contingencies that were incongruent or dissonant. All 
groups showed considerable transfer to the words "right" or "wrong" 
as generalization stimuli, but there was no differential generalization 


even in the 2 groups that had received differential training. 


The aim of this research was to 
study instrumental reward training 
and instrumental differential training 
of the eyelid response when the dif- 
ferential stimulus was the truth value 
(correctness or  incorrectness) of 
solved arithmetic problems. 

In an earlier study with classical re- 
inforcement (air puff UCS) of arith- 
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metic problems it was found that bet- 
ter conditioned discrimination was ob- 
tained when the incorrectly added or 
subtracted (wrong) problems were 
reinforced with the puff and the cor- 
rectly added or subtracted (right) 
problems were nonreinforced (W+ 
R—) than when the right problems 
were reinforced with the puff and the 
wrong problems were not (R-- W—) 
(Fleming, Grant, North, & Levy, 
1968). In addition, in the earlier in- 
vestigation, when right and wrong 
problems were both reinforced, RW 
(100), more CRs were evoked by the 
right than by the wrong problems. 
This latter finding confirmed a re- 
port by El'kin (1957) who conditioned 
the eyeblink to true and false state- 
ments. The present experiment fur- 
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ther explored the El'kin finding and 
tested an hypothesis framed to ac- 
count for the asymmetry of W-- R— 
and R+ W— classical differential con- 
ditioning. 

To account for the superiority of 
W+ R- discrimination as compared 
with R+ W- discrimination it was 
proposed that pairing the slightly aver- 
sive air puff with the wrong problems 
was more congruent with S’s previous 
experience with reward-punishment 
contingencies associated with incor- 
rect arithmetic, and that pairing the 
aversive puff with right problems was 
incongruent and dissonant. The ap- 
propriateness or congruence of stim- 
ulus-reinforcement contingencies in 
W+ R-— aversive conditioning was 
hypothesized to account for its supe- 
riority as a differential reinforcement 
arrangement. If this interpretation 
were valid, and if a technique for dif- 
ferential instrumental training of the 
eyeblink could be devised such that 
responses to right problems were re- 
warded and responses to wrong prob- 
lems were punished, this rearrange- 
ment of stimulus reinforcement con- 
tingencies should be congruent, and the 
findings with classical aversive differ- 
ential conditioning should be reversed. 


METHOD 


Apparatus.—The laboratory and the basic 
recording apparatus were the same as those 
described by Cerekwicki and Grant (1967). 
The S, seated in a comfortable ophthalmo- 
logical examination chair in a double sound- 
proof room, faced a white enclosure at table 
height. At the back of the enclosure, 120 
cm. from S’s eyes, was a 45 cm. X 303-cm. 
ground-glass screen. The S wore a foam- 
padded headpiece on which was mounted the 
microtorque potentiometer that picked up 
the eyelid movement. The shaft of the 
potentiometer was coupled to a stiff wire 
that was attached to the right eyelid with 
plastic electrical tape. 

The stimuli were the same arithmetic 
problems employed by Fleming, et al. 
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(1968). They were of the form: 6+3=9, 
or 7—1=4. Half of the problems were 
correctly added or subtracted (R) and half 
were wrong (W). Each problem was pre- 
sented once only for 1,900 msec. by means 
of a strip-film projector with an intertrial 
interval of 15-25 sec. between problems. 
The numerals were approximately 25 mm. 
wide and 32 mm. high with a luminance of 
40 mL. against a constant 4.5 mL, back- 
ground. The total width of a problem was 
approximately 170 mm. so that no eye move- 
ments were required. 

All eyelid responses greater than 1 mm. 
in recorded amplitude were detected elec- 
tronically by a Schmitt trigger that was 
activated by the first time derivative of the 
eyelid movement. Responses with latencies 
from 200 to 1,700 msec. were reinforced, 
Positive reinforcement was defined as a 
500-msec. illumination of a small green pilot 
lamp and the simultaneous click and ad- 
vance of a Veeder-Root electric counter 
labeled “Number Correct.” Negative rein- 
forcement was defined as the similar illum- 
ination of a red pilot lamp and the advance 
of a counter labeled “Number Incorrect.” 
The reinforcement was given 500 msec. after 
an appropriate response, because this delay 
was found by Cerekwicki and Grant (1967) 
to produce optimal conditioning. The green 
and red lamps were mounted left and right, 
respectively, on a small block of wood above 
their respective counters for those Ss who 
received differential reinforcement. A single 
green lamp and single counter were used 
with Ss who received positive reinforcement 
only. The lamps were centered on the floor 
of the enclosure approximately 60 cm, in 
front of S and approximately 20 cm. below 
his line of sight to the ground-glass screen. 

Procedure—All Ss received 60 trials. 
For four of the groups the 60 trials consisted 
of 30 R and 30 W problems in a random 
sequence with the restriction that there be 
no more than 3 consecutive R or 
problems and that successive 10-trial blocks 
contain 5 R and 5 W problems, The 
reinforcement contingencies for these groups 
were as follows: For the RW(100) group, 
appropriate eyelid responses to all problems, 
right or wrong, were positively reinforced. 
For the RW(50) group, all appropriate 
responses were reinforced, half positively 
and half negatively in a random sequence. 
For the R+ W- group, responses to 
problems received positive reinforcement and 
responses to W problems received negative 
reinforcement, and for the W+ R— group 
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these reinforcement contingencies were re- 
versed. 

Two other groups, R(100) and W(100), 
received all right or all wrong problems, 
respectively. For these two groups, all eye- 
lid responses meeting the criterion were posi- 
tively reinforced. 

Following all the conditioning procedures, 
the eyelid responses of Ss were recorded 
on 10 generalization trials, five to the word 
"right" and five to the word "wrong," pre- 
sented in a random sequence. None of the 
generalization trials was reinforced. 

Subjects and instructions—The Ss were 
108 students who volunteered from intro- 
ductory psychology courses at the University 
of Wisconsin. There were 8 men and 10 
women in each group. The instructions, 
read to S before the experiment, were modi- 
fied from those employed earlier (Cerekwicki 
& Grant, 1967; Fleming & Grant, 1966; 
Hansche & Grant, 1965). The critical portion 
for those Ss receiving differential reinforce- 
ment was as follows: 


During the experiment a series of simple 
arithmetic problems will appear on the 
screen in front of you. The answer to 
any problem may be either right or 
wrong. The object of the experiment 
is for you to get the small green light 
in front of you to come on as often as 
possible, and for the red light to come 
on as little as possible. I can’t tell you 
what you have to do to get the green 
light to come on, but if you make the 
correct response when a problem is on 
the screen, the green light will come on 
like this. (Demonstrate green light.) 
If, however, you make an incorrect re- 
sponse, the red light will light like this. 
(Demonstrate red light.) When you do 
not make any response, neither light will 
come on. The counter labeled Number 
Correct will keep track of the number 
of times the green light comes on and 
the counter labeled Number Incorrect 
will keep track of the number of times 
the red light comes on. As I said be- 
fore, I can’t tell you what you have to 
do to get the green light to come on, 
but I can say that it will only come 
on while the arithmetic problem is on 
the screen. 


. For those Ss receiving only positive re- 
inforcement the instructions were the same 
except that references to the red light and 
Number Incorrect counter were deleted. 


RESULTS 


The dependent variable was the 
number of responses that had been de- 
tected by the electronic pickoff unit. 
Hand scoring of several records con- 
firmed that the unit was highly accu- 
rate. The hand scoring also revealed 
that the vast majority of the responses 
were of short latency and of a form 
which would be classified as voluntary 
(Vs) by the Hartman and Ross 
(1961) criterion. This latter finding 
is in agreement with previous research 
from this laboratory (e. g., Cerekwicki 
& Grant, 1967; Fleming & Grant, 
1966). 

The main results of the experiment 
are shown in Fig. 1. Figure 1 is 
divided into four panels. In each panel 
the percentage of CRs is graphed 
over successive 10-trial blocks for dif- 
ferent experimental groups. In Panel 
A a within-Ss comparison between re- 
sponses to correct problems and incor- 
rect problems is shown from the data 
of the RW(100) group. The Ss of 
this group gave significantly more eye- 
lid responses to R problems than to 
W problems, #(17) = 2.70; p< .02, 
SE, 2.22. All statistics reported 
are based on the total number of re- 
sponses during the 60-trial training 
series. 

In Panel B a between-Ss compari- 
son of responding to R problems and 
W problems is permitted by compari- 
son of the acquisition curves of the 
R(100) group with the W(100) 
group. Here there was no significant 
difference in response to right and 
wrong problems, t (34) = 0.66, SEay 
— 3,62. Comparing the curves in 
Panels A and B it appears that inter- 
spersing R and W problems reduces 
responding generally, but particularly 
to the wrong problems as compared 
with stimulus series containing only 
R problems or only W problems. The 
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Fic. 1. Acquisition of instrumentally con- 
ditioned eyelid response, Panels A, B, and 
C, and differentially conditioned response, 
Panel D. (Percentage of CRs is plotted 
for the two stimulus classes, right and wrong 
arithmetic problems, for successive 10-trial 
blocks, In Panel D the positively rein- 
forced class is designated with a plus sign, 
the negatively reinforced class with a minus 
sign.) 


total number of responses given by 
the RW(100) group was significantly 
less than the R(100) group, t (34) = 
2.32, p<.05. The difference in re- 
sponse rate to the R problems between 
the RW(100) group and the R(100) 
groups was not significant, £ (34) = 
1.70, but the difference in response rate 
to the W problems was. significant 
when the response rate to the W prob- 
lems in the RW(100) group was 
tested against the response rate to the 
W problems of the W(100) group, ¢ 
(34) = 3.07, p < .01. 

In contrast to the results in Panel A, 
the within-Ss comparison "of the 
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RW(50) group in Panel C, where R 
and W problems were given 50% 
positive reinforcement and 50% nega- 
tive reinforcement intermittently, 
showed approximately equal respond- 
ing to the two classes of problems, : 
t (17) — 0.99. 

In Panel D the results of the R+ 
W- and the W+ R- groups are 
shown. The solid curves, top and bot- 
tom, show the high degree of dif- 
ferential responding in the R+ W— 
group, and the dashed curves show the 
lower degree of discrimination in the 
W+ R- group. When the discrimi- 
nation scores (total responses to CS+ 
minus total responses to CS—) for the 
two groups were compared, the dis- 
crimination in the R+ W- group 
was significantly better than in the 
W+ R- group, t (34) = 2.71, p< 
02, SEaite = 3.07. 

In the generalization tests where Ss 
responded to the stimulus words, 
“right” or “wrong”, the percentage 
responding ranged 48-70, almost three 
times as high as the generalization 
rate in the classical conditioning ex- 
periment of Fleming et al. (1968). 
Tn no instance, however, was there sig- 
nificant differential responding to the 
two words; even following differential 
training on the R and W problem in- 
stances, the largest ¢ (17) = 1.84. 


Discussion 


The differential instrumental training 
method employed here produced excellent 
differential conditioning of the eyelid re- 
sponse. Under optimal stimulus reinforce- 
ment contingencies (Group R+ W—) Ss 
differentiated better after 60 training trials 
than did Ss showing the voluntary form 
of the eyelid response in the experiment 
of Hickok, Grant, and North (1965). 
Their Ss received 60 trials of classical 
differential reinforcement with simpler 
differential stimuli, In the present ex- 
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periment the terminal response levels 
elicited by positively reinforced problem 
stimuli were also substantially higher than 
those obtained in earlier simple positive 
instrumental training of the eyelid re- 
sponse in this laboratory (Cerekwicki & 
Grant, 1967; Fleming & Grant, 1966; 
Hansche & Grant, 1965). The superior 
conditioning in the present study may be 
attributed to the improved instructions 
which; by specifying that the critical re- 
sponse must be made while the CS was 
still present, were less ambiguous than 
the instructions used in the earlier ex- 
periments. 

Stimulus-reinforcement  congruences.— 
The significantly superior discrimination 
of the R+ W— group (where responses 
to R problems were followed by positive 
reinforcement and responses to W prob- 
lems were followed by negative reinforce- 
ment) over the discrimination of the 
W+ R— group (where responding to 
W problems was followed by positive re- 
inforcement) was predicted by the hy- 
pothesis developed to explain the reverse 
result—better discrimination to W+ R— 
than to R+ W— (Fleming et al., 1968)— 
obtained with classical aversive reinforce- 
ment. -There, it was conjectured that 
pairing a slightly aversive classical air- 
puff reinforcer with the W problems and 
leaving the R problems unreinforced was 
more congruent with S's previous right- 
wrong reinforcement experience than was 
the dissonant relationship of pairing the 
aversive reinforcement with the R prob- 
lems. It was hypothesized that a con- 
gruent or appropriate CS-reinforcement 
relationship would lead to better discrimi- 
nation than an incongruent or dissonant 
one. If this interpretation were valid, 
changing to a scheme of reinforcement 
that was differentially rewarding (posi- 
tive) and punishing (negative), as in the 
present experiment, would lead to the 
prediction that the previous result would 

be reversed; i.e, a R+ W-— procedure 
would now produce better discrimination 
than a W+ R— procedure. This reversed 
result was obtained, so that the congruency 
hypothesis was supported by the data of 
this experiment. 


“Truth value” and effectiveness of the 
conditioned stimulus.—As was the case 
with classical eyelid conditioning (Flem- 
ing et al, 1968), in the present experi- 
ment better conditioning was obtained 
to R arithmetic problems than to W 
problems in the within-Ss comparison of 
the RW(100) group where responses to 
all problems were positively reinforced. 
The authors failed, however, to obtain a 
difference in conditioning to R and W 
problems when they were compared on a 
between-S basis; i.e., when only R or only 
W problems were presented to any given 
S, Groups R(100) and W(100). In addi- 
tion, there was no difference in a within- 
Ss comparison of R and W problems 
when responses to both types were ran- 
domly given 50% positive and 50% nega- 
tive reinforcement, Group RW(50). The 
within-$s comparison with continuous 
positive reinforcement further confirmed 
El'kin's (1957) findings concerning true 
and. false statements as conditioned stim- 
uli. However, apparently El'kin found 
the superiority of true statements is a 
between-Ss comparison, where the pres- 
ent authors found no difference. 

Thus, the present authors find that the 


"within-Ss difference in conditioning per- 


formance related to the truth or falsity of 
the conditioned stimuli is again con- 
firmed, and it remains to comment on 
discrepancies between Elkin's findings 
and the present results comparing per- 
formance to R and W problems in the 
R(100) and W(100) and the RW (50) 
groups. El'kin's Ss were 10-16-yr.-old- 
boys and girls; in this study they were 
college students. It was conjectured that 
these more mature Ss may have had less 
trouble recoding the W problems in a 
homogeneous series of trials than did 
EPkin's youngsters, but that the wrong 
problems in the RW( 100) mixed series 
preserved the negative or disturbing char- 
acter of the W problems when they were 
continually contrasted with R problems in 
the mixed stimulus series. Failure to 
obtain an R-W difference in the within- 
Ss comparison with intermittent positive 
or negative reinforcement, Group RW- 
(50), may simply mean that the "truth 
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value" effect is too fragile to survive the 
complexities of an intermittent reinforce- 
ment schedule. 

Finally, when contrasted with the dif- 
ferential generalization to the words 
"right" and “wrong” following classical 
training that was obtained by Fleming et 
al. (1968), the absence of differential gen- 
eralization following instrumental train- 
ing in the present study was somewhat 
surprising. This is particularly true be- 
cause the Soviet investigators (e.g., Fad- 
deéva, 1956; Naroditskaia, 1956) obtained 
their differential generalization to concept 
class names with an instrumentally condi- 
tioned response. Also semantic general- 
ization seems to be more readily obtained 
with an instrumental than with a classi- 
cally conditioned response (e.g., Feather, 
1965; Levy & Murphy, 1966; Novak & 
Grant, 1965; Razran, 1961). In spite of 
these considerations it would seem that 
differential cognitive generalization is not 
always greater in an instrumental as com- 
pared with a classical conditioning situa- 
tion. 
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INTERACTION BETWEEN SENSORY SPATIAL AFTER- 
EFFECTS AND PERSISTENCE OF RESPONSE 
FOLLOWING BEHAVIORAL COMPENSATION 


J. K. COLLINS anv G. SINGER 
University of Sydney 


The sensory spatial aftereffect, due to prolonged stimulation, and re- 
sponse persistence of behavioral compensation, due to transformation 
of sensory input, were investigated in 3 experiments. In Experi- 
ment I it was shown, by using transformed visual input, that making 
kinesthetic spatial judgments can be regarded as a training period 
in the learning of new kinesthetically controlled responses. The lack 
of visual reinforcement in the postexposure period produces experi- 
mental extinction of these newly learned responses. In Experiment II, 
the act of making a single spatial judgment during the period when 
a kinesthetic system was undergoing adaptation was shown to be 
sufficient to produce long lasting perceptual distortions, i.e, behavioral 
compensation persisting after sensory effects dissipate. In Experiment 


III the reults of the previous experiments were confirmed. 


Recently Day and Singer (1967) 
distinguished between sensory spatial 
aftereffects (SAE) and persistence of 
response (PR) following behavioral 
compensation for spatially transformed 
input. Both effects may occur with 
modifications of the visual input by 
means of optical systems involving 
prisms, lenses, and mirrors. For ex- 
ample, when S views an object through 
a wedge prism it is usually displaced in 
the direction of the prism base. If, 
while viewing the object, S reaches for 
it, he at first reaches too far to one 
side, but after a number of such re- 
sponses he reaches accurately. This 
new direction of responding occasioned 
by a changed visual-kinesthetic rela- 
tionship persists so that when required 
to point straight ahead without vision, 
the direction of S’s pointing differs 
from that prior to viewing through the 
prism. 

In addition, repeated reaching in one 
direction, with or without vision, pro- 
duces kinesthetic adaptation such that 
subsequent pointing straight ahead will 
differ from that prior to reaching in a 
certain direction. The first, PR, is a 


new response to a changed visual- 
kinesthetic relationship and the second, 
SAE, a response concomitant on adap- 
tive change in the kinesthetic system. 
Since responses indexing the SAE may 
be in the same direction as those index- 
ing PR, the two are commonly con- 
founded. 

It was also shown earlier (Day & 
Singer, 1967) that while the SAE dis- 
sipates exponentially and disappears 
after about 2-3 min., the PR, a learned 
response, has a much greater duration 
and is measurable after delays of up to 
15 min. 

The argument underlying these three 
experiments can be summarized as fol- 
lows. Judgments of horizontality 
which can be made visually and/or 
kinesthetically are learned. If S learns 
to make such judgments during a pe- 
riod in which the visual horizontal is 
prismatically transformed, subsequent 
judgments made kinesthetically will dif- 
fer from those made prior to transfor- 
mation. If, however, S learns to make 
judgments of horizontality while the 
kinesthetic system is adapted through 
prolonged stimulation, judgments will 


301 


302 


again differ from those prior to adapta- 
tion. That is, both visual distortion 
and kinesthetic adaptation may lead to 
kinesthetic judgments which persist for 
some time. The sensory aftereffect it- 
self, however, is of briefer duration. 
The purpose of the experiments re- 
ported here was to demonstrate the 
manner in which SAE and PR inter- 
act. The first experiment was designed 
to show that a single response made 
during spatially transformed visual in- 
put and involving kinesthetic stimula- 
tion of insufficient duration to produce 
an SAE, gives rise to marked PR. The 
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second and third experiments were de- _ 
signed to show that adaptive changes 
in the kinesthetic system can themselves 
result in PR. Specifically, Exp. II 
and III demonstrate that a SAE in- 
volving the kinesthetic system can pro- 
duce PR. 


EXPERIMENT I 


The aim of Exp. I was, first, to ob- 
serve the effect of a single kinesthetic 
spatial judgment on PR following be- 
havioral compensation, and, second, to 
observe the effect of the number of 


Fic. 1. Apparatus used in Exp, I in which a double Dove prism system permitted optical 
slant of a physically horizontal bar. (The bar could be judged visually and kinesthetically 


simultaneously, or kinesthetically alone.) 
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judgments on the rate of dissipation of 
PR. 

If a response to a new spatial rela- 
tionship between visual and kinesthetic 
input is learned, it would be expected 
that PR would occur even after a single 
response made during the period of the 
changed relationship (Hamilton, 1964). 
Further, its rate of extinction would 
be expected to vary directly as a func- 
tion of the number of trials (i.e., judg- 
ments) undertaken during the period 
in which only kinesthetically controlled 
responses are made. A single response 
during the period of visual transforma- 
tion would not be expected to result in 
a kinesthetic SAE and, even if an SAE 
did occur during such a brief stimula- 
tion period, its dissipation would be 
very rapid (Singer & Day, 1965). 


Method. 


Apparatus.—The apparatus used has been 
described previously (Singer & Day, 1966), 
and is shown in Fig. l. By changing the 
relative angular orientation of two Dove 
prisms the retinal projection of a horizon- 
tal bar is slanted. The orientation of the 
bar itself could be judged visually by view- 
ing monocularly through the prism (Fig. 1) 
or kinesthetically by moving the hand across 
its edge. The moving hand was, of course, 
also viewed through the eyepiece. A con- 
trol which could be turned with the other 
hand enabled S to adjust the bar in either 
direction so that it appeared to be horizontal. 

Subjects.—There were three groups of 12 
Ss, each group comprising about equal num- 
bers of men and women. All were recruited 
from an introductory course in psychology 
and were alternately assigned to groups on 
reporting. 

Procedure.—Each S made five preexpo- 
sure kinesthetic settings of the bar to the 
apparent horizontal by moving the hand 
across it while the eyepiece was closed. 
During the exposure period which followed, 
the bar was adjusted to the apparent hori- 
zontal while the hand was moved laterally 
across the edge and while it was viewed 
through the optical system which changed 
the visual projection through 20°. All 
groups made one such adjustment during a 
10-sec. exposure period. During the post- 
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TABLE 1 


Means AND SDs, IN DEGREES, OF THE MEA- 
SURE OF THE AFTEREFFECT FOR ALL 
THREE GROUPS IN Exp. I 


Posttest | Posttest | Posttest 
Immedi- | 7 Min. | 14 Min. 
ately after| after after 
Exposure | Exposure | Exposure 
Group 1 (15 
posttests) 
3.39 1.01 1.21 
SD 24 1.6 VR.) 
Group 2 (7 
min. 
M = 3.10 = 
SD > 2.3 = 
Group 3 
(14 min.) 
— — 2:2 
SD = — 2.7 


Note.—Groups 1 and 2 at 7 min.: za 2 = 647, 
p € -05. Groupe 1 and 3 at 14 min.: F(1, 22) = 1,48, 
p.05. 


exposure period Group 1 made 15 nonvistial 
adjustments to the horizontal at 1-min. 
intervals. Group 2 made a single adjust- 
ment after resting for 7 min., and Group 3 
made a single adjustment after 14 min, 
Thus there were three postexposure condi- 
tions: 15 responses at 1-min. intervals, one 
response after a 7-min, interval and one 
response after a 14-min. interval. 


Results 


The means and standard deviations 
for all three groups are shown in Table 
1. The results suggest that a single 
adjustment to the horizontal results in 
a change which lasts for at least 7 min. 
but not as long as 14 min., and that 
kinesthetic judgments made repeatedly 
during the postexposure period accel- 
erate the extinction of the exposure 
induced change. 


EXPERIMENT II 


The aim of the second experiment 
was to examine the effects of kines- 
thetic spatial judgments immediately 
following adaptation of the kinesthetic 
svstem through protracted stimulation. 
` Long-lasting effects from prolonged 
stimulation have been frequently re- 
ported (Gibson, 1933; Heinemann, 
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splinted right arm 


hand plate 


Fic. 2. Apparatus used in the experiment 
showing the method of stimulation with an 
inset at the top showing the various controls. 


1961; Wertheimer & Leventhal, 1958). 
The view adopted here is that these 
prolonged effects are due to a judg- 
mental response being learned during 
the period in which the sensory system 
is transformed through protracted stim- 
ulation. In this sense the outcome of 
transformed input and that transformed 
by devices such as prisms are essentially 
similar. Under both conditions the 
spatial relationships are changed and 
new responses learned. In this experi- 
ment, therefore, it was predicted that 
an induced kinesthetic SAE would re- 
sult in PR if a response was made im- 
mediately after induction but would not 
if made after the dissipation of the 
SAE. 
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Method 


Apparatus—The apparatus which is 
shown in Fig. 2 (Collins, 1967a) was de- 
signed to move the extended arm and hand 
laterally across a slanted surface. The 
movement was achieved by a motor-driven 
carriage to which the splinted arm and hand 
was fixed during the stimulation period. 
Throughout, the stimulation angle was 15°, 
and the carriage moved from side to side 
at 100 excursions/min for 90 sec, Before 
and after this period of stimulation, S ad- 
justed a 14X 3-in. bar to apparent hori- 
zontality. 

Subjects.—There were two groups of 
16 Ss, each group comprising about equal 
numbers of men and women. All were re- 
cruited from an introductory course in psy- 
chology and were randomly assigned to 
groups on reporting. 

Procedure—Each S wore opaque goggles 
throughout and was given five practice trials 
on entering the laboratory to familiarize 
him with the apparatus and to ensure that 
instructions were understood. Both groups 
undertook four pretests followed by 90-sec. 
stimulation. Following stimulation, Group 
1 rested for 180 sec. and then undertook 
the four pretests of the next trial and so on 
for four trials. Group 2 undertook one 
posttest immediately after stimulation, rested 
for 180 sec. and then undertook the four 
pretests of the following trial and so on for 
four trials, Thus, while Group 1 made no 
judgment of horizontality during the period. 
in which the kinesthetic system of the arm 
was adapted, Group 2 did. 

All Ss were told to adjust the bar as 
quickly as possible to the horizontal but not 
to sacrifice accuracy for speed. Before pre- 


TABLE 2 


MEANS AND SDs, IN DEGREES, FROM THE 
True HORIZONTAL or Pretest ADJUST- 
MENTS TO THE HORIZONTAL OVER FOUR 
TRIALS FOR Groups 1 AND 2 


Trials 
1 2 3 4 

Group 1 

M. 0.39 | 0.95 | 148 | 123 

SD 1.87 | 2.03 | 2.29 | 2.97 
Group 2 

M : —0.67 | 0.92 | 1.33 | 1.55 

SD 3.15 | 3.70 | 3.89 | 3.26 


and posttest the bar was set at the true 
ntal by E. An adjustable splint re- 


only the shoulder joint could be used 
the adjustment (Collins, 1967a). 
or half the Ss in each group the 15° 
wulation angle was in one direction (up 


i the other direction. Half the Ss used 
preferred hand and the other half the 


"The means and standard deviations 
"the pretest judgment are shown in 
ble 2 for both groups. 

"rend analysis (Edwards, 1960) on 
of the groups showed that there 
no significant trend in the trial 
eans for Group 1, F (3, 45) = 1.09, 
05, but that the trend over trials 
r Group 2 was significant, F (3, 45) 
8.61, p < .01. 


EXPERIMENT III 


The aim of the third experiment 
to investigate further the sensory 
d learned aspects of prolonged stim- 
ation, especially the extinction of the 
ned effect. Three groups received 
al amounts of sensory stimulation 
(90 sec.). For Group 1 there was no 
CO posttest and it was predicted 
the sensory effect would dissipate 
r 180 sec. In the two remaining 
groups the learned effect was induced 
‘by an immediate posttest. Group 2 
"received multiple posttests in the ex- 
ectation that faster extinction of the 
armed effect would occur compared 
th Group 3, which only received one 
ditional posttest after 3-min. rest. 
the prediction for the extinction of the 
earned effect in the latter two groups 
derived from Exp. I. 
_To demonstrate the wider generality 
these phenomena, a different type of 
esthetic aftereffect (Collins, 1967b; 
turner, 1961) was used. 
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Fic. 3. Apparatus used to produce after- 
effects in the pronation of the forearm. 


Method 


Apparatus.—The apparatus shown in Fig. 
3 (Collins, 1967b) was designed to allow 
different angular displacement of the 
shoulder joint without movement. It con- 
sisted of a handgrip attached to a metal 
shaft which protruded through a wooden 
screen. To the rear of the shaft was 
attached a sharpened metal pointer which 
enabled readings to be taken on a 360° 
protractor scale to the nearest 5°. A screw 
clamp enabled the handgrip to be fixed in 
any position. 

Subjects—There were three groups of 
21 Ss with about equal numbers of men and 
women in each. All were recruited from an 
introductory course in psychology and were 
alternately assigned to groups on reporting, 

Procedure-—Before commencing each S 
was blindfolded by tight-fitting goggles. A 
special splint, which kept S's arm straight, 
but which did not restrict the pronation or 
supination of the forearm, was fitted. Five 
practice trials, setting the handgrip in the 
horizontal plane by pronation of the fore- 
arm, were given to familiarize S with the 
apparatus, All three groups were given 
four pretest adjustments followed by a stim- 
ulation period of 90 sec. while the grip was 
set at an angle tilted 20° from the hori- 
zontal and gripped firmly by S. 
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Subsequent to stimulation Group 1 rested 
for 180 sec. with arms hanging loosely by 
the side. One posttest was given at the 
end of this period. Groups 2 and 3 made 
an immediate posttest adjustment to the 
horizontal Following this, Group 2 made 
further judgments at 60, 120, and 180 sec. 
after stimulation whereas Group 3 rested 
for 180 sec. before making a single further 
judgment. 

The difference between the mean of the 
pretests and each posttest was the measure 
of the aftereffect. The grip was set to the 
true horizontal before the commencement of 
each adjustment. Half the Ss were given 
stimulation with the grip tilted up on the 
right and the other half with it up on the 
left. Half the Ss used their preferred arm 
while the other half used their nonpreferred 
arm. 


Results 


The means and standard deviations 
of the measure of the aftereffects are 
shown in Table 3. 

Since the present study was primarily 
concerned with the dissipation of the 
sensory effect and the experimental ex- 
tinction of the learned effect no elabo- 
rate statistical treatments of the results 
are necessary. It is sufficient to estab- 


TABLE 3 


MEANS AND SDs, IN DEGREES, OF THE MEA- 
SURE OF THE AFTEREFFECT IN THE CONTROL 
AND EXPERIMENTAL GROUPS WITH 
VARYING NUMBERS or Post- 

TESTS AND CONDITIONS 


or DELAY 
Magnitude of Aftereffect 
Imme- | 60 Sec. | 120 Sec. | 180 Sec. 
Group 1 
M x a ztn 0.08* 
SD = = € 3.50 
Group 2 
M 3.84 1.39 | 1.63** | 1.03* 
SD 3.01 2.57 | 3.24 3.90 
Group 3 
M 4.26 — = $:71** 
SD 2.66 = = 3.35 


Note.—The significance of the difference (t tests) of 
each E from zero is indicated. 
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lish the significance of the differences 
of each group from zero in order to 
examine the above effects. An analysis 
of these data showed that the mean for 
Group 3 was significantly different 
from zero after 180 sec., t = 228, p < 
.05, whereas the means of the other two 
groups were not. 


Discussion 


The data from Exp. I can be interpreted 
in terms of a learning process which oc- 
curs during a period in which the usual 
spatial relationship between vision and 
kinesthesis is modified by prismatic trans- 
formation of visual input. The individual 
is trained to make kinesthetic responses 
during a period of exposure to a visually 
transformed field. It matters little for 
present purposes that the kinesthetic ad- 
justment to horizontality during exposure 
is grossly inaccurate in objective terms. 
Vision is clearly dominant under these 
conditions (Rock & Harris, 1967). The 
S adjusts the bar so that it appears vis- 
ually horizontal while moving his hand 
across the bar. This kinesthetic judg- 
ment of horizontality is reinforced by 
visual information. If the exposure pe- 
riod is regarded as a period of training 
during which new kinesthetically con- 
trolled responses are learned (behavioral 
compensation), then the postexposure 
phase can be regarded as an extinction 
period, In this period it is to be expected 
that newly learned responses in the ab- 
sence of visual reinforcement would give 
way to earlier learned and more firmly 
established responses. 

The data from Exp. II show that when 
a sensory modality undergoes prolonged 
stimulation, aftereffects will dissipate 
over a period of 180 sec. provided S is 
not required to make use of the trans- 
formed sensory system in a judgmental 
task. If, however, judgments are made 
during the period of modification of the 
sensory system, the subsequent modifica- 
tion of the judgments will outlast the 
sensory aftereffect. The contention that 
there is a distinction between the purely 
sensory aspects of stimulation and the 
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learned aspects which outlast them is 
therefore confirmed. 

In Exp. III, the results from Group 1 
show the complete dissipation of the 
sensory aftereffect after 180 sec. during 
which S makes no judgments of position 
as in Exp. II. In the other two groups, 
behavioral compensation is induced by the 
immediate posttest and for Group 3 this 
PR of behavioral compensation is still 
significantly different from zero after 180 
sec. For Group 2, PR seems to be re- 
duced through intermittent judgmental 
activity as would be expected from the 
results of Exp. I. PR is significantly 
different from zero after 120 sec., but not 
after 180 sec. 

The experiments thus demonstrate that 
kinesthetic spatial judgments made during 
a period of transformation will affect sub- 
sequent judgments. The transformation 
can be achieved through changing the 
visual input with an optical system (Exp. 
I), or through prolonged stimulation of 
the kinesthetic system (Exp. II and III). 
Both these phenomena have been studied 
separately and extensively for many years. 
The present study indicates that the ef- 
fects are similar, whether transformations 
are produced by prolonged stimulation or 
by input via another sensory system. It 
has been shown that the conditions of ex- 
tinction of PR are different from dissipa- 
tion of the SAE. The generality of these 
conclusions is evidenced by the use of two 
different types of kinesthetic tasks in 
Exp. II and III. 

The functional similarity of effects ob- 
tained under two different conditions of 
transformed sensory input strongly sup- 
ports the hypothesis that the effect ob- 
tained in Exp. II and shown again in 
Exp. III is an aftereffect of behavioral 
compensation, ie. PR. It also explains 
the permanent and semipermanent effects 
which have been reported by Gibson 
(1933), Heinemann (1961), Wertheimer 
and Leventhal (1958), and other in- 
vestigators. These findings have meth- 
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odological implications for the study of 
sensory aftereffects of all types. They 
also extend the generality of behavioral 
compensations and strengthen the view 
that behavioral compensation is an in- 
stance of a learning process. 


REFERENCES 


Coruiws, J. K. Freedom and restriction of 
the limb in the kinesthetic spatial after- 
effect, Aust. J. Psychol., 1967, 19, 63- 
67. (a) 

Corus, J. K. A kinesthetic spatial after- 
effect with pronation of the forearm, Aust. 
J. Psychol, 1967, 19, 117-120. (b) 

Day, R. H., & Srncer, G. Sensory adapta- 
tion and behavioral compensation with 
spatially transformed vision and hearing. 
Psychol. Bull., 1967, 67, 307-322. 

Epwarps, A. L. Experimental design in 
psychological research. New York: Holt, 
Reinhart & Winston, 1960. 

Geson, J. J. Adaptation, aftereffect and 
contrast in the perception of curved lines. 
J. exp. Psychol., 1933, 16, 1-31. 

Hamon, C. R. Intermanual transfer of 
adaptation to prisms. Amer. J. Psychol., 
1964, 77, 457-462. 

Hernemann, E. G. Figural aftereffects in 
kinesthesis: Effects of object width and 
repeated presentations. J. exp. Psychol., 
1961, 61, 51-66. 

Rock, L, & Hamus, C. S. Vision and 
touch. Scient. Amer., 1967, 216, 96-104. 

Sincer, G., & Dav, R. H. Temporal deter- 
minants of a kinesthetic aftereffect. J. 
exp. Psychol., 1965, 69, 343-348. 

Sıncer, G, & Day, R. H. The effects of 
spatial judgments on the perceptual after- 
effect resulting from prismatically trans- 
formed vision. Aust. J. Psychol, 1966, 
18, 63-70. 

TuurNeR, F. K. Kurzzeitige Nacheffekte 
unserer Wahrnehmungen Eine Allgemein- 
und Differentialpsychologische Unter- 
suchung. Beih. Schweiz. Z. Psychol. 
Amwend, 1961, 41, 1-104. 

H., & LEVENTHAL, CERN, 


WERTHEIMER, ) 
Permanent satiation phenomena with 
kinesthetic figural aftereffects. J. exp. 


Psychol., 1958, 55, 255-257. 
(Received June 10, 1967) 


Journal of Experimental Psychology 
1968, Vol. 77, No. 2, 308-313 £ y 


EFFECTS OF SENSORY ADAPTATION ON THE FORM OF THE i 
PSYCHOPHYSICAL MAGNITUDE FUNCTION ; | 
FOR CUTANEOUS VIBRATION? | 


GEORGE A. GESCHEIDER 
Hamilton College 


JOHN H. WRIGHT 
Virginia Polytechnic Institute 


AND 


The form of the psychophysical magnitude function for 60-cps cutane- 
ous vibration was found to be influenced by the state of adaptation 
of the stimulated skin area. Through the use of a combination of 
intensity-matching and magnitude-estimation procedures the relation- 
Ship between sensation magnitude and vibration amplitude was 
examined following 5, 60, 120, 180, and 360 sec. of recovery from 10 
min. of continuous and intense vibrotactile stimulation of the fingertip. 
For each recovery period sensation magnitude on the fingertip was 
found to increase as a power function of vibration amplitude according 
to the general formula Y = K ($ — qo)", where V is sensation magni- 
tude, K is a constant, ¢ is stimulus amplitude, go is the absolute 
threshold, and » is the power exponent. Recovery from adaptation 
had the effect of increasing the value of K and reducing the values 
of $» and m. The results provided further support for the general 
applicability of Stevens' revised power law. 


Stevens (1957) has proposed that 
the form of the relationship between 
sensation magnitude and stimulus in- 
tensity is the power function y = Kọ" 
where y is sensation magnitude, $ is 
stimulus intensity, K is an arbitrary 
constant determining the scale unit, and 
n is the power exponent. The power 
function has the convenient feature of 
becoming a linear function with a slope 
equal to the value of the power expo- 
nent when a logarithmic transformation 
is performed (ie, log y= log K+ 
n log $). Thus, the exponent of the 
power law for a particular set of ex- 
perimental results can be found by plot- 
ting the logarithm of the psychological 
scale values against the logarithm of 
the corresponding stimulus values and 
measuring the slope of the straight line 
fitted to the points. 

In most psychophysical experiments 
y = Kẹ" has been found to be an accu- 
rate statement of the relationship be- 

1 This investigation was supported by Pub- 
lic Health Service Research Grant No. NB- 


07620 from the National Institute of Neuro- 
logical Diseases and Blindness. 


tween sensory magnitude and stimulus 
intensity. For weak stimuli near abso- 
lute threshold, however, this equation 
becomes highly inaccurate. When log 
apparent magnitude is plotted against 
log stimulus intensity the relationship 
is linear only at the higher stimulus 
values. At stimulus values near abso- 
lute threshold the relationship becomes 
concave downward. On the other 
hand, the power law has been found to 
hold for the entire range of perceptible 
stimuli when stimulus values are speci- 
fied in units above the absolute thresh- 
old instead of in units above the zero 
point on the scale of physical intensity. 
The general form of the power law thus 
becomes y = K(¢ — ġo)”, where ġo is 
the value of stimulus intensity at abso- 
lute threshold (see Stevens, 1961). 
This revised statement of the power 
law has also been found to be applicable 
to situations in which the absolute 
threshold is temporarily raised by sen- 
sory adaptation. Stevens and Stevens 
(1963) found that each member of a 
family of psychophysical magnitude 
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functions for brightness, obtained fol- 
lowing adaptation of the eye to a stim- 
ulus of a particular intensity, was linear 
on log-log coordinates only with stim- 
ulus values specified in units above 
absolute threshold. McBurney (1966) 
has recently reported similar results for 
the taste of salt following adaptation 
of the tongue to a solution of sodium 
chloride. 

The present experiment, by exam- 
ining the effects of adaptation on the 
form of the psychophysical magnitude 
function for another sense modality, 
constituted a further test of the general 
validity of the revised power law. The 
relationship between the sensation mag- 
nitude of 60-cps cutaneous vibration 
applied to the index fingertip and vibra- 
tion amplitude was examined after 
varied periods of recovery from 10 min. 
of adaptation to an intense 60-cps vi- 
bratory stimulus. Stevens (1959) has 
shown that the subjective impression of 
60-cps vibration is a power function of 
vibration amplitude. Hahn (1966), 
however, has reported that the absolute 
threshold for 60-cps vibration of the 
fingertip may be raised as much as 16 
db. by 25 min. of adaptation to a 34-db. 
SL stimulus of the same frequency. 
Consequently, it was expected that the 
elevated absolute thresholds, present 
immediately following adaptation and 
following short periods of recovery 
from adaptation, would result in psy- 
chophysical magnitude functions devi- 
ating greatly from the power law at 
low stimulus amplitudes. The present 
experiment sought to determine 
whether such deviations from the 
power law could be eliminated by 
specifying vibration intensity in terms 
of amplitude above absolute threshold. 


METHOD 
Subjects 


Four Hamilton College students, trained 
for several sessions prior to the beginning 
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of the experiment in detecting weak stimuli, 
in equating the sensation magnitude of stim- 
uli on the left and right index fingertips, and 
in magnitude estimating, served as Ss. 


Apparatus 


In an audiometric testing room (Indus- 
trial Acoustics Company) 1-sec. presenta- 
tions of 60-cps vibration were delivered to 
S’s right and left index fingertips through 
a pair of vibrators (Goodmans V-47). 
Pressure of the 4-in, diameter vibrator con- 
tactor on the skin was maintained at a con- 
stant value of 30 gm. by counterweighting 
the shaft on which the vibrator was mounted. 
With his hands resting on his knees S ex- 
tended his index fingertips to make contact 
with the vibrators. The electrical signals 
were generated by a beat-frequency audio os- 
cillator (General Radio), the output of which 
was applied to an electronic switch (Grason- 
Stadler) which permitted the turning of 
stimuli on and off with a gradual 25-msec. 
rise and decay time to eliminate perceptible 
switching transients. The output of the 
electronic switch became the input of a 
phase-shifter (Grason-Stadler) set at 0° 
phase difference. One of the outputs of the 
phase shifter was applied to the left vibrator 
and the other was applied to the right vi- 
brator. Each of the two outputs of the 
phase shifter was also applied to a decade 
attenuator box to permit regulation of 
stimulus intensity in 1-db. steps, The 
attenuator box was followed by an imped- 
ance-matching transformer and, lastly, a 
vibrator. To permit continuous variation 
of stimulus intensity in the left channel a 
T-pad was inserted in the circuit imme- 
diately before the left vibrator. 

In order to determine $’s absolute 
threshold a graphic recording attenuator 
(Grason-Stadler) was inserted after the 
attenuator box in either the left or right 
channel. The S tracked his threshold by 
pressing and releasing a footswitch to reg- 
ulate stimulus intensity. As long as the 
switch remained depressed intensity level 
decreased continuously at the rate of 1 db/ 
sec, When the switch was released inten- 
sity level increased at the same rate. The 
continuous recording of S's intensity ma- 
nipulations by the graphic recording attenua- 
tor yielded precise estimates of each S’s 
absolute threshold for vibrotactile stimula- 
tion on the fingertip. 
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Procedure 


Stage l.—Specified in microns peak-to- 
peak displacement of the vibrator contactor, 
the absolute threshold for vibrotactile stim- 
ulation on the fingertip was measured for 
each S as a function of recovery time fol- 
lowing 10 min. of adaptation to 60-cps vi- 
bration of 465 microns peak-to-peak ampli- 
tude. With both his left and right index 
fingertips in contact with the vibrators S; 
by manipulating the footswitch, tracked his 
threshold continuously during the 5-min. 
period immediately following the termina- 
tion of the adapting stimulus. He was in- 
structed to depress the footswitch until the 
vibration on his fingertip could no longer be 
felt and at that point to release the foot- 
switch until the stimuli became barely per- 
ceptible, at which time he was to depress 
the footswitch again until the stimulus dis- 
appeared. To prevent the occurrence of 
additional adaptation during the 5-min. 
threshold-tracking period the stimuli were 
presented during this period as a train of 
1-sec. periods of vibration interspersed with 
lesec. periods of no vibration. Recovery 
from 10 min. of adaptation of the right index 
fingertip was measured separately for the 
right and left index fingertips. To evaluate 
(a) the possible effects of the static pres- 
sure of the vibrators during the adaptation 
period and (b) S's adjustment to the thresh- 
old-tracking task during the recovery 
period, thresholds were also measured on 
the left index fingertip following a 10-min. 
period in which § sat with both fingers in 
contact with inactive vibrators. Fach of 
these three conditions was employed once 
in random order within each of six sessions. 
A smooth curve was fitted to the recovery 
curve formed by the kymographically re- 
corded changes in each S’s threshold inten- 
sity manipulations as a function of time 
since the end of adaptation. 

Stage 2—The method of magnitude esti- 
mation was used to determine the relation- 
ship between sensation magnitude and 60-cps 
vibration amplitude on the left index finger- 
tip. By applying a .0241-v. signal across 
the vibrator (about 25-db. SL) for 1 sec, 
a standard stimulus having an arbitrarily 
designated sensation magnitude of 10 was 
presented to S' on each trial Tt was S’s 
task to assign a number to the sensation 
magnitude of a comparison stimulus, which 
on half of the trials preceded and on half 
of the trials followed the standard stimulus. 
He was instructed to assign a number to 
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Fic. 1. Absolute threshold for vibration 
of the right index fingertip (filled circles) 
and for vibration on the left index finger- 
tip (open circles) as a function of recovery 
time following 10 min. of intense stimula- 
tion of the right index fingertip. (The 
filled-squares curve shows the absolute 
threshold on the left index fingertip mea- 
sured after a 10-min. control period in which 
S sat with both fingertips in contact with 
inactive vibrators.) 


the comparison stimulus reflecting the ratio 
between the sensation magnitudes of the 
comparison and standard stimuli, Each S 
made 12 such magnitude estimations of the 
eight values of the comparison stimulus pro- 
duced by applying voltages of .0020, .0038, 
-0068, .0124, .0241, .0400, .0730, and .1300 to 
the vibrator. A psychophysical magnitude 
function relating the apparent magnitude of a 
vibration to voltage at the vibrator was 
plotted for each of the four Ss. T 

Stage 3—The voltage across the left vi- 
brator required for S to report equal ap- 
parent magnitudes of vibration on the left 
and right index fingertips was measured 5, 
60, 120, 180, and 360 sec. following 10 min. 
of adaptation of the right index fingertip to 
60-cps vibration of 465 microns peak-to-peak 
amplitude. At each of the five recovery 
times following the 10-min. adaptation period 
a l-sec. vibratory stimulus of a particular 
intensity was presented to S’s right index 
fingertip. Each of the five presentations of 
this stimulus to the previously adapted right 
index fingertip was followed by continuous 
low amplitude vibration on the left finger- 
tip which E gradually increased by changing 
the setting of the T-pad. The rate of ampli- 
tude increase and the starting amplitude 


were varied from trial to trial The S 
id "stop" when the sensation on his left 
lex fingertip was as intense as the sen- 
sation on his right index fingertip had been. 
The voltage across the left vibrator was then 
measured by a V.T.V.M. (Hewlett-Pack- 
d 400D). Following a 10-min. adapta- 
m period the same right-vibrator ampli- 
‘tude (either 5.8, 8.3, 11.7, 147, 18.5, 262, 
—46,5, 827, or 147.0 microns) was employed 
at each of the five recovery times. The 
‘order in which these nine right-vibrator 
amplitudes were employed over 45 adapta- 
‘tion-recovery periods in 15 sessions was ran- 
‘domly determined for each S. The sensory 
“magnitudes on the right index fingertip pro- 
duced by each of the nine vibration ampli- 
‘tudes at each of the five adaptation-recovery 
‘times were computed separately for each S 
_ by converting the left-vibrator voltages re- 
quired to achieve sensation matches on the 
left and right index fingertips into the sen- 
sation units of the psychophysical magni- 
tude functions for the left index fingertip 
‘obtained in Stage 2. 


RESULTS AND DISCUSSION 


Effects of Adaptation upon Vibrotactile 
Thresholds 


Tt is evident from Fig. 1 that dis- 
placement amplitude of the right 
vibrator at absolute threshold was a 
negatively accelerated decreasing func- 
tion of recovery time following adapta- 
tion of the right index fingertip. The 
absolute threshold on the left index 
fingertip following adaptation of the 
right index fingertip showed little 
change over the recovery period. Fur- 
thermore, absolute thresholds on the 
left index fingertip following adaptation 
of the right index fingertip were only 
slightly higher than thresholds on the 
left index fingertip measured after a 
- l0-min. control period in which S sat 
- With both index fingertips in contact 
with inactive vibrators. The absence 
Of contralateral adaptation effects for 
Vibration of the fingertips contrasts 
Sharply with the small but significant 
= amounts of contralateral masking on 
the fingertips reported by Sherrick 
(1964). Sherrick’s findings suggest 
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that the locus of cutaneous masking is 
within both the peripheral and central 
nervous systems. It was therefore sur- 
prising not to find evidence for central 
neural interaction between contralateral 
fingertips in the present experiment. 
Perhaps contralateral masking of the 
fingertips is not masking at all in the 
usual sense of the word but reflects in- 
stead an inability to attend to a weak 
cutaneous stimulus presented to the in- 
dex fingertip while a more intense stim- 
ulus is presented concurrently to the 
opposite index fingertip. Therefore, 
with extensive training the amount of 
contralateral cutaneous masking might 
diminish to zero. 


Effects of Adaptation upon Psycho- 
physical Magnitude Functions for Cu- 
taneous Vibration 


The relationship between sensation 
magnitude and voltage across the left 
vibrator is shown in Fig. 2 for each of 
the four Ss. Each of the functions is 
linear on logarithmic axes for all but 
the weakest stimulus intensities. The 
exponents of the power functions esti- 
mated from the slopes of the functions 
presented in Fig. 2 vary between .60 
and .76, which is somewhat smaller 
than the exponent of .95 reported by 
Stevens (1959) for 60-cps vibration of 
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Fic. 2. Sensation magnitude as a func- 
tion of voltage across the left vibrator for 
each of the four Ss. 
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Fic. 3. Sensation magnitude as a func- 
tion of vibrator amplitude on the right in- 
dex fingertip 5, 60, 120, 180, and 360 sec. 
after termination of the adapting stimulus. 


the fingertip. As is typical of psycho- 
physical magnitude functions obtained 
by magnitude estimation and fraction- 
ation techniques at stimulus values near 
absolute threshold, each of the functions 
deviates from a power function by be- 
coming concave downward. 

The effect of adaptation upon the 
form of the psychophysical magnitude 
function for cutaneous vibration is seen 
in Fig. 3. The family of curves pre- 
sented in Fig. 3 represents the relation- 
ship between sensation magnitude and 
the amplitude of vibration applied to 
the right index fingertip at 5, 60, 120, 
180, and 360 sec. following termination 
of the adapting stimulus. Each data 
point is the arithmetic mean of 20 
sensation magnitudes, 5 for each of the 
four Ss, derived by converting into 
sensation units the voltages of the left 
vibrator required to match the subjec- 


TABLE 1 


EFFECTS OF RECOVERY FROM ADAPTATION ON 
PARAMETERS OF THE PSYCHOPHYSICAL 
MAGNITUDE FUNCTION FOR 
CUTANEOUS VIBRATION 


Recovery Time K » go 
5 sec .52 64 6.00 
1 min -16 60 3.77 
2 min. 1.05 58 2.70 
3 min 1.30 56 2.27 
6 min. 1.60 54 2.00 
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tive intensity of left-vibrator stimulation 
to that of right-vibrator stimulation, 
The shapes of the sensation mag- 
nitude functions for each S were es- 
sentially the same and variability be- 
tween Ss as represented by the standard 
deviation for the means of the four Ss 
was typically only about 10% of the 
grand mean. It is readily apparent 
that each of the magnitude functions in 
Fig. 3 deviates from linearity by be- 
coming concave downward for the 
range of stimulus values near the abso- 
lute threshold temporarily raised by 
adaptation. 

In their attempt to apply the revised 
power equation y= K($ — $$)" to 
psychophysical magnitude functions for 
brightness, Stevens and Stevens (1963) 
found that the values of K, o, and n 
were affected by the conditions of light 
adaptation of the eye produced by 
varying the intensity of the adapting 
stimulus and the length of time in dark 
following light adaptation. As seen in 
Table 1 these parameters of the psycho- 
physical magnitude function for vibra- 
tion of the fingertip were affected in a 
similar fashion by adaptation. The de- 
creases in the value of ¢, over the 5- 
min. recovery period in Fig. 1 are pre- 
sented again in Table 1 to facilitate 
comparison with the changes in the 
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Fic. 4. Sensation magnitude as a function 

of vibration amplitude on the right index 

fingertip specified in microns above absolute 
threshold 5, 60, 120, 180, and 360 sec. after 
termination of the adapting stimulus. 
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values of n and K as functions of re- 
covery time following adaptation. As 
was the case with vision, (a) both $, 
and » decreased with increases in re- 
covery time following adaptation, and 
(b) the magnitude of this decrease was 
considerably greater for $, than for m. 
In addition, recovery from adaptation 
produced rather large increases in the 
value of K, as was also the case for 
scales of brightness. 

To test the applicability of Stevens" 
revised power law y = K($ — $y)” to 
psychophysical magnitude functions for 
cutaneous vibration of the fingertip fol- 
lowing recovery from adaptation, sen- 
sation magnitude was plotted as a func- 
tion of vibration amplitude in microns 
above absolute threshold (i.e., $ — $o 
rather than in microns above the zero 
point on the scale of physical intensity 
as in Fig. 3). The appropriate value 
of d, for each adaptation-recovery time 
(Fig. 1 and Table 1) was subtracted 
from the corresponding values of $ for 
that particular adaptation-recovery 
time. It is clearly apparent from exam- 
ination of Fig. 4 that this transforma- 
tion of the stimulus scale produced a 
linear growth, on log-log coordinates, 
in sensation magnitude as a function of 
vibration amplitude over the entire 
range of stimulus intensities. Further- 
more, this transformation successfully 
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converted psychophysical magnitude 
functions obtained at the peak of adap- 
tation (5-sec. recovery time) as well 
as those obtained after complete re- 
covery from adaptation (6-min. re- 
covery time) into power functions. 
These findings clearly extend the gen- 
erality of the revised power law for 
describing the relationship between 
physical properties of the stimulus and 
psychological magnitude to the sensory 
modality of cutaneous vibration, 
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CHILDREN'S MOTOR AND VERBAL RESPONDING 
IN A TWO-CHOICE SITUATION? 
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64 preschool children were run in a 2-choice prediction situation, with 
probability of the more frequent event — .6, and divided into 4 groups 
on the basis of grade (nursery or kindergarten) and response mode 
(motor or verbal). Each S guessed, either by stating verbally "red" 
or "green" or by placing a card in a slot, which light would come 
on next, each time a buzzer sounded, for 200 trials. All groups made 
about the same proportion of predictions of the more frequent event. 
The number of response alternations was significantly greater for 
kindergarten Ss. The mode of response (motor vs. verbal) did not 
significantly affect response alternation. 


Two experiments reported by Bo- 
gartz (1965) evaluated sequential de- 
pendencies in children's probability 
learning. In both studies, his 4- and 
S-yr-old Ss were required to predict 
the next event on each of more than 
100 trials. The major difference in 
results was that Exp. I yielded con- 
siderable response alternation while 
Exp. II did not. Bogartz recognized 
that other considerations complicated 
the interpretation (i.e., difference in 
reinforcement procedures, and partici- 
pation of most of Exp. II Ss in Exp. 
I), but suggested that the difference 
in response mode produced the differ- 
ence in alternation tendency. In the 
first study the predictive response in- 
volved pushing a lever; in the second, 
the response was a verbal guess of 
marble color. 

Other researchers have obtained al- 
ternation with both verbal responses 


1This research was supported by funds 
from National Institutes of Health Grant 
MH-03803-06. (The study represents the 
Senior Honors project of the first author, 
conducted under the direction of the second 
author. The authors wish to thank Jerome 
L. Myers for his comments on the manu- 
script, and the Research Computing Center, 
University of Massachusetts, for assistance 
in analyzing data.) 


(Kessen & Kessen, 1961; Offenbach, 
1965) and motor responses (Craig & 
Myers, 1963; Derks &  Paclisanu, 
1967), using kindergarten Ss. In 
addition, the Kessens and Derks and 
Paclisanu have observed little alter- 
nation, in fact strong perseverative 
tendencies, among 3- and 4-yr.-old Ss. 

Because of the equivocal data cur- 
rently available on motor and verbal 
prediction, and because of the perti- 
nence of response mode differences to 
the generality of theorizing about chil- 
dren's sequential decision behavior, the 
present experiment was designed to 
directly compare the effects of both 
motor and verbal responding on choice 
strategies of nursery and kindergarten 
children. 


METHOD 


Sixty-four Ss attending nursery school 
(mean age = 4 yr., 2 mo.) and kindergarten 
(mean age=5 yr, 8 mo.) in Amherst, 
Massachusetts ? were tested individually and 
predicted which of two lights (one red, one 
green) would come on next, when a buzzer 

?'The authors wish to thank Margaret 
Fernandes of the University of Massa- 
chusetts Nursery School, and Claire Drew 
of the First Congregational Church Kinder- 
garten, Amherst, Massachusetts, for their 
cooperation in providing facilities and sub- 
jects for the study. 
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CHILDREN'S RESPONSE ALTERNATION 


sounded on each of 200 trials. They were 
assigned to four groups of 16 Ss each, half 
male, half female, differing with respect to 
the type of response required for prediction, 
and grade level. The Motor groups were 
given a set of 200 blank cards on the table 
before them, and handed a few at a time 
to keep in their hands. They were asked 
to place a card in the slot below the light 
they predicted would go on mext. The 
Verbal groups were instructed to make their 
predictions by saying out loud "red" or 
"green" each time the buzzer sounded. All 
Ss were told the light sequence was pre- 
arranged. Test sessions consisted of 200 
trials with one light occurring on 60% of 
the trials (*=.6), and the other light on 
40% of the trials, according to a standard 
sequence generated by Nicks’ (1959) sys- 
tem of randomizing runs throughout the 
series. For half of the Ss, the more frequent 
event was red; for the other half, it was 
green. The position of the colored lights 
was also counterbalanced. 


Resutts AND DISCUSSION 


Marginals and some sequential de- 
‘pendencies.—The percentage of A, re- 
sponses (predictions of the more fre- 
quent event, E,) is about the same for 
all four experimental groups, approxi- 
mately 54%, indicating no maximiza- 
tion behavior even for the nursery 
school groups, with a m (probability of 
the more frequent event) of .6. These 
data are consistent with that of Craig 
and Myers’ (1963) .6r kindergarten 
group and Kessen and Kessen’s (1961) 
7m nursery school groups, but differs 
from the optimizing found by Derks 
and Paclisanu’s (1967) .75 nursery 
school group. Apparently, with young 
children a r of at least .75 is necessary 
in order for Ss to clearly differentiate 
the more frequent event. 

The probability of an A, response 
was also plotted as a function of run 
length, i.e., the number of consecutive 
E,’s preceding the prediction. These 
Curves showed a little sensitivity on 
the part of Ss to run length character- 
istics; the Kindergarten-Motor group 
essentially replicated the Craig and 
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TABLE 1 


MEAN PROPORTION OF A; PREDICTIONS 
AFTER Four CLASSES or OUTCOMES 


P(A1) AIE: | Asi | AiEs | AE: 
Nursery Motor 61 | .52 | .56 | .50 
Nursery-Verbal 359.153] 555: 1:260. 


Kindergarten-Motor | .56 | .81 | .30 | .52 
Kindergarten-Verbal | .34 | .74 | .27 | .74 


Myers (1963) .67 kindergarten group, 
and the data were almost identical. 

In addition, the mean proportion of 
A, responses after four classes of out- 
comes (A,E,, A,E,, A,E,, A,E,) were 
computed, and are summarized in Table 
1. Both kindergarten groups made an 
A, response after A, more often than 
after A,, regardless of event occurrence. 
This tendency was not observed in the 
nursery groups. A third finding was 
that the first-order conditionals for the 
Kindergarten-Motor group are much 
more similar to those of the fourth 
graders than to the kindergarten Ss in 
the Craig and Myers (1963) study. 
This may be related to the one pro- 
cedural change: Ss in this study were 
run individually, while in the Craig and 
Myers study they were run in groups of 
four. With less distraction, and self- 
pacing, Ss in this study performed like 
an older group run under more con- 
fusing circumstances. 

Alternations.—No differences ap- 
peared between groups in event alter- 
nations: all Ss predicted an event dif- 
ferent from the last event approxi- 
mately 4896 of the time. 


TABLE 2 


Proportion OF RESPONSE ALTERNATION FOR 
Eacu Brock or 100 TRIALS FOR EACH 
EXPERIMENTAL GROUP 


Blocki | Block2 
Nursery-Motor T AS 
Nursery-Verbal 54 48 
Kindergarten-Motor P. m 


Kindergarten-Verbal 
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Table 2 presents the proportion of 
response alternations for each block of 
100 trials for each experimental group. 
Kindergarten Ss, for whom 68% of all 
responses are alternations, alternate to 
a greater extent than nursery school Ss 
for whom 50% of all responses are 
alternations; the relevant F (1, 56) — 
22.6, p « 001. Since the Nursery 
School group was comprised of both 
3-yr.-olds and 4-yr.-olds, correlations 
between age in months and amount of 
response alternation were computed to 
determine if, within this nursery school 
range, any relation existed between age 
and response alternations. Correla- 
tions of near-zero indicate little differ- 
entiation within the 3- to 4-yr. age 
range. Apparently, the strong alterna- 
tion tendency develops after 44 yr. of 
age. The Blocks effect is significant, 
F (1, 56) = 10.2, p < .01, reflecting 
the consistent decrease in response al- 
ternation during the second 100 trials. 
No significant sex or interaction effects 
were noted, 

The response mode (motor vs. ver- 
bal) did not lead to significant differ- 
ences in response alternation behavior ; 
F (1, 56) = 2.6, p < 20. However, 
it is interesting to note that with both 
kindergarten and nursery groups, the 
verbal task produced more response al- 
ternation than the motor task. This 
finding is in agreement with Offen- 
bach's (1965) data for kindergarten 
children, but is contrary to Bogartz’ 
results. He found much alternation 
with the first, motor task requirement, 
and no alternation with the second, ver- 
bal task. However, 12 of his 16 Ss in 
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the second experiment had participated 
for 210 trials in the first experiment, 
The significant decrease in alternation 
observed by Bogartz is consistent with 
that obtained over time in all four 
groups in this study. A further test of 
the hypothesis that Bogartz’ effect is 
merely due to practice would be to re- 
verse procedures and require Ss to 
make a verbal response on early trials 
and a motor response on later trials, 
In any case, the results of this experi- 
ment strongly suggest that the alter- 
nation tendencies of kindergarten chil- 
dren observed in sequential decision 
situations are a reflection of cognitive 
style or preference rather than specific 
motor habit. 
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An experimental procedure called the observation technique was 
introduced for probability learning (PL). It consisted of presenting 
outcomes to S without a signal and without requiring him to make 
a prediction response. A number of questions concerning this pro- 
cedure were investigated and the results were compared with the 
traditional PL technique. The experimental procedure was to use 
the observation technique as training and to test with 500 trials 
of traditional PL. The rate at which events were presented (2/sec 
vs. 1 every 4 sec.) did not produce any difference in asymptotic 
responding on test trials. 100 observation trials appeared equivalent 
to 100 regular trials. 2,000 observation trials did not significantly 
change the asymptotic responding. A group which did not receive 
events during the test trials did shift asymptotic responding signif- 
icantly. The asymptote decreased from .84 to about the probability 
of the reinforcing event, 81. An analysis of recency curves showed 
no differences in procedures, but sequential statistics did produce 


significant differences among the various groups. 


In traditional two-choice probability 
learning (PL), S is required to predict 
which of two events, E, or E,, will 
occur on each of a series of trials. The 
S generally responds by pressing one 
of two buttons, A, or A,, corresponding 
to the E, and E, events. In noncon- 
tingent PL, the event sequence is deter- 
mined before the experiment is run and 
the events occur independently of S’s 
responses. The probability of the E, 
event on trial n, P (Ern), is indepen- 
dent of n and is denoted as v. The 
basic dependent variables are various 
probability measures on Ss' responses 
on each trial. 

Although the authors' main interest 
lies in presenting a different experi- 
mental approach to PL, the study 

1This paper is based on the PhD dis- 


sertation of Arthur S. Reber. The research, 
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should have theoretical implications. 
Theoretical considerations of PL have 
been largely of two types. In the ma- 
jority of cases, the analysis has been 
according to Statistical Learning The- 
ory (SLT, Estes, 1964), which is basi- 
cally a simple conditioning model. Most 
other approaches can best be character- 
ized as cognitive; eg., Galanter and 
Smith (1958), Goodnow and Pettigrew 
(1955), and Feldman (1963), but 
there is little to tie these approaches 
into a single theory. There are a great 
many experimental studies which indi- 
cate weaknesses in both these ap- 
proaches, as the articles by Estes 
(1964), Anderson (1964), and Rose 
and Vitz (1966) have summarized, 
The experimental paradigm of the 
research reported here was motivated 
by the following conceptualization of 
learning in PL. Two processes are 
assumed: a perceptual learning process 
and a decision process. In perceptual 
learning, S is thought of as developing 
an internal model of the sequence of 
events. The model is assumed to de- 
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velop implicity, i.e., S gains information 
about the structure of the event se- 
quence simply by exposure to it. The 
emphasis in perceptual learning is on 
the event sequence and the manner in 
which it is perceived by S. No assump- 
tions are made about S's explicit pre- 
diction behavior or his hypothesis be- 
havior. The point is that, regardless of 
S's behavioral strategies, certain facts 
about the event sequence are going to 
be learned. 

The decision process is assumed to 
be cognitive in nature. The S is viewed 
as having a decision-making process 
which modifies the response sequence. 
The decision rule is assumed to be 
under the control of instructional and 
payoff variables. Changes in the de- 
cision-making process are assumed to 
affect only the rule for responding and 
not the perceptual model. The authors 
feel that by starting with this two-stage 
model it might be easier to account for 
(a) the conditioning aspects of PL 
which are supported by SLT, and (b) 
the cognitive aspects which seem to ap- 
pear in a good many studies. 

The authors do not call these ideas a 
new “theory” since they are much too 
inexact to meet the strict criteria used 
today in defining a theory. Nor is it 
believed that all the ideas are new. The 
hope is that a few notions about percep- 
tual learning have been emphasized to 
the extent that some qualitative deduc- 
tions about PL can be made and tested. 


METHOD 
Apparatus NES 


Each S sat in front of a standard Hum- 
phreys’ board. On the board were two hori- 
zontally spaced event lights, two correspond- 
ing response buttons, and a centrally lo- 
cated signal light. Three or four Ss were 
run at one time, each in an isolated booth. 
The event lights were counterbalanced 
across Ss but all Ss within a subgroup of 
three or four received the same sequence of 
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events. The E sat at a master control panel 
behind Ss and programmed the event lights 
and recorded Ss’ responses. 


Design 


There were seven groups of Ss. Groups 
500-PL, 500-FO, and 500-SO were run first; 
Groups 100-FO and  2000-FO second; 
Group INST third, and Group NE last. All 
Ss received a training conditio: and a test 
condition. The training condition con- 
stituted the independent variable in every 
case. For all groups except Groups 500-PL 
and INST, the training condition was a 
series of observation trials. For all groups 
except Group NE, the test condition was a 
500-trial block of ordinary PL run at 4 
sec. per trial. 

The series of observation trials used in 
the training conditions was run as follows: 
On each observation trial, S r>ceived only 
the event lights. No signal light occurred 
and S did not predict which light would 
come on. Two presentation rates for the 
event lights were used. One was a control 
rate which matched the rate of event pre- 
sentation under the regular PL procedure, 
ie, one event every 4 sec. The second 
was a very fast rate, two events per second. 
Finally, the INST group received as train- 
ing only a special set of instructions, and the 
500-PL group received regular PL trials. 

The test condition was as follows: A 2- 
sec. red signal cued S to respond. This was 
followed by one of the event lights which 
was on for 2 sec. The Ss were required to 
respond during the 2 sec. that the signal 
light was on. Group NE had a different 
test condition in that the reinforcing event 
light did not occur. 

The groups run and the rationale for 
each were: 

Group 500-PL.—Simple probability learn- 
ing—a control group. Training: 500-PL; 
Test: 500-PL. (N=15) à 

Group 500-FO.—Training: 500 trials of 
FO (fast observation—2/sec). . Test: 
500-PL. (N=16) Perceptual learning i5 
assumed to be a codification of environ- 
mental events which does not require the 
presence of some measurable response. The 
point here is that an S learns a great deal - 
about the sequence of events without neces- 
sarily making an overt (or perhaps even a 
covert) prediction response. In terms of 
the PL paradigm, this suggests that simple 
observation of the event sequence is suf- 
ficient to produce PL. 


EVENT OBSERVATION IN PROBABILITY LEARNING 


Group 500-SO.—Training: 500 trials of 
SO (slow observation—1 every 4 sec.). 
Test: 500-PL. (N=14) The main point 
of this procedure was to provide an exact 
duplicate of the control group except for the 
overt prediction response. 

Group 100-FO.—Training: 100 trials of 
FO: Test: 500-PL. (N —15) The interest 
was in answering the question: Is an obser- 
vation trial equivalent to a prediction trial? 
If they are indeed equivalent, the data from 
the initial test trials should correspond to 
those observed in Group 500-PL at Trial 
101. 

Group 2000-FO.—Training: 2,000 trials 


of FO. Test: 500-PL. (N=14) The 
effect of extensive observations was investi- 
gated. 


Group INST.—Training: Explicit instruc- 
tions about the event probabilities. The Ss 
were told that one event light would come 
on about 800 times in 1,000 trials, while 
the other would occur only about 200 times. 
Test: 500-PL. (N=14) It is important 
to ascertain the degree of similarity between 
PL based ou observation of events and PL 
based on prediction of events. One possi- 
bility, e.g., is that the observation of events 
produces only knowledge of the relative fre- 
quency of the events, knowledge which could 
more easily be imparted to S by instructions 
informing him of their relative. frequencies. 
Group INST provides a check on this pos- 
sibility. 

Group NE.—Training: 500-FO. Test: 
500 trials with signal light but without 
events and presented at the regular rate, dur- 
ing which Ss made prediction responses. 
(N —14) The point here concerns the role 
of continued presentation of the events once 
S has established his internal model for the 
environmental events. How stable is his 
Scheme for the events without continued sup- 
Dort from the event sequence itselí? 

The event sequences were generated using 
the RAND table of random numbers with 
no restrictions on the randomization. All 
event sequences had a P(E)-—7-.8 
There were four sequences for training and 
Íour sequences for testing which were 
counterbalanced across groups, The groups 
did not differ in event sequences except inso- 
far as four Ss were not always run on each 
Sequence in each group. 

_ The instructions to Ss receiving observa- 
tion trials were as follows: 
This is a very simple cxperiment with 

two separate parts. During Part 1, 

the right-and left-hand lights in your 
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booth will light up at regular intervals. 
Your job is simply to sit and observe 
the sequence of lights. During Part 2 
of the experiment, your task will be 
to guess which one of the lights will 
come on before it lights up. If you 
think the right-hand light will come on, 
you will press the right-hand button; 
if you think the left-hand light will 
come on, you will press the left-hand 
button. Now we will run Part 1— 
remember, just observe the sequence of 
lights. 

Before proceeding with Part 2, Ss were 

read the following instructions, which were 


essentially those read at the outset to the 
500-PL control group. 


At the beginning of each trial, the red 
signal light in the middle of the board 
will come on. This is the signal for you 
to make your guess as to which light 
will follow. You will have about 2 sec. 
in which to make your prediction. 
Please respond while the red signal 
light is on. Are there any questions? 


Questions were answered by rereading the 
relevant parts of the instructions. The Ss 
in Group NE were read instructions inform- 
ing them of the absence of the event lights. 


Subjects 


The 102 Ss were distributed within groups 
as indicated in the design section. They 
were selected from an undergraduate intro- 
ductory psychology course which required 
their participation in a psychological experi- 
ment as part of a course requirement. 


RESULTS 
Response Proportions 


Groups 500-PL, 500-SO, and 500- 
FO.—The proportion of A, prediction 
responses per block of 50 trials was 
obtained for each S. Figure 1 presents 
the standard probability-learning curve 
with the 500-FO and 500-SO groups 
beginning at Trial Block 11 after sim- 
ply observing events for 500 trials. An 
analysis of variance on the last 500 
trials involving groups and sequences 
as between-Ss factors, and trial blocks 
as a within-Ss factor is appropriate for 
these data. Group 2000-FO was in- 
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cluded in the analysis for completeness. 
In order to handle the unequal cell 
frequencies, an unweighted means anal- 
ysis was run (Winer, 1962, Pp. 241- 
244). In a number of these analyses, 
many very small Fs were observed, 
which raised some question concerning 
their correctness. Also, the lack of 
error terms meant that sequence differ- 
ences could not be tested and the only 
interaction that could be tested was the 
Trials X Groups interaction. There- 
fore, a Groups X Sequence X Trials 
analysis of variance was run; it yielded 
only a significant trials effect, F (9, 
396) = 2.49, p < .005. To meet as- 
sumptions more exactly, two analyses 
in which one S was randomly discarded 
from each sequence with four Ss were 
also run. Similar results were ob- 
tained. These analyses seem perfectly 
correct. A significant trials effect can 
be attributed only to fluctuations due 
to event sequence changes. There is 
no hint of any consistent increase or 
decrease in these curves. It is also 
important to note that the asymptotes 
reached by these curves are not at a 
level matching r, but instead are .844, 
.841, and .828 for groups 500-PL, 
500-FO, and 500-SO, respectively. 
Groups 100-FO and 2000-FO.—Fig- 
ure 2 presents the learning curves for 
the 100-FO and 2000-FO groups along 
with the 500-PL group for comparison. 
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Fic. 1. Mean Learning curves for the 
500-PL, 500-FO, and 500-SO groups. 
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| 3245 6 r 8 GOH 2 i5 M I5 6 7 V 20 
sotaat BLOCKS 
Fic. 2. Mean Learning curves for the 
500-PL, 100-FO, and2000-FO groups. (The 
2000-FO group is plotted at Block 11 in- 
stead of Block 41.) 


Group 100-FO is plotted from Block 3 
owing to the 100 observation trials; 
Group 2000-FO is plotted from Block 
11 simply for convenience in graphing. 
The analysis of variance discussed 
above included the 500-trial test block 
of the 2000-FO group, and it did not 
differ significantly from the other ob- 
servation groups. Thus, 2,000 trials 
of observation do not seem to appreci- 
ably affect the asymptote. Our interest 
in the 100-FO group was to determine 
whether 100 trials of observations were 
equivalent to 100 trials of traditional 
PL. Therefore, an analysis of vari- 
ance of the same design as the one 
above was run, involving Trials 101— 
500 for both groups. In this case, two 
Fs were significant: trials, F (7, 154) 
—5.23, p<.001, and Trials X Se- 
quences, F (21, 154) = 2.34, p < .005. 
Reanalyses with proportional cell fre- 
quencies insured by randomly elimi- 
nating one S from 4-S subgroups did 
not alter the results. Thus, as can be 
seen from Fig. 2, some learning still 
occurred, but there is no evidence that 
suggests a difference in response pro- 
portions between the 100 trials of ob- 
servation and 100 trials of regular PL. 
The interaction effect with sequences 
is not surprising since sequence effects 
are usually found and are indeed ex- 
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pected according to most theoretical 
interpretations. 

Group INST.—The instruction 
group was introduced to insure that 
observations were effective in a way 
not equivalent to simple knowledge of 
the event probabilities. Figure 3 pre- 
sents the comparison between the 500- 
PL and the INST groups. The in- 
structions about the event probabilities 
had no noticeable effect. This observa- 
tion is confirmed by an analysis of 
yariance in which only the trials effect 
was significant, F (9, 189) = 20.30, p 
<.001. The Trials X Sequences and 
Trials X Sequences X Groups interac- 
tions were marginally significant, but 
the latter did not hold up when re- 
analyzed with proportional cell frequen- 
cles. In Fig. 3, the first block of 50 
trials was divided into two blocks of 
25 trials to indicate the close agree- 
ment between the curves even on early 
trials, Clearly, probability learning in- 
volves more than the learning of prob- 
abilities. 

Group NE.—Finally, the authors 
were interested in the effect of the 
events themselves during the 500 test 
trials. The NE group did not receive 
these events and simply predicted with- 
out feedback. Previous studies have 
indicated that “blank” trials are either 
neutral or produce a tendency for re- 
Sponse proportions to regress to a 5 
level (Greeno, 1962; LaBerge, Greeno, 


: sotaat BLOCKS 
Fic. 3. Mean Learning curves for the 
500-PL and INST groups. 
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Fic. 4. Mean Learning curves for the 
500-FO and NE groups. (The NE group 
received no reinforcing events during these 
500 trials.) 


& Peterson, 1962). Figure 4 shows the 
response proportions for Blocks 11-20 
for Group NE compared with the 500- 
FO group—its comparable control 
group. The first block of 50 trials was 
divided into two blocks of 25 trials to 
show that, initially, the response pro- 
portions are almost identical. Then the 
NE group declines to an asymptote of 
about .81. The difference between the 
groups is significant, t(9) = 4.62, p < 
005. 


A question of some importance is 
whether every S declined or whether 
only some declined and/or whether 
some increased. To check on this, the 
authors computed the variance of the 
frequency of A, responses in each block 
and tested whether the variance in- 
creased significantly. The slope of the 
regression line across trial blocks was 
284, which was not significant, 1(8) = 
87, p=.5. Since the response prob- 
ability declined, a slight increase in var- 
iance is expected. Rough calculations 
indicate this to yield a.slope of .1 in the 
variances. Although this does not ac- 
count for the slope of .28 observed, it 
does argue further against any tendency 
for Ss’ response proportions to spread 
during the change in asymptotic re- 
sponding of the NE group. It seems 
safe to conclude that the overshooting 
ovserved in all groups except the NE 
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group has something to do with the 
presence of the events themselves. A 
possible explanation for this is dis- 
cussed below. 


Response Variances and Distributions 


Although there was no significant 
asymptotic difference among the 500- 
PL, 500-FO, 500-SO, and 2000-FO 
groups, it seemed possible that they 
might differ in variance. By Trial 101, 
Group 100-FO was almost asymptotic 
and so it was also included in a Bart- 
lett’s test for homogeneity of variances. 
For this test the total number of A, 
responses for each S was used. The 
result was not significant, x? (4) = 
5.24, p < .30. 

The frequency of Ss making a total 
number of A, responses within 100 trial 
blocks was tallied. Comparisons be- 
tween groups on the distributions yield 
no obvious discrepancies. Unfortu- 
nately, frequencies were too low to 
justify any statistical tests. All in all, 
it must be concluded that the groups 
do not seem to show pronounced dif- 
ferences in the shape of these distribu- 
tions, 


Runs Analysis 


Table 1 presents the proportion of 
A, and A, responses following runs of 
E, and E, events, respectively. The 
authors have carried the runs of E, 
events to a length of 8 and combined all 
larger runs into the proportion under 
the heading k — 9--. The run propor- 
tions are computed separately for the 
first 500 training: trials and the last 500 
test trials. Because the observed pro- 
portions are based on very different 
frequencies, it is not reasonable to as- 
sume homogeneity of variance for vari- 
ous k values. Therefore, the correct- 
ness of a parametric statistic would be 
questionable. Instead of parametric 
statistics, the authors ran simple sign 
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lies significantly above the 500-PL and 
INST groups for the training trials, but 
is not significantly different on the tes 
trials. The 500-PL and INST training 
trials are significantly different from all 
other groups on the test trials. The p. 
test-trial comparisons between groups 
do not differ, except that the 2000-FQ. 
groups is significantly different from _ 
the 500-FO group. It appears that, 
after 2000 observation trials, there isa — 
slight increase in response proportions - 
following long runs over the propor- 
tions observed with only 500 observa- 
tion trials. i. 

A closer look at the run proportions — 
indicates that over the run lengths 5-9, — 
the E,-run curves are not consistently 
rising; they seem to have stabilized at 
around .85-.90. While this may not be — 
negative recency, it is certainly not. 
positive recency. On the other hand, 
the A, proportions following E, runs - 
rise fairly consistently, indicating posi- — 
tive recency, at least over the neces- 
sarily limited range considered. The 
last points of these curves, when k = 3, 
are based on very low frequencies and - 
are unreliable. From the overall sim- b 
ilarity between the 500-PL test trials — 
and test trials after observation trials, - 
it seems safe to say that simple event 
observation produces the same changes — 
in responses to runs of events as does - 
ordinary prediction probability learn- - 
ing. 


Sequential Statistics 


The authors looked primarily for — 
group differences in the statistic P 
(Ay mal Enas E52; Ermi, Ex); Chi- 
square contingency tables were run for - 
each event sequence. Thus, when con- 
sidering the three groups for which 
there are learning data, Groups 500-PL, - 
INST, and 100-FO, there were 16 x^s 
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to compute each with df — 2. Table 2 
presents the average proportion of A, 
responses following the 16 event se- 
quences, the frequencies upon which 
the proportions are based, and the y’s. 
In general, x? was significant in cases 
with the largest frequencies. The 
power of these tests is very high. For 
example, the largest x? was for the 
Sequence 1111, which had a total fre- 
quency of 9001. 

A similar analysis for the four groups 
at asymptote, 500-PL, 500-FO, 500- 
SO, and 2000-FO, yielded five signifi- 
cant cases. Although no significant dif- 
ferences in average response propor- 
tions or in variances were obtained, the 
groups do differ in sequential statistics. 


Response Patterns for the NE Group 


Beginning with every fifth trial, runs 
of three responses were classified ac- 
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cording to the eight possible patterns: 
A,A,A,,...,A,A,A,. Collapsing over 
the fourth and fifth trials of each five 
trial block was necessary owing to the 
very low frequencies for patterns with 
four and five A, responses. The data 
in Table 3 are presented as divided into 
five blocks of 20 five-trial sets each, 

There is no evidence that the pat- 
terns change over trials. A contin- 
gency table composed of five-trial blocks 
and the eight patterns yielded x? (28) 
= 36.64, p < .20. Individual Ss, how- 
ever, do differ in their response pat- 
terns. Table 4 presents the frequencies 
for the 14 Ss and the five patterns 
with sufficient frequency. The result 
is significant, x? (52) = 247.07, p< 
.001. 

Finally, the authors were interested 
in the shape of the resulting distribution 
of responses, collapsed over Ss and 


TABLE 1 


RESPONSE PROBABILITY AS A 


Function or Run LENGTH 


k value 
Group 0 1 | 2 | 3 4 | 5 | 6 7 8 94 
P(A, nperi | Et, 427 * Es, n) X 1,000 
Training Trial | 
500-PL had 751 | 856 | 825 | 798 | 793 | 795 | 787 | 821 802 | 827 
100-FO 764 | 828 | 880 | 888 | 900 | 892 | 876 | 848 860 | 881 
INST 788 | 834 | 827 | 782 | 799 | 845 | 770 759 | 839 | 827 
Test Trials 
500-PL sig | 857 | 825 | 849 | 843 | 868 | 846 | 876 | 849 
500-SO fea 837 | 891 | 849 | 891 | 920 | 889 | 889 866 | 797 
500-FO 821 | 855 | 868 | 861 | 865 | 853 | 842 882 | 839 B 
2000-FO 808 | 840 | S88 | 900 | 902 | 889 | 861 897 | 856 | 84 
P(As siii Es n * *Ei,^) X 1,000 

Training Trial: 
500-PL, d 181 | 230 | 412 | 727 

100-FO 280 | 228 | 274 | 416 

INST 239 | 277 | 395 | 273 
Test Trials 
500-PL 152 | 248 | 233 | 667 
500-SO 208 | 225 | 243 | 455 
500-FO 206 | 151 | 091 | 250 
2000-FO 209 | 128 | 270 | 455 
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TABLE 2 
SEQUENTIAL STATISTIC P(Ay,n44|Einys Ej n2, Ego ii, Emn) 
Event Pattern (foie EE Tr 

i j k m Freq. Prop. xt Freq. Prop. x 

2 2 2 2 60 517 4.89 42 476 4.10 

2 2 2 1 160 .900 4.72 102 .843 2.78 

2 2 1 2 182 .835 1.08 133 .122 5.54 

2 2 1 1 679 884 5.31 488 883 1.45 

2 T 2 2 178 .843 3.30 127 .716 2.71 

2 1 2 1 750 «784 0.60 548 .123 11.21* 

2 1 1 2 700 .806 5.60 543 .159 0.41 

2 1 1 1 2935 .875 5.40 2270 846 11,215 

1 2 2 2 161 .195 5.02 112 .643 1.85 

1 2 2 1 704 .845 12.08* 509 .864 1.24 

1 2 1 2 146 .854 1.00 545 .820 1.16 

1 2 1 1 2960 .837 1.17 2322 .834 7.01 

1 1 2 2 683 .804 13.24* 494 .165 1.11 

1 1 2 1 2940 .197 29.70** 2312 .164 50.53** 

1 1 1 2 2940 .795 25.26** 2266 .160 6.64* 

1 1 1 1 12454 .859 11.91* 9001 .832 86.21** 
Sum 130.28** 195.16% 


a Bioercon Test Trials df = 3, sum of Test Trials df = 18; on Training Trials df = 2, sum of Training Trials 


* p < 05: 7,82 required for 3 df, 5.99 for 2 df. 
** ) € 001: 16.27 required for 3 df, 13.82 for 2 df. 


trials. The observed distribution was 
tested against a binomial distribution 
with a p = .806, the average P(A,) 
over the entire 500 test trials. The last 
three columns of Table 3 present the 
theoretical and observed distribution, x? 
(6) — 9.58, p < 20. Thus, the col- 
lective response data for Group NE do 
not differ from a binomial distribution. 


Discussion 


One clear result of these experiments 
is that, for PL to occur, it is not neces- 
sary that S make a prediction response. 
Furthermore, since it seems reasonable to 
assume that Ss do not verbalize about 
the sequences in the same way when 
events are presented at a fast rate of 2/sec 
as they do when they are presented at the 
slow rate of the traditional prediction ex- 


TABLE 3 
TRIAL BLOCK BY PATTERN FREQUENCIES For REsPONsES or NE GROUP 

Trial Block 1 

a d 

Pattern tct nre es 

1 2 3 5 

Meet oot 149 140 159 157 161 766 547 733 
dU 2 34 32 22 26 35 149 106 176 
$1209 39 39 36 38 21 173 124 176 
ETEY 11 11 12 7 if 48 .034 42 
2 35 32 26 32 35 160 414 176 
IMAS 8 15 13 6 5 47 .034 42 
25281 4 7 11 11 11 44 .031 42 
292/92 0 4 1 3 5 13 .009 10 
Total 280 280 280 280 280 1400 1400 
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TABLE 4 
SUBJECT BY PATTERN FREQUENCIES FOR REsPONsES oF NE Group 
Pattern 
s 

111 112 121 1228 211 212* 2128 2225 

1 41 11 11 4 22 4 
2 43 20 17 1 12 4 3 0 
3 36 15 12 10 13 5 8 1 
4 TI 2 12 1 8 0 0 0 
5 29 13 23 6 16 6 4 3 
6 45 12 16 7 11 6 3 0 
7 62 7 14 1 11 2 2 1 
8 53 10 9 8 10 7 2 1 
9 57 11 9 1 16 3 3 0 
10 66 10 14 0 6 1 2 1 
11 72 7 8 2 7 3 1 0 
12 68 8 9 2 12 1 0 0 
13 28 17 15 5 15 5 12 3 
14 89 6 4 0 1 0 0 0 
Total 766 149 173 48 160 47 44 13 

Ji 


a These patterns were combined for the chi-square analysis. They had a sum of 152, 


periment, it may be that the verbalizations 
are not relevant to the basic learning 
which occurs in PL. The procedural 
difference between Groups 500-FO and 
500-PL should be emphasized. The 
former received 500 observatoin trials in 
about 4 min, while the latter took nearly 
40 min. for the same number of trials. 
Yet both began the second 500 trials at an 
asymptote of around .84. The practical 
implication of this procedure for studies 
primarily interested in asymptotic results 
is obvious. 

Another result of theoretical interest 
was the change in asymptotic responding 
of the NE group. Previous studies in- 
volving trials without feedback (so called 

blank" or Ep trials) have suggested that 

nonreinforced trials change probability in 
the direction of .5 (Anderson & Grant, 
1957; Greeno, 1962; LaBerge, Greeno, 
& Peterson, 1962). Greeno concluded 
that the reduction depends on the relative 
frequency of events and not on the re- 
Sponse proportions per se. These studies 
did not present a single block of trials 
with E, events but rather mixed Eg with 
E, and E, events. An explanation then, 
is needed which depends upon the change 
in stimulation which results from intro- 
duction of Eg events. 


In their paper on the component model, 
Burke and Estes (1957) noted that one 
could take into account the stimulus ef- 
fects of events on previous trials. In par- 
ticular, two learning curves are devel- 
oped, one which considers P(Ajs|[Eini) 
and one which considers P(Aj,n|Ea,n-1)- 
In the analysis, the reinforcing effect of 
the conditional event on the previous trial 
is taken into account and leads to asymp- 
totic response probability, P(A4) equal to 
a+6n(1—2)(2m—1) in the noncon- 
tingent case with independent event se- 
quences. With a 6 value of around .4-.5, 
the amount of overshooting observed here 
can be accounted for. In the NE group 
the event on the previous trial is not pres- 
ent and, without the differential effects of 
stimulation from the previous trials, 
P(A,) might shift to . There are two 
problems with this idea: One is that the 
0 required for the asymptote is too large 
for the sequential effects observed. The 
second is that the curve should drop 
sharply if the stimulation from previous 
reinforcing events is withdrawn—there 
should not be the gradual decline which 
was observed. 

One further result concerning the 
asymptotic responding should be men- 
tioned. The effects of 2,000 trials of ob- 
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servation were not significantly different 
from those of only 500 trials of observa- 
tion. This might imply that a relatively 
stable asymptote has been reached. If 
this implication is to be extended to 
traditional PL, then the asymptotic results 
observed seem to be more stable than any 
previously observed. Some of the prob- 
lems mentioned by Estes (1964) concern- 
ing asymptotic behavior may be bypassed 
by the observation technique. 

The runs analyses do not show negative 
recency on early trials as strongly as other 
investigators have observed. However, at 
asymptote, the authors’ results do not seem 
to be very different from those of other re- 
searchers such as Friedman, Burke, Cole, 
Keller, Millward, and Estes (1964), Jones 
and Myers (1966), Edwards (1961), 
Witte (1964), Nicks (1959), and Derks 
(1963). If there is any difference, it 
seems to be in the over-all level of re- 
sponding, which in the authors’ experi- 
ments was nearer the asymptotic prob- 
ability itself. A comment about these 
studies is in order since all Es have not 
interpreted their results in a way con- 
sistent with the authors’ results, In the 
first place, since the curves start with 
P(Ain1|E2n), the initial point is usu- 
ally below P(A;). Derks (1963) has 
pointed out that this initial point increases 
as a function of trial blocks to a level equal 
to the response probability after long runs. 
This initially low point means that the 
curve must rise and, since the graphs pre- 
sented usually stop with runs of about 
Length 5, the curves seem to be continu- 
ally rising. However, close analysis of 
the last few points in Table 1 shows that 
they in fact begin to level off. Witte’s 
(1964) curves also illustrate this. In a 
comparison of his results with the pre- 
dictions of the linear operators, the data 
curves are much too shallow. The re- 
cency curves of Edwards, Nicks, Derks, 
and Witte show a similar leveling-off, 
and, when carried out beyond runs of 
Length 4, the Friedman et al. curves do 
not increase monotonically. Finally, a 
recent study by Myers, Gambino, and 
Jones (1966) presents data very com- 
parable to the authors’. One can only 
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conclude that, for runs analyses, the 
fects of 500 trials of observation are abo 
as other investigators have observed. 
regular PL studies. 

One theoretical problem with these re- 
sults is that of making a consistent 
planation for the probability learning o 
the one hand and the nonmonotonic n 
cency curves on the other, 


the length of runs experienced on earlier 
trials does not seem appropriate since, 
the Friedman et al. experiment, long ri 
were not experienced with a very hig 
frequency. But one must note that in 
Friedman et al. study, Ss did obsery 
a variety of values and thus were exposed. 
to a large variance in the run lengths. 
Gambino and Myers ( 1966) specifically 
investigated run-length variance and found. 
that it tended to increase positive recency, 
Such a result may have occurred in the 
Friedman et al. study. What is compar- 
able among all these experiments is the 
total experience, i.e, at least 500 trials. 

One problem with the authors' approach. 
is that they are trying to demonstrate that 
there is no difference between the observa- 
tion procedure and the prediction proce- 
dure. They are trying to accept the Null 
Hypothesis, a statistical procedure which 
can emphasize sloppy experimental pro- 
cedure and the use of very weak-powered 
tests (see Grant, 1962). However, there 
is a reverse problem, which is to use such 
powerful tests that effects accounting for - 
very little of the variance are significant, ' 
Unfortunately, x? tests do not allow a 
measure of the strength of the association. 
In the sequential analysis, the significant 
X*s were those with the largest Ns. It 
has been demonstrated that the observa- 
tion and prediction procedures produce 
some differences, but the authors have no 
measure of the size of these differences. | 
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IMAGES AS MEDIATORS IN ONE-TRIAL PAIRED- 
ASSOCIATE LEARNING: 


II: SELF-TIMING IN SUCCESSIVE LISTS + 


B. R. BUGELSKI 


State University of New York at Buffalo 


College student Ss learned 6 successive lists of 10 words in nu- 
merical order using a mnemonic system designed to employ imagery 


as a mediational device. 


Experimental Ss learned the lists better 


and remembered significantly more of the associations after all lists 
had been learned than did control Ss. Experimental Ss reported use 
of strong imagery with characteristics of color, size, direction, posses- 
sion, etc. Experimental Ss showed little evidence of interference; con- 
trol Ss showed a significant pattern of rises and falls in their learning 
suggesting a strong interference effect. 


In a previous study (Bugelski, Kidd, 
& Segmen, 1967) it was demonstrated 
that Ss using a mnemonic system were 
able to learn the ordinal position of 10 
common words quite effectively in one 
learning trial when the response words 
were paired with words rhyming with 
the numerals 1-10. The Ss could use 
the system, however, only if given 
enough time which appeared to be 
somewhere between 4 and 8 sec. The 
system employed was presumed to in- 
volve the use of imagery by Ss although 
no conclusions about imagery were 
drawn in that study. The present 
study was undertaken to develop lines 
of investigation that might support the 
hypothesis that images can function as 
mediators in paired-associate learning. 
The basic strategy was to have Ss 
learn a large number of paired-associ- 
ates (in successive lists of 10 words) 
using the same mnemonic system as 
was employed earlier and to interrogate 
Ss after learning. In this study Ss 
controlled the intralist learning time as 
an incidental aim was to determine if 
such personal control would demon- 


1This study was partially supported by 
the Center for Human Learning and Mem- 
ory Processes at the State University of 
New York at Buffalo. 


strate the same kinds of constancy 
found in ordinary  paíred-associate 
learning (Bugelski & Rickwood, 1963). 
Reports of experimental and control Ss 
about how they went about the task 
were expected to reveal systematic dif- 
ferences, It was also expected that 
differences in the pattern of learning 
successive lists would develop in the 
two groups of Ss from which inferences 
about imagery might be drawn. Spe- 
cifically, it was predicted on the basis 
of pilot data that the use of a mnemonic 
system was less subject to retroactive 
inhibition as well as to proactive inter- 
ference, Such a prediction would also 
be justified on the basis of the Barnes 
and Underwood (1959) study which 
demonstrated superior retention for 
mediated learning. 


METHOD 


Subjects—Thirty-six college seniors, ma- 
joring in psychology, were recruited from a 
course in the history of psychology to serve 
as Ss (18 experimental, 18 control). Since 
it was not a volunteer situation, all members 
of the class participated. The students were 
told that the exercise would be pertinent 
to their study of the history of psychology. 
The experiment was completed before the 
topics of association or imagery were studied. 

Procedure.—As in the prior study (Bugel- 
ski et al, 1967), the experimental Ss were 
asked to memorize a nursery jingle (one- 
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TABLE 1 
MNEMONIC List AND THE Six Womp Lists USED IN Tus STUDY 


List 2 


Mnemonic List 1 List 3 List 4 List 5 List 6. 
1-bun car dime neil — | bird shi wheel 
2-shoe flag barn Kite chair Bean cigar 
3-tree clock nail mask brick snake fan 
4-door book rat table gun bed face 
5-hive knife ball radio cat purse broom 
6-stick shirt lamp stamp bicycle glasses candle 
T-heaven drum rug cup ring medal desk 
8-gate scissors apple window hat hammer umbrella 
9-wine pen telephone soldier couch sword rose 

10-hen ski airplane key magazine box garage 


bun, two-shoe, three-tree, four-door, five- 
hive, six-sticks, seven-heaven, eight-gate, 
nine-wine, ten-hen). They recited the jingle 
until they were correct five times in suc- 
cession. They were then told that the jingle 
could aid memorization of lists of 10 words 
through the simple practice of imagining the 
first word in a list as related in some way 
to a bun, perhaps as a sandwich filler, the 
second as connected with a shoe, etc. They 
were then told that a list of 10 words would 
be read to them one at a time and that they 
were to signal the E as soon as they had 
found an image of the named object relating 
the object to the corresponding jingle word. 
The E recorded the time of each signal. 
The Ss were told to take as much time as 
they felt necessary to "make a connection." 
The self-controlled timing was introduced to 
see if Ss would actually take more time than 
the earlier experiment had demonstrated to 
be sufficient, as well as to be sure that Ss 
had a chance to use the system. It had been 
noted in other studies that Ss sometimes 
complained that they were still trying to 
form an image when time had been called. 
In the earlier study, in fact, it was dis- 
covered that a professional night-club enter- 
tainer tried to use the already well-practiced 
system but, being restricted to a short in- 
terval, failed to do an adequate job. 

As soon as S signalled that the tenth 
Word was suitably “imaged,” E tested S 
by asking for the 10 words, previously 
arranged in the list, in random order. The 
Ss were scored two points for a correct 
response and one point if they recalled a 
Word in incorrect numerical order. Later 
examination of the data indicated that the 
granting of single points did not materially 
affect the outcomes; hence, all of the data 
Teported here are based on the two-point 
System. While this procedure may appear 


awkward, computer analysis made it feasible 
to test the dual scoring system and to ascer- 
tain its adequacy, 

As soon as the test on the first list was 
completed, E read off another list, tested 
S again, and proceded to the next list for 
six successive lists. After all six lists had 
been tested, E asked S to recall all of the 
Number 1 words, all of the Number 2 
words, etc. to determine how many of the 
60 words were retained. The recall scores 
were tabulated by list. When the total re- 
call was completed, E asked S to describe 
how he had learned each item by going 
through the six lists again. For example, 
E would ask S: “How did you know that 
the first word was ‘dime’ in the second list?" 
(Ss were interrogated only on their cor- 
rect responses). À 

Control Ss learned the same six lists with 
the same free timing procedure and were 
tested in the same way. Nothing was said 
to them about mnemonics, systems, imagery, 
or anything other than the simple instruc- 
tion to learn the 10 words according to 
their serial positions. 

The six lists were counterbalanced in 
presentation by the use of a simple Latin 
square to eliminate any possible list- diffi- 
culty differences. All the words were com- 
mon English nouns, mostly of one syllable 
as demonstrated in Table 1. 


RESULTS 


The data consist of time scores, indi- 
vidual list-recall scores, total-recall 
scores, and protocols, and are presented 
in that order. 

Time scores.—lndividual Ss were 
free to take all the time they wanted 
for each number-word association, As 
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TABLE 2 


MEANS AND SD FOR THE EXPERIMENTAL (MNEMONIC) Group (N —18) AND THE CoNTROL 
Groups (N=18) IN ORIGINAL LEARNING AND SUBSEQUENT RECALL AFTER 
LEARNING ALL Six Lists 


1 2 3 4 5 6 
Group 
M |SD| M |SD| M'|SsD| M|sSD| M |sD| M | sp 
Original Learn- 
ing List 
Experimental | 18.00 | 2.57 | 16.00 | 5.36 | 17.11 | 3.38 | 17.00 | 2.93 | 17.56 | 2.12 16.78 | 2.84 
Control 12.00 | 4.45 | 10.00 | 5.57 | 14.33 | 4.91 | 12.44 | 4.93 | 14.33 | 4.77 | 14.44 | 5.51 
t 4.96*** 3,29*** 1.98**** SIR Ead 2.62*** ns 
Recall from all 
Six Lists 
Experimental | 11.00 | 4.24 | 9.89 | 5.68 | 12.67 | 4.65 | 13.00 | 4.41 | 13.89 | 4.31 | 15.67 | 3.45 
Control 3.00 | 2.59} 3.56 |3.73| 5.11 |4.61 | 5.89|3.18| 8.89 | 4.61 | 11.89 | 4.63 
t 6.83* 3.96* 4.89* 91954 3:361 2.78** 
*p = 001. 
En 
bL = 01. 
wet) = 0S, 


expected wide individual differences 
were displayed. The Ss took from 15 
sec. to 200 sec. per list of 10 words in- 
dicating that some associations were 
formed within 2 sec., others taking as 
long as 20+ sec. There were no sig- 
nificant differences in mean scores be- 
tween experimental and control Ss, 
each group averaging about 7 sec/ 
item (Control = 6.11; Experimental = 
8.11). The median scores suggested 
that the experimental Ss took slightly 
less time than did the controls, but the 
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learning (OL) of each of the six lists and 
total recall scores for experimental and con- 
trol groups. 


wide individual differences precluded 
any meaningful tests of significance. 
The experimental group displayed some 
tendency for learning more effectively 
with longer intervals (r = .3). This 
tendency was not apparent for control 
Ss (r=.0). Meaningful correlations 
could not be derived because of the 
limited spread of scores on the learning 
measure, 

Individual list recall scores.—As ex- 
pected, the experimental Ss were 
clearly and significantly superior in 
learning scores for each of the six lists 
of items. (See Table 2). What was 
not expected, however, was that the ex- 
perimental Ss showed no meaningful 
change from List 1 to List 6. They 
started out clearly superior to the con- 
trols and maintained the same high 
level of performance showing neither 
improvement nor loss. It was pre- 
sumed that learning six successive lists 
would produce at least some degree of 
negative transfer. If such negative 
transfer was present in the experi- 
mental group, it was effectively masked 
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by a practice effect. The experimental 
Ss consistently learned over 8 words/ 
list (m = 17.07). ; 

The control group showed a curious 
tendency to alternate high and low 
scores, thus scores for Lists 1, 3, and 
5 are relatively higher than scores for 
Lists 2 and 4. List 6 could conceivably 
represent an end-spurt effect. This 
tendency for alternating high and low 
performance levels was reported pre- 
viously by Bugelski (1948) and will 
be discussed later. For the moment it 
should be noted that the pattern of 
learning demonstrated by the control 
group deviated significantly from a 
straight line and most closely approxi- 
mated a rise-and-fall curve such as de- 
velops in a quintic analysis (Hayes, 
1963, p. 550f). From Fig. 2 it is ap- 
parent that the actual data fit the quin- 
tic curve more closely than they do the 
straight line function. A pronounced 
practice effect was shown by the con- 
trol group. The average recall scores 
for the control Ss was 12.93 or about 
6 words/list. The overall difference in 
favor of the experimental Ss was sig- 
nificant at greater than the .001 level. 

Total-recall scores.—ln the post- 

‘learning test, after all six lists had been 
attempted, Ss were asked to recall all 
of the words learned by ordinal num- 
ber. The experimental Ss recalled 
from 25 to 51 of the 60 words for a 
mean recall score of 12.69 or 63.4% 
or about 6 words/list. The control Ss 
did rather poorly in total recall. Their 
range of recall was 7-35 words with a 
mean score of 6.39 or 22%, about 3 
words/list. The differences in total 
recall were highly significant ($ < 
001). Figure 1 indicates that the ex- 
perimental Ss recalled almost equally 
well from the several lists although List 
2 differs significantly (p =< .01) from 
List 6, while the control Ss recalled 
mainly from the last two lists. The 
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first list recall amounted to little more 
than one word for the control Ss; the 
experimental Ss averaged 6 words from 
the first list. 

Protocols.—The control Ss had little 
to say by way of report on how they 
learned the individual associations. 
They did report trying various schemes 
(rapid rehearsal, trying to fit the words 
into a story, and sporadic associations 
from their individual histories). The 
common - response, when successful, 
however, was “I just remembered it.” 

In sharp contrast every experimental 
S reported copious imagery. They re- 
ported pleasure at the effectiveness of 
the system and claimed they easily vis- 
ualized the combinations. Each S was 
closely interrogated about each image. 
If he did not volunteer the information 
he was asked about the color of shoes, 
gates, their number, the ownership, and 
direction of movement, if any. It was 
noted if S was apparently using the 
same shoe or wine throughout the six 
lists or varying them. The Ss com- 
monly reported a host of details about 
each image with apparent conviction 
that they were describing real phe- 
nomena. 

Color was reported 958 times out of 
the possible 1,080 opportunities. Pos- 
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sessors were identified on 158 occa- 
sions, and action was described 205 
times. In 95 instances Ss reported the 
use of the same mnemonic image, i.e., 
the same shoe, hive, or hen. The same 
hive was reported 15 times( the most 
frequent report) suggesting the inex- 
perience of college students with hives. 
On five occasions Ss built up a con- 
tinuous sequence of items including 
previous object names, Thus, for Item 
9-wine, an S would be able to report a 
drunken soldier, cutting a rose with his 
sword, while lying on a couch. 

An interesting sidelight noted by the 
writer in some earlier work was the 
strong avoidance of and resistance to 
any suggestion. If, for example, E 
would ask, “Was it red?” the S would 
say, “No,” and report some other or 
no color. Similarly Ss would not ac- 
cept suggestions about size, number, 
etc. 


Discussion 


There appears to be no question about 
the efficacy of the mnemonic system de- 
scribed in this report. The Ss are able 
to remember a great deal of material after 
one presentation and can retain this for 
a relatively long time. (The experimental 
sessions would sometimes run an hour 
and a half.) Even days later Ss can 
readily recall a large number of the asso- 
ciations. The writer has perfect recall of 
a list learned a year ago. The present 
study bears out the results previously re- 
ported (Bugelski et al., 1967). 

There is little to say about the time 
factors in this study except to note the 
close correspondence between self-deter- 
mined intervals and the effective learning 
intervals of the earlier study. Other ex- 
perimenters might be encouraged to ex- 
plore more freely the factor of time rather 
than remain constrained by the limits of 
memory drums and other devices. By 
restricting Ss to fixed intervals one may 
be forcing Ss to move along without learn- 
ing much of anything, and thereby artifi- 
cially distorting the nature of some hypo- 
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thetical learning curve. According to the 
observations of this study, as with an 
earlier study (Bugelski & Rickwood, 
1963), an S needs a certain amount of 
time to find an association. He will take 
this time whether E restricts presenta- 
tion time or not. If he finds an associa- 
tion or mediator, it is generally applicable 
for some time. In effect, Ss were not 
really learning anything new, they were 
putting already learned materials together 
in a novel conjunction. 

The testimony of Ss about their imagery 
represents, of course, a subjective report. 
The Ss were not lying, to be sure, but 
they could have been describing simple 
verbalizations that they had undertaken, 
Had this been so, the explanation of their 
efficient recall would rest on the recall of 
sentences or phrases. The probability of 
a more effective recall of sentences than 
of mere words will not be challenged here, 
but it is worthy to note that if Ss were 
using sentences they were also adding a 
great deal of extra detail (color, size, 
number, ownership, direction, etc.) to the 
words and the sentences. In such an 
event, recall of sentences or phrases would 
have been too long to verbalize in the in- 
tervals employed. When one S identifies 
the shoe involved in some image as his 
mother’s pink bedroom slipper, and denies 
that it could have been his father’s black 
overshoe, one must wonder at the tenacity 
with which S defends his reported image. 
The refusal to accept suggestions coupled 
with the excess of detail suggests that Ss 
were doing more than talking to them- 
selves. They were describing phenomenal 
experiences that they had no difficulty in 
labeling as images. 

The most convincing evidence regard- 
ing imagery comes from the reports of 
many Ss who expressed the belief that 
they did not know some or any of the 
words when either the original learning 
or recall test began. They would then 
mumble the numeral, state the rhyme 
word, and then report “Oh, yes, hen-ski^ 
They asserted that the “little hen on skis 
had to appear before they could report 
"ski." There seemed to be no memoriza- 
tion of sentences or phrases. This com- 
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mon report of no knowledge or complete 
blanks followed by correct reports belies 
any verbal rehearsal or running off of 
verbal associations and testifies to the 
mediational function of the rhyme cues. 
The relatively superior total-recall scores 
again support the reality of the imagery 
function. When asked to recall all of 
the Number 2 words, e.g. the experi- 
mental Ss would report one after another 
unique image patterns involving shoes. 
The control group which had equal oppor- 
tunities for verbal operations merely 
floundered. 

Assuming the reality of imagery, why 
should it be effective as a mnemonic de- 
vice? To this question the data do not 
provide any clear-cut answers. Simple 
verbalisms, such as are presupposed for 
the control group, apparently interfere 
with each other to an extent not true of 
images. It may be that each attempt at 
imaging involved somewhat different ar- 
rangements and interactions of components 
whereas the verbalisms assumed for the 
control group could very well be identical 
from list to list and lead to greater inter- 
ference. 

The peculiar rise and fall effect dis- 
played by the control Ss may be recalled 
here as pertinent. If the control S 
learned relatively well on List 1 he had 
a large number of potential sources of 
interference for learning List 2. Poor 
learning on List 2 left him freer to learn 
List 3, etc. As mentioned earlier, this 
same pattern was reported by the writer 
in 1948 when Ss learned 10 successive 
lists of paired-associate nonsense syllables 
where the stimulus syllables were the 
same in each list. In that study Lists 1, 
3, 5, and 7 were learned more effectively 
than Lists 2, 4, 6, and 8. It was argued 
then that when the Ss learned more, there 
was more to interfere with a following 
list and the subsequent learning was poor; 
being poor, it could not interfere effec- 
tively with the succeeding list, etc. This 
Phenomenon is probably restricted to 
relatively poorly learned material, but it 
bears consideration in situations which 
call for successive learning experiences. 
It is interesting to note that Ebbinghaus 
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(1964) described this peculiar oscillation 
in 1885 (see p. 44) when he rererred to 

. . “another tendency which would not 
a priori be so readily suspected, namely 
the tendency of comparatively low values 
to be followed by comparatively high 
ones and vice versa.” 

The mnemonic system employed enabled 
the users to learn better throughout the 
six exercises and to retain more (in per- 
centage terms) than was retained by the 
control Ss, Because of the superiority of 
the experimental Ss in the original learn- 
ing the results cannot be treated in tradi- 
tional-inhibition terminology. An attempt 
to do so would lead to confusion, yet, it is 
necessary to emphasize the relatively long 
term retention of such image-assisted 
learning and its relative freedom from in- 
terference, Experimental Ss could report, 
in many instances, all six of the associates 
for some or other rhyme word. None of 
the control Ss could do so. 

From the considerations mentioned 
above, it appears that an image can be 
entertained in relative isolation from what 
went before, or even incorporated into, a 
series of new associations without losing 
its identity in some other context, 

That imagery was indeed being used 
received a possible (though feeble) meas- 
ure of support from some incidental ob- 
servations on some of the Ss who hap- 
pened to be of Jewish backgrounds and 
who always lived in large metropolitan 
communities. These Ss had difficulty with 
two of the mnemonic items, “heaven” and 
“hen.” They had no problem, of course, 
in saying these words, but they could not 
use them as effectively in the exercise as 
Ss of Christian background. For some 
Ss “chicken little’ seemed to be their 
total experience with hens, and they were 
unable to “image” hens effectively. Sim- 
ilarly, “heaven” was just a word to many 
of these Ss and as such did not work well. 
Such modest hints may lead to more ef- 
fective tests of the reality of imagery than 
our current reliance on subjective reports. 

The use of imagery, if in fact it was 
used, suggests that paired-associate learn- 
ing can be achieved in one trial if Ss are 
able to use their backgrounds and, in 
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effect, tie two old experiences together. 
When such is the task, imagery can serve 
as an effective mediator. Such imagery 
takes time to occur and unless the appro- 
priate background is available, learning 
will not be enhanced. 
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180 college students solved 3 concept identification problems in a 
3X2X2X53 repeated-measures factorial design with the variables: 
(a) sensory modality (visual, auditory, or mixed visual and auditory), 
(b) information (incomplete: specification of instances as positive 
or negative, or complete: specification as positive, negative, or partially 
positive), (c) number of persons (1 or 2), and (d) problems (3 for 
each individual or pair). Major results were: (a) fewer concept in- 
stances to solution for mixed visual and auditory stimuli than visual 
(p €.001) or auditory (p <.05), and fewer for auditory than visual 
(p <.05), (b) fewer instances to solution for complete information 
than incomplete, (c) fewer instances for cooperative pairs than indi- 
viduals, (d) an interaction between sensory modality and the number of 
persons, and (e) linear improvement over the 3 successive problems. 


Although natural concepts are typ- 
ically based on attributes that impinge 
upon several sensory modalities simul- 
taneously, virtually all studies of human 
conceptual behavior have used visual 
stimuli. In the first of three studies 
using nonvisual modalities, Lordahl 
(1961) found that Ss were less atten- 
tive to auditory than visual dimensions 
in a sorting task where concepts were 
defined by a combination of auditory 
and visual dimensions, and thus in- 
creases in the amount of irrelevant 
auditory dimensions did not result in 
decreased performance as typically 
found with visual stimuli (e.g., Archer, 
Bourne, & Brown, 1955; Walker & 
Bourne, 1961). Using only auditory 
stimuli, Bulgarella and Archer (1962) 
found that performance decteased with 
both increased relevant dimensions and 
increased irrelevant dimensions. Like 
Lordahl (1961), they concluded that 
auditory stimuli tend to be ignored 
When used in conjunction with visual 
stimuli, while with auditory stimuli 
alone the relationships between per- 
formance and increased relevant or ir- 
relevant dimensions are comparable to 


those for visual stimuli alone. Hay- 
good (1965) directly compared visual, 
auditory, and mixed visual and audi- 
tory stimuli for problems with both one 
and two relevant attributes, There was 
no difference between visual, auditory, 
and mixed visual and auditory stimuli 
for problems with one relevant attri- 
bute, and the mixed stimuli were sig- 
nificantly more difficult than either vis- 
ual or auditory with two relevant 
attributes. However, a 2 X 2 analysis 
of one vs. two attributes and visual vs. 
auditory stimuli showed no main ef- 
fects, which Haygood interpreted as 
due to the significantly larger within- 
group variance in auditory conditions. 
Thus, as an extension of these three 
studies, the present experiment directly 
compared visual, auditory, and mixed 
visual and auditory stimuli. 
Beginning with Smoke (1933) a 
number of experiments have compared 
concept identification from positive and 
negative instances. Hovland (1952) 
emphasized that the amount of informa- 
tion conveyed by positive and negative 
instances often differs. The difficulty 
of negative instances was reduced in a 
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study by Hovland and Weiss (1953) 
which equated the information con- 
veyed by each type of instance. Glan- 
zer, Huttenlocher, and Clark (1963) 
extended the comparison to simple vs. 
complex positive and negative in- 
stances, and Freibergs and Tulving 
(1961) found that the relative differ- 
ence in favor of positive instances 
greatly decreased over a series of 20 
problems. Thus, all previous studies 
have considered positive and negative 
instances a dichotomy. However, with 
concepts having two or more relevant 
attributes, positive and negative in- 
stances are really points on a dimension 
of information specification. As one 
end point of this dimension a positive 
instance indicates that all relevant at- 
tributes are present, and the other end 
point is the negative instance on which 
all relevant attributes are absent. Other 
types of negative instances are partially 
positive because they have anywhere 
from one relevant attribute to the num- 
ber of relevant attributes minus one 
present, and are thus intermediate on 
the continuum of number of relevant 
attributes present. Thus, the present 
study compared two types of informa- 
tion specification: (a) incomplete, the 
typical specification of concept instances 
as positive or negative, and (b) com- 
plete, specification of instances as posi- 
tive, negative, or partially positive. 
Laughlin and his associates have re- 
cently extended research in conceptual 
behavior to a comparison of individ- 
uals and cooperative pairs. Laughlin 
(1965) found that'two persons required 
fewer card choices and used the focus- 
ing strategy more than individuals (al- 
though the first result was not upheld 
with the Taylor-McNemar, 1955, cor- 
rection model). Laughlin and Doherty 
(1967) compared cooperative pairs in 
a factorial design in which both dis- 
ctissioni‘and paper and pencil as a mem- 
ory” martipulation "were" or - were “not 
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allowed, Pairs allowed discussion were 
superior to no-discussion pairs, while 
there was no difference between pairs 
allowed or not allowed the use of paper 
and pencil. The importance of the dis- 
cussion process was further indicated 
by Laughlin and McGlynn (1967), who 
found cooperative pairs were greatly 
superior to two competitive individuals, 
Since all three studies involved a selec- 
tion procedure, in which S selects stim- 
uli from an array of all possible concept 
instances, the present study extended 
the comparison of individuals and co- 
operative pairs to a reception proce- 
dure, in which the order of concept in- 
stances was preprogrammed. Likewise, 
since the previous studies used visual 
stimuli and incomplete information 
specification, the comparison of indi- 
viduals and cooperative pairs was 
extended to auditory and mixed mo- 
dalities and complete information 
specification. 


METHOD 


Subjects and design.—The Ss were 180 
college students in introductory psychology. 
classes at Loyola University. The experi- 
mental design was a 3 X 2X 2 X 3 repeated- 
measures factorial with the variables: (a) 
sensory modality (visual, auditory, or mixed 
visual and auditory), (b) information 
(specification of positive and negative in- 
stances, or specification of positive, negative, 
and partially positive instances), (c) number 
of persons (one or two), and (d) problems 
(three for each individual or pair). Ten 
individuals were randomly assigned to each 
of the six individual conditions, and 10 like- 
sex.pairs to each of the six, pair conditions, 
Each ‘condition had half males and half 


females. a 
` Stimuli and problems.—Visual stimuli were 
81 regular 4X6-in. index cards with 


mounted construction paper figures consist- 
ing of all possibl combinations of the four 
three-value attributes: (a) shape (triangle, 
square, or circle), (b) color (red, yellow, 
or green), (c) number of figures (one, two, 
or three), and (d) size (small, medium, or 
large). Auditory stimuli were the 81 com- 
binations of the four three-value attributes: 
(a)-number-of notes (two, three; or: four); 
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(b) range (high, medium, or low), (c) in- 
flection (steady, rising, or falling), and (d) 
volume (soft, medium, or loud). The notes 
were played on a piano at three different 
octaves to vary range, the soft or loud pedal 
to vary volume, and rising or falling major 
thirds to vary inflection. The 81 combina- 
tions were recorded in random order on a 
tape recorder. Mixed visual and auditory 
stimuli consisted of the 81 possible combina- 
tions of the two visual dimensions shape and 
color, and the two auditory dimensions num- 
ber of notes and range. The visual dimen- 
sions were presented on 4X6 in. cards 
simultaneously with the tape-recorded audi- 
tory dimensions. All problems were con- 
junctions of three relevant attributes (e.g., 
large, red, triangle or large, green, three 
notes). Thus, problems for mixed auditory 
and visual conditions necessarily involved 
two dimensions from one modality and one 
from the other. Each problem for each in- 
dividual or pair was randomly selected from 
the total set of possible three-attribute con- 
cepts for the sensory modality. The order 
of presentation of the 81 instances was ran- 
dom in all three conditions; if more than 81 
instances were necessary for solution the 
same random order was repeated. 

Procedure.—The instructions explained the 
nature of the task and pointed out the at- 
tributes and values on a reference card which 
remained available throughout the experi- 
ment. The Ss were informed that all three 
of their problems would involve three rele- 
vant attributes and were instructed to make 
one hypothesis after each stimulus instance. 
In all three sensory modality conditions Æ 
indicated the information (positive, nega- 
tive, or partially positive) for each stimulus 
instance before presenting it for 4 sec. (in 
Incomplete information conditions instances 
which were actually partially positive were 
indicated as negative). After removal of 
the stimulus instance as much time as de- 
Sired was allowed for formulation of the 
hypothesis. In group conditions Ss were in- 
Structed to discuss the problems and jointly 
formulate their hypotheses; however, either 
S could actually make any hypothesis, After 
the hypothesis E indicated orally whether it 
Was correct or incorreet, and the problem 
Was terminated when the correct hypothesis 
Was expressed. 


RESULTS 


. The means for number of stimulus 
Instances to solution for each problem 


TABLE 1 


MEAN NUMBER or STIMULUS INSTANCES 
TO SOLUTION 


Incomplete Complete 
Information Information 
Problem 
One Two One Two 
Person | Persons | Person | Persons 
Visual Stimuli 
One 224 | 11.7 | 19.5 | 10.7 
Two 21.2 81 | 17.0 | 44 
Three 132 7.6 | 12.4] 40 
Total 56.7 | 27.4 | 489 | 19.1 
Auditory 
Stimuli 
One 17.8 | 21.5 7.6 | 59 
Two 11.4 8.9 9.6 | 3.6 
Three 12.1 | 111 | 12.4 1.3 
Total 41.0 | 41.5 | 29.6 | 16.8 
Mixed Visual 
and Audi- 
tory 
Stimuli 
One 12.6 8.8 9.9 | 69 
Two 12.2 6.6 | 100 | 52 
Three 14.7 515 9.5 3.5 
Total 39.5 | 20.9 | 29.4 | 15.6 


and totals over problems for the 12 
groups are given in Table 1. 

The effect of sensory modality was 
significant, F (2, 108) = 3.47, p < .05. 
Means for total instances to solution 
over three problems for visual, audi- 
tory, and mixed modalities were 37.98, 
32.23, and 26.35, respectively. By 
Duncan comparisons visual stimuli re- 
quired more instances to solution than 
auditory (p < .05) and mixed (p< 
.001), and auditory more than mixed 
(p < 05). Incomplete information re- 
quired more instances to solution than 
complete, F (1,108) = 15.24, p < .001. 
Two persons required fewer instances 
than one, F (1, 108) = 16.04, p < .001. 

The sensory modality by number of 
persons interaction was significant, F 
(2, 108) = 4.35, p < .05. The mean 
instances to solution was 52.7 for one 
person, visual stimuli; 23.5 for two 
persons, visual; 35.3 for one person, 
auditory; 29.2 for two persons, audi- 
tory ; 34.5 for one person, mixed; and 
18.3. for two persons, mixed. By, 
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TABLE 2 


CORRELATIONS BETWEEN PAIRS OF PROBLEMS 
WITHIN EACH TREATMENT CONDITION 


Incomplete Complete 
Information Information 
Correlation 
One Two One Two 
Person | Persons | Person | Persons 
Visual Stimuli 
Problems 
one-two .63* .73** | —.35 AS 
Problems 
one-three —.33 32 —.21 | —.06 
Problems 
two-three -40 | —10 «54 50 
Auditory Stimuli 
Problems 
one-two 60" 14 —.02 15 
Problems 
one-three 06 —.07 45 86 
Problems 
two-three 65* 08 —-47 55 
Mixed Visual and 
Auditory 
Stimuli 
Problems 
one-two Ao —42 -57 | —.20 
Problems 
one-three .00 53 44 | —.13 
Problems 
two-three 440 | —.54 07 11 


*p = 0Swhenr = .58. 

** 5 = Ol whenr =.71. 
multiple-range comparisons one person 
with visual stimuli required more in- 
stances to solution than each of the 
other five conditions (p < .001 for all 
five comparisons). Two persons with 
visual stimuli required fewer instances 
than one (p < .001), and two persons 
with mixed stimuli required fewer in- 
stances than one (p < .001), but two 
persons did not differ from one for 
auditory stimuli. No other interactions 
were significant. 

The effect of problems was signifi- 
cant, F (2, 216) = 3.81, p < .05, indi- 
cating a significant linear decrease over 
problems, Fy, (1, 216) = 634, p< 
.05, and a nonsignificant quadratic ef- 
fect, Faua (1, 216) = 1.28. By multi- 
ple-range comparisons there were fewer 
card instances to solution on Problem 2 
than Problem 1 (p < .05) and fewer 
on Problem 3 than Problem 1 (p< 
.05), with no difference between Prob- 
lems 2 and 3. 

The correlations between pairs of 
problems within each of the 12 treat- 
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ment conditions are given in Table 2, 
Inspection of the correlations indicates 
a largely random pattern, since only 
5 of the 36 correlations were sig- 
nificant, and 14 were actually negative, 
Thus, there was little interproblem con- 
sistency in performance, since high 
positive correlations would have re- 
sulted if some Ss were consistently 
superior or inferior problem solvers, 
This finding supports the assumption 
of the mathematical models of Bourne 
and Restle (1959) and Bower and Tra- 
basso (1963) that concept attainment 
is essentially a process of random sam- 
pling of hypotheses by individuals who 
do not differ in problem-solving ability. 
Consequently, the pervasive problem in 
all individual vs. group problem solv- 
ing, the possibility that any obtained 
group superiority is merely an artifact 
of the probability that one person in 
each group will be above the overall 
mean in ability (see Taylor & Mc- 
Nemar, 1955, for further discussion) 
would not seem to be a plausible ex- 
planation of the superiority of the pairs. 


Discussion 


The finding that the mixed visual and 
auditory stimuli required fewer concept 
instances to solution than either visual or 
auditory, and auditory fewer than visual, 
differs from previous research. Haygood 
(1965) found that mixed visual and audi- 
tory stimuli required more instances than 
either visual or auditory, which did not 
differ. Likewise Lordahl (1961) found 
that auditory information tended to be 
ignored when presented in conjunction 
with visual information. The reason for 
these contrary findings does not seem 
readily apparent. Possible procedural dif- 
ferences include the redundant instances 
used in some conditions by Haygood, al- 
though redundancy had no significant ef- 
fects in her experiment, and the use of 
concepts with three relevant attributes in 
the present study in contrast to the two- 
attribute concepts of Haygood. However; 
both experiments indicate that the sensory 
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modality is more important than generally 
assumed. The typical laboratory experi- 
ment with visual stimuli is an abstraction 
from the extralaboratory experience of Ss, 
in which concepts are formed from in- 
formation impinging on several sensory 
modalities simultaneously, and in which 
the influence of mixed modality stimula- 
tion such as television and movies is 
pervasive. 

The finding that complete information 
conditions, in which partially positive in- 
stances were specified in addition to the 
usual specification of positive and nega- 
tive instances only, required fewer in- 
stances to solution indicates that concept 
identification is facilitated by information 
beyond that usually given. Thus, in- 
formation specification. should be more 
carefully controlled in conceptual re- 
search as a dimension whose endpoints 
are positive and purely negative in- 
stances. This is analogous to the demon- 
stration of Bourne, Goldstein, and Link 
(1964) that successive and simultaneous 
presentation methods are not a dichotomy 
but end points of a dimension of stimulus 
availability. 

The obtained superiority of cooperative 
pairs over individuals with a reception 
procedure and nonvisual modalities ex- 
tends previous research with selection 
procedures and visual stimuli (Laughlin, 
1965; Laughlin & Doherty, 1967; Laugh- 
lin & McGlynn, 1967). Furthermore, the 
interaction between stimulus modality and 
number of persons indicated that the su- 
periority of pairs was most pronounced 
with visual stimuli, with less difference 
for mixed stimuli, and no difference for 
auditory stimuli. Since the visual con- 
cepts were the most difficult of the three 
modalities, the superiority of pairs was 
most marked in the most difficult condi- 
tion. Basically, the superiority of pairs 
seems to be due more to the opportunity 
of the pairs to discuss the problems and 
Monitor each other’s inference than it 
does to their superior memory resources 
(Laughlin & Doherty, 1967). 

Finally, the significant improvement 
Over problems extends the emerging gen- 
eralization that positive interproblem 
transfer occurs with reception procedures 


(e.g, Haygood & Bourne, 1965; Neisser 
& Weene, 1962; Wells, 1962; Wells & 
Watson, 1965), while it is infrequent with 
selection procedures (e.g., Bruner, Good- 
now, & Austin, 1956; Conant & Trabasso, 
1964; Laughlin, 1966; Laughlin & Jordan, 
1967). 
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, 


This experiment investigated 2 variables that may influence Ss 
discovery of systematicity while learning structured materials: (a) 
the number of items from within a structured system that are pre- 
sented and (b) the arrangement of the learning series. The Ss 
were presented with either 8, 12, or 16 items from a system (16 was 
the maximum possible). The stimuli were presented in either an 
overlap (OV) configuration in which values of the stimulus dimen- 
sions were often the same from item to item, or a nonoverlap 
(NOV) configuration in which stimulus values were never the same 
from item to item. 6 groups of 8 Ss were tested. Results showed 
that number of items to learn did not affect learning while learning 
was significantly faster in the OV groups than in the NOV groups. 
Results were discussed in terms of discovery mechanisms in learning. 


When Ss learn part of a miniature even the amount to be learned, with- 
linguistic system (MLS), they are able out specifying which (systematic) as- 
to respond appropriately to items from pects of the situation Ss have dis- 
the system that were not originally covered. 
learned. For example, Esper (1925) Although Ss may discover or invent 
showed that if Ss learn 14 of the 16 a great variety of systems, it can be 
items defined by the Fig. 1 matrix argued that when they acquire mate- 
(e.g., if the stimulus is a red circle, the rials represented by Fig. 1, they learn 
response is ZinTep), then they can cor- the marginals of the matrix and not 
rectly respond to the 2 remaining items. 


In a previous paper (Foss, 1968), Responses -Tep -Fub Pi -Gom 
Esper’s result was extended to a variety 
of learning situations and it was argued 
; Stimuli 
that what is learned when an MLS is Oe Q VN A Y 


acquired is a function of the systematic 
aspects of the learning materials that Zin- 
are discovered by S. In brief, it is not 
Possible to state what is learned or o- 
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just the cell entries which were pre- 
sented. "That is, an item consists of 
pairs such as: red-Zin, circle-Tep, etc. ; 
and not pairs such as: red circle-Zintep, 
yellow triangle-Jorfub, etc. 

Little has been said, however, about 
variables that might influence these dis- 
coveries and the reorganization of the 
input they affect. This experiment in- 
vestigated two variables that may be 
involved in this process. 

Assume that due to item forgetting 
the mechanism for discovering sys- 
tematicity makes more efficient use of 
material recently presented than mate- 
rial presented earlier in time or earlier 
in the learning sequence. Then sys- 
tematicity is likely to be discovered 
sooner if items which share stimulus 
values and response syllables occur 
near to each other in the learning list. 
The present study investigated this in- 
put variable by simply varying the 
order of the pairs presented during 
learning. 

Another variable which may influ- 
ence learning and the discovery of sys- 
tematicity is the number of individual 
pairs (items) from within the MLS 
that are presented for learning. It 
may be argued (Foss, 1968) that, over 
a broad range of items, learning is in- 
dependent of the number of items pre- 
sented. As long as enough items are 
present to logically permit system dis- 
covery and (perhaps) as long as the 
task is difficult enough to make such 
discovery advantageous, all Ss will 
learn an eight-item list (the matrix 
marginals). The present experiment 
provided a test of this hypothesis. 


DEsIGN AND METHOD 


The stimuli were presented in either an 
overlap (OV) or. a nonoverlap (NOV) con- 
figuration. In the OV condition, the mate- 
rials were arranged so that a colored form 
was followed by another which shared either 
its color or its form about 70% of the time. 
In the NOV cotidition, a colored form was 


never followed by another with an identical 
attribute on either of its dimensions, 

The Ss learned either 8, 12, or all 16 items 
from within the Fig. 1 matrix. Two items 
from each row and column were chosen for 
the Ss who received 8 items, 3 from each 
row and column for the 12-item Ss, Thus, 
there were six groups in a Two (OV or 
NOV) X Three (number of items presented) 
crossed design. The Ss were not informed 
of the systematic relations existing between 
the stimuli and the responses. After learn- 
ing, all Ss were tested on all 16 items. 

Subjects—The Ss were 48 undergraduate- 
male volunteers. Eight Ss were assigned to 
each of the six groups. The Ss were tested 
individually. 

Materials—The stimulus materials con- 
sisted of slides of the various colored forms 
arranged in either the OV or NOV con- 
figurations. The colors, forms, and responses 
assigned to them are shown in Fig. 1. The 
materials were arranged in five different 
series to preclude serial learning; the details 
can be found in Foss (1968). 

Procedure.—The Ss were read paired- 
associate instructions modified for the non- 
verbal stimuli. A response familiarization 
task was then presented in which eight two- 
syllable responses (two from each row and 
column) were read 10 times in varying orders 
with S repeating each two-syllable response 
after E. Equal stress was placed on each 
syllable. After one practice trial (a trial 
consisted of 8, 12, or 16 items depending 
upon which group S was in) S began antic- 
ipating the responses. He was informed of 
the results on each item by E who said the 
correct response. Approximately 6 sec. were 
allowed between stimulus onset and E's pre- 
sentation of the correct response if S did not 
respond.  Interitem interval was approxi- 
mately 3 sec. The learning criterion was 
one errorless trial. If .S did not learn 
within 22 trials, the experiment was termi- 
nated. X 

After criterion was reached, S was in- 
structed that he would see a list in which 
some items might seem new. He was told 
to respond to all items if possible or to say 
"No response Then all 16 items were pre- 
sented to S with no time limit on responding 
and no feedback about correctness. Correct 
responses to new cell entries indicated 
mastery of the system. i 


RESULTS 


The primary measure of original 
learning was the number of item tokens 
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TABLE 1 
MEAN NUMBER OF ITEM TOKENS AND TRIALS BEFORE CRITERION 
AND NUMBER OF ERRORS ON TEST TRIAL 
Overlap (OV) Nonoverlap (NOV) 
8 12 16 8 12 16 
Learning 
Item tokens 7 
M 4.0 109.5 80.0 127.0 147.0 128.0 
SD 40.1 58.6 38.2 48.3 8142 60.5 
Trials + 
T M 9.25 9.12 5.0 15.88 12.2 a 8.0 
est pw 
Items incorrect 16 2 21 0 v 2 


(actual slides) presented to S up to 
but not including the trial on which 
criterion was reached. The trial on 
which criterion was reached was omit- 
ted from the count of item tokens since 
counting that trial would add 16 items 
to the 16-item group but only 8 items 
to the 8-item group, thus strongly bias- 
ing the results. 

The mean number of item-tokens and 
trials per S before criterion are shown 
in Table 1. „Analysis of variance on 
the item-token data indicated that the 
effect due to number of items from 
within the matrix (8, 12, or 16) was 
not significant, F (2, 42) — 1.09; the 
overall difference between the OV and 
NOV groups was highly significant, F 
(1, 42) — 7.62, p « .01; and the inter- 
action of these two variables was not 
significant, F (2, 42) < 1.0. In addi- 
tion, an orthogonal contrast was con- 
Structed to test whether the perform- 
ance of the 12-item group differed from 
the mean performance of the 8-item 
and 16-item groups. This contrast was 
not significant, F (1, 42) = 2.15, p > 
0. A more conservative analysis of 
variance, one which included the data 
from the criterion trial, showed an 
identical pattern of significance. Five 
Ss, all in the NOV condition, failed to 
learn the lists within 22 trials. Hence, 
the overall difference between the OV. 


and NOV groups was somewhat under- 
estimated. Note that since the 8- and 
16-item groups took the same number 
of item-tokens to learn, the latter group 
took about one-half as many trials as 
the former. Thus, the second half of 
the 16-item list is approximately equiv- 
alent to a new trial on the 8-item list. 

The number of incorrect responses 
to test items is also given in Table 1. 
Analysis of variance (adjusted for 
unequal Ns) indicated that the overall 
difference between the OV and NOV 
groups was far from significant, F (1, 
37) < 1.0, while the effect due to num- 
ber of items presented was significant, 
F (2, 37) =7.14, p< 01. The inter- 
action of these variables was not sig- 
nificant, F (2, 37) < 1.0. Scheffé tests 
indicated that the effect due to number 
of items in the learning list was due 
entirely to the eight-item group differ- 
ing from the other two groups. The 
Ss in the eight-item group were, how- 
ever, correctly responding to most new 
items indicating that they had in fact 
learned the system. 


Discussion 


In general, the results were in accord 
with expectations, Learning a 16-item 
list occurred just as quickly as learning 
an &-item list when the materials were 
structured as in Fig. 1 and when learning 
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was measured by the number of item 
tokens presented. If learning were mea- 
sured in terms of trials, then list length 
was inversely related to the number of 
trials, The arrangement of materials in 
the OV and NOV series had a large and 
significant effect upon learning. 

The difference between the OV and 
NOV groups supports the notion that the 
discovery and reorganization mechanism 
is sensitive to input conditions, This re- 
sult is closely related to previous findings 
that the successive presentation of positive 
instances of a concept speeds concept at- 
tainment (Kurtz & Hovland, 1956; New- 
man, 1956; Schulz, Miller, & Radtke, 
1963). Underwood (1952) predicted such 
findings on a somewhat different basis, 
i.e., that “contiguity” of positive instances 
would aid concept attainment. Whether 
one prefers the contiguity or the discovery 
analysis, it is clear that these experiments 
implicate a memory factor since items are 
not strictly contiguous. It seems reason- 
able to assume that the discovery mecha- 
nism can efficiently utilize information 
from items that have been rote learned 
and not forgotten, items in perception, and 
recent items which are being held in a 
limited short-term store. This assump- 
tion adds a constraint to the internal 
structure of the discovery mechanism. 

Inspection of the learning curves 
showed that the OV groups were superior 
to the NOV groups at the intercept and 
in the early trial slopes. Perhaps this 


DONALD J. FOSS 


difference could have been due to guessing 
strategies adopted by Ss. That is, Ss in 
the OV groups might have quickly learned 
to use the item-to-item syllable overlap 
to advantage in guessing the responses, 
However, if these Ss were guessing and 
not learning, then they would have reached 
criterion without the ability to correctly 
respond to the new matrix items, The 
test data indicated that Ss in the OV 
groups did learn as well as the NOV 
group Ss. Thus, the performances of the 
OV groups cannot reasonably be assigned 
only to good guessing strategies. 
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DIFFERENTIAL RECALL OF STIMULI AND RESPONSES 
FOLLOWING PAIRED-ASSOCIATE LEARNING* 


SANDRA S. MERRYMAN AND COLEMAN T. MERRYMAN 
Indiana University 


Asch and Ebenholtz have proposed that failures to find associative 
symmetry have been due to a decrease in stimulus availability 
during paired-associate (PA) learning. As a test of this hypothesis, 
30 Ss were given prefamiliarization of CVC's that were later used 
ina PA task. Following PA learning, they were given a recall test 
of all CVC's, and significantly more responses than stimuli were re- 
called. The data were taken as support for the Asch and Eben- 
holtz hypothesis. It was suggested that previous experiments which 
used prefamiliarization and resulted in stronger forward associations 
may not be considered sufficient evidence against the principle of 


associative symmetry. 


Studies of paired-associate (PA) learning 
have demonstrated that two associations are 
learned: Given one verbal unit A, S may 
anticipate the paired verbal unit B; and 
given B, he may anticipate A. Usually, 
although not always, the forward associa- 
tion (A-B) is stronger than the backward 
association (B-A). 

However, Asch and Ebenholtz (1962) 
have proposed the principle of associative 
symmetry: “When an association is formed 
between two distinct terms, a and b, it is 
established simultaneously and with equal 
strength between b and a [p. 136].” Thus 
these writers imply one association which 
acts with equal strength in both directions. 
They argued that, since S does not have to 
learn the nominal stimulus during forward 
learning, the stimulus terms of the PA list 
may not be as readily available as the re- 
sponse terms. The apparent asymmetry be- 
tween A-B and B-A performance may be 
due, not to a difference in associative 
strength, but to a difference in availability of 
the verbal units. 

Attempts to eliminate associative asym- 
metry by equating stimulus and response 
availability have been relatively unsuccessful. 
Asch and Ebenholtz (Exp. V, VI, and VII) 
gave free learning of all the stimuli and 
responses before PA learning. Nevertheless, 


1 This study was conducted while the first author 
held a Public Health Service predoctoral fellowship 
in school psychology, Grant MH 08505-03. The 
second author’s participation was supported by Grant 
GB 2848 from the National Science Foundation to 
Frank Restle. The authors wish to thank James G 
Greeno for his assistance throughout the investigation. 


Exp. VI and VII showed greater strength 
of A-B than of B-A learning. Experiment 
V showed no difference between A-B and 
B-A recall, but it included free learning of 
the paired units, 

When prefamiliarization failed to eliminate 
the difference between A-B and B-A per- 
formance in Exp. VI and VII, Asch and 
Ebenholtz advanced the hypothesis that, dur- 
ing PA learning, response terms become 
more available than stimulus terms. That is, 
PA learning leads to a decrease in the 
availability of the nominal stimuli. Accord- 
ing to Ekstrand (1966), the Asch and Eben- 
holtz hypothesis implies that prefamiliariza- 
tion of the stimuli and responses by free 
learning, followed by PA learning and a 
test of availability, should result in the re- 
sponses being more available than the stimuli, 
The present experiment tested that implica- 
tion. 

Method.—The Ss were 36 female under- 
graduates from introductory psychology 
courses at Indiana University. Of these, 
6 failed to reach criterion, leaving 30 Ss 
used in the analysis. 

Eight CVC’s with Archer (1960) asso- 
ciation values between 5 and 8 were pre- 
sented in primary type on 5 in. X 8 in. white 
cards (Table 1). Syllables were chosen to 
produce a relatively large number of con- 
fusion errors. 

Each S was shown the eight syllables, for 
2 sec. each, and was instructed to spell out 
the syllable as it was exposed. After pre- 
sentation of all eight CVC’s S was given 
1 min. to recall as many syllables as possible, 
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TABLE 1 
CVC's Usep 
CVC's Archer values 
MYV XAJ 5 
QIH WYJ 6 
NIJ KYH 7 
XEK QOH 8 


in any order, spelling them aloud. This 
sequence of presentation and recall continued 
to a criterion of one trial on which all items 
were recalled. Those Ss not reaching cri- 
terion in 20 trials were discarded. Ten ran- 
domizations of the eight items were obtained ; 
each was presented once during the first 10 
trials. Another randomization of these same 
10 orders of presentation was used on Trials 

After one perfect recall trial, $ was given 
PA learning under anticipation conditions. 
The eight items were divided into four pairs, 
the syllables of each pair being matched for 
Archer value (Table 1). The function 
served by an item (stimulus or response) was 
counterbalanced. Each member of each pair 
was exposed for 2 sec. In order to reduce 
the possibility of S's rehearsing pairs in re- 
verse order, there was no intertrial interval. 
A blank line was placed to the right of each 
stimulus; the response was presented alone. 
When the stimulus (A) of each pair was 
exposed, S' spelled it and attempted to spell 
the response (B). If the anticipation was 
incorrect, S spelled B when it was exposed ; 
if correct, S' said nothing when B was pre- 
sented. Anticipation trials continued to a 
criterion of one errorless trial. Those Ss 
not reaching criterion in 20 trials were dis- 
carded. As in free learning, 10 random 
orders of pairs and two randomizations of 
orders were used. 

After reaching criterion on the PA task, 
each S was given 1 min. to recall as many 
CVC's as possible, in any order, spelling 
each aloud. 

Results—Mean number of trials to cri- 
terion for free learning and PA learning 
were 10.7 and 10.6, respectively. 

Two tests were performed to assess the 
effectiveness of counterbalancing stimuli and 
responses. During free learning, items later 
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designated as responses were not recalled 
significantly more often than items later 
designated as stimuli, ¢ (29) = 1.88, p >05. 
In addition, on the last two free learning 
trials before criterion, there was no sig- 
nificant difference in frequency of recall of 
stimuli and responses, t (29) = 0.24, p> 80. 
Thus, prior to PA learning, stimuli and re- 
sponses were equally available. 

After PA learning, more responses than 
stimuli were recalled. Means were 3.73 and 
3.20, respectively, out of a possible 4 each, 
t (29) =2.94, p<.01. 

Discussion—The results supported Asch 
and Ebenholtz’s hypothesis that PA learn- 
ing produces a decrease in the availability of 
the nominal stimuli. Thus the principle of 
associative symmetry may not be rejected 
on the basis that previous studies using pre- 
familiarization have resulted in stronger for- 
ward associations. 

That PA learning leads to lowered reten- 
tion of the nominal stimulus seems especially 
convincing, since two precautions were taken 
to insure S’s having to remember the entire 
stimulus. First, syllables were chosen to 
produce a moderate-to-large number of con- 
fusion errors. That they did so may be 
seen from the number of trials required both 
for free learning of the eight syllables and 
for PA learning of the four pairs. Second, 
S was required to produce the stimulus on 
each trial. Both of these procedures required 
that S attend to the complete nominal stim- 
ulus. 

The results suggest that previous attempts 
to equate stimulus and response availability 
through prefamiliarization may have been 
unsuccessful; given conditions highly con- 
ducive to retention of the stimuli, Ss still 
recalled more responses than stimuli follow- 
ing PA learning. 
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OPERATION OF A NOVELTY PRINCIPLE 
IN TRANSFER OF RESPONSE 


ARNOLD BINDER 
University of California, Irvine 


Earlier work by Binder and Estes established the operation of a 
relative novelty principle in transfer to new compounds following 


paired-associate learning by anticipation. 


Certain complex learning 


strategies involving cue selection were possible in these earlier experi- 
ments, the effects of which were confounded with the novelty effects, 
The conditions of the present experiment made it necessary for Ss 
to learn to associate all cues with the responses; the effect of the 
novelty principle was still in evidence. 


Binder and Estes (1966) found the opera- 
tion of a novelty principle in the responses of 
Ss to new compounds of previously learned 
cues. Following paired-associate trials dur- 
ing which cue a occurred more often than 
cue b, for example, Ss presented with the 
test compound ab (a combination which had 
not occurred during learning) were more 
likely to give a response previously associa- 
ted with b rather than one associated with a. 

This tended to be the case whether the 
difference in frequencies resulted from repe- 
tition of a learning compound like ac or the 
occurrence of a given cue in different learn- 
ing compounds, as a in ac and ad. 

In the various experiments reported by 
Binder and Estes, the characteristics of the 
learning contexts allowed the operation of 
complex learning strategies which may haye 
produced the results attributed to the novelty 
principle. To illustrate, consider the last of 
their experiments (Exp. VI), the design 
of which is shown in Table 1. Learning of 
the paired associates proceeded to a fixed 
number of blocks, after which the test com- 
pounds were presented without associated 
responses. The Ss gave only one test re- 
sponse to each such compound. 

Many of the analyses of results depended 
9n Ss having learned the association of all 
cues with their respective responses. But 
ìn certain cases it was possible to adopt a 
Strategy whereby certain cues were dis- 
regarded; in the design shown in Table 1 
Ss could learn as follows: b is an i, d is a 
k, and neither b nor d is a j. After that 
course of learning, one would expect a k 
Tesponse to test figure ad and an i response 
to cb, both of which are in the direction ex- 
Pected by the novelty principle. 

The present experiment was aimed at 
determining if results consistent with the 
Novelty principle are obtained under con- 
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ditions that force the learning of all cue- 
response associations. In particular, this 
experiment was like the previous Exp. VI 
in learning and test stimuli, but so arranged 
that Ss were forced to associate a with i 
and j, b with i, c with j and k, and d with k. 

Method.—A. total of 95 Ss was run but 
the data from 16 could not be used because 
of the failure to reach a criterion. The ap- 
paratus is fully described in the original 
monograph (Binder & Estes, 1966). 

The overall design of the experiment was 
like that of Exp. VI in Binder and Estes, 
and may be seen in Table 1. As before, the 
nonsense syllables VOP, GAK, FUH, and 
TEF were randomly assigned to the: stimulus 
letters a, b, c, and d and the numbers 1, 2, 
and 3 to the response letters i, j, and k 
(Table 1). 

The Binder and Estes experiments con- 
sisted of two parts, a training phase and a 
testing phase. During training, numerical 
labels were associated with nonsense-syllable 
pairs under a paired-associate procedure. 
During the testing phase, which followed a 
fixed number of acquisition trials, Ss re- 
sponded to new combinations of the nonsense 
syllables presented during training. 

However, whereas the method of anticipa- 
tion was used in all previous experiments in 


TABLE 1 
TRAINING AND TESTING PARADIGM 


Training 
Testing 
Syllable 
Combinations 
Syllable 
Combinations | Responses 
ab i ad 
j cb 
de k bd . 
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TABLE 2 
SUMMARY OF RESPONSES TO NEW COMPOUNDS DURING TESTING PHASE 


Responses (and Associated Training Cues) 
"Test Figures i (ab-ba) j (ac-ca) k (dc-cd) No Response 
Both First Both First Both First Both First 
All Ss 
ad-da 22 10 45 21 86 45 5 3 
bd-db 78 37 14 7 62 31 4 4 
cb-bc 86 38 27 13 34 18 11 10 
Non-error Ss 
ad-da 22 9 42 20 79 42 5 3 
bd-db 73 33 14 7 57 30 4 4 
cb-bc 83 38 26 12 30 16 9 8 


the series, simultaneous presentation of paired 
associates was used in the present case. 
The Ss were motivated to learn the pairings, 
though they did not respond during train- 
ing, by the statement in the instructions, 
“At the end of these learning trials you will 
be given test trials to see how well you 
have learned the syllables and their asso- 
ciated numbers, so it is important to pay 
close attention during the learning." 
During the first learning block each syl- 
lable compound was shown, followed by its 
reversal The particular one of the two 
compounds shown first and the orderings 
over the three twosomes were randomized. 
Within the second, the third, and the fourth 
blocks, the paired associates and their re- 
versals were presented in a completely ran- 
dom fashion, without regard to the succession 
of reversals. Exposure of pairs was 6 sec. 
with a 3-sec. interval between presentations. 
Following the fourth learning block, Ss 
were presented the sequence designated to 
them as "test trials" (not to be confused 
with trials of the testing phase when novel 
compounds were shown) which aimed at 
evaluating their previous learning. During 
these trials the syllables a, b, c, and d were 
shown (for 6 sec. each) as well as the com- 
pounds ab, ac, dc, in a random sequence 
and with the left-right order randomly 
chosen. Since each syllable was exposed 
on an individual digital display unit, the 
showing of a single syllable, like a, could 
be blank-a or a-blank (the blank referring 
to the one of two display units that showed 
no syllable). Thus, randomization of left- 
right positioning was just as meaningful for 
the single syllables as for the compounds. 
After all syllables and compounds were 
shown in this fashion, the sequence was re- 
peated but in a different random order. 
Moreover, during the second exposure of 


each, the left-right position of the first 
exposure was reversed. 

Since two of the four cues (a and c) 
were paired with two responses each, Ss 
were instructed to respond in such cases 
during the second of the two learning- 
evaluation sequences with "another number 
corresponding to the syllable if it had pre- 
viously been associated with more than one 
number.” They were told to give the same 
response twice in the other cases. 

The sequence of four training blocks (pre- 
sented in the manner of Blocks 2, 3, and 4) 
followed by trials involving syllables and 
compounds without associated responses was 
repeated. There was then a break of 1 min. 
preceding the testing phase in which the new 
compounds were introduced. These are 
shown in Table 1. 

In order to assure the required motivation, 
excluded from the analyses were the data 
of Ss who made one or more errors to com- 
pounds in the second set of trials in which 
only syllables and old compounds were shown 
without associated responses. 

This second (or testing) phase of the 
experiment consisted of a learning block of 
simultaneous paired associates (in the same 
manner as Blocks 2-8), followed by two 
more such learning blocks with interspersed 
test trials of the new compounds (see Table 
1). Three of the six new compounds occur- 
red in the initial one of these two final blocks 
and the other three in the second one. The 
Ss were instructed to observe old com- 
pounds and their paired responses, and to 
respond to a new compound (which was not 
paired with a number) with the number they 
thought was “associated most closely with 
it.” 

Results—A summary of response fre- 
quencies over all Ss to new compounds may 
be seen in the upper portion of Table 2 
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Under the column heads marked “Both” are 
given the numbers of responses to both 
left-right arrangements of the test figures 
shown to the left of the row. Thus, 22 of 
the responses given to test figures ad and 
da (both of which were shown to each S) 
were the numbers previously paired with 
ab and ba (coded as i). Under the column 
heads marked "First" are the response fre- 
quencies to the particular one of the test 
figures in the row which occurred first dur- 
ing the testing phase. 

The operation of the novelty principle in 
responding to the new compounds takes the 
form of a preference for the response k 
rather than i or j to ad, and a preference 
for i rather than j or k to cb. Of course an 
hypothesis of response choice on the basis 
of maximum cue validity or minimum am- 
biguity would lead to the same expectations, 
but that basis of response choice was re- 
peatedly shown by Binder and Estes (1966) 
to be of virtually no influence in this experi- 
mental context. Since ad and cb are experi- 
mentally and inferentially equivalent (the 
nonsense syllables and numbers were assigned 
randomly), it is natural to pool the re- 
sponses to them in testing for the movelty 
principle So doing produces, in the Both 
category, 172 responses appropriate to d (in 
ad) or to b (in cb), and 128 responses to 
a or. to c (in ad and cb, respectively). The 
bias is in the direction of preference for more 
novel cues. The corresponding frequencies 
obtained for Both in the comparable experi- 
ment of Binder and Estes (1966), where 
learning was by the anticipation method, 
were 255 and 153. That is, the present 
573 relative frequency of novel responding 
compares with the earlier .625. 

Similarly, in the First category, the d 
and b responses summed to 83 in the cur- 
rent experiment and 121 in the earlier one, 
while the a and c responses summed to 62 
currently and 85 earlier. The current and 
previous proportions of responses associated 
with novel cues are .572 and .587. 

Thus, it is clear that the tendency to 
choose the response associated with the more 
novel of two cues in a new compound is in 
evidence when the experimental procedure 
assures that neither of the cues was selec- 
tively ignored during the preceding learning. 

Table 3 contains the proportions of errone- 
ous responses (inappropriate responses plus 
response failures) to the components a, b, c, 
and d presented as tests of learning after 
Blocks 4 and 8 of the learning phase. Al- 
though the probability of am erroneous re- 


TABLE 3 
PROPORTIONS OF ERRORS TO CUE-COMPONENTS 


Errors on Test Series 
During Learning 


Cue 
After Block 4 After Block 8 
a 4209. .037 
b .120 013 
c .082 044 
d .209 013 


sponse to a or c would seem to be higher 
than to b or d in the series following the 
eighth block, the proportions of errors are 
all so small as to assume adequate learning 
of the components, 

But to remove any lingering doubt, as a 
final analysis for evidence of the novelty 
principle when the operation of cue selection 
strategies is not feasible, the responses to 
the new compounds of the testing phase were 
analyzed for only those Ss who made no 
errors to the cue components presented after 
the eighth learning block. A summary of 
these results may be seen in Table 2. The 
pooled results from ad and cb, with the 
meaning and the order discussed above, are: 
Both, 162-120 and First, 80-57. The cor- 
responding proportions of responses in ac- 
cord with the novelty principle are .574 
and .584, which may be compared with the 
proportions .573 and .572 for the larger 
group of Ss. 

In the monograph, Binder and Estes pre- 
sented a preliminary quantitative model for 
the operation of the novelty principle: The 
sampling probability of a cue in a test com- 
pound is equal to the probability that it did 
not occur last during preceding learning. 
If, for example, cues a and b occurred in the 
ratio X:Y during training, the probability 
of sampling a from the test compound ab is 
Y/(X +Y). But, in the process of applying 
the formula it is necessary to use a correc- 
tion for inappropriate responding. To illus- 
trate, notice that 14 of the responses to the 
test compound bd (all Ss, Table 2) were j 
which had been associated previously with 
neither b nor d. Assuming that on some 
proportion $ of test trials, responses are 
determined by the cues presented and on 
the remaining (1—?) by irrelevant con- 
textual elements, we can estimate p by 


2 
ue apps) —. This gives as our 
154 3 


estimate f =.727. Using the symbol P[A+] 
to represent the probability of giving a re- 
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sponse associated with the more novel cue, 
2 
we have P[A+] = (727) > + (.273) Q 


=.576. Thus, the predicted probabilities of 
.576 and .424 for more and less novel re- 
sponding compares very favorably with the 
obtained relative frequencies of .573 and 
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427 for Both and .572 and .428 for First, 
respectively. 
REFERENCE 
BINDER, A., & Estes, W. K. Transfer of response 
in visual recognition situations as a function of 
frequency variables. Psychol. Monogr., 1966, 80, 
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DIFFERENTIAL RESISTANCE TO EXTINCTION DETERMINED BY 


A SMALL NUMBER OF DIFFERENTIAL INSTRUMENTAL 
CONDITIONING TRIALS* 


JAMES R. ISON anp ALLEN A. ADINOLFI 


University of Rochester 


46 albino rats received 18 differential conditioning trials, 9 to rein- 
forced stimuli (S+) and 9 to nonreinforced stimuli (S—), followed 
by 18 extinction trials to the former S+, to the former S—, or to 
both stimuli. The response to the former S+ was more persistent 
than that to the former S— even in those Ss (N = 26) not responding 
differentially at the end of acquisition. These data indicate that re- 
sponses associated with 2 discriminanda may be differentially resistant 
to extinction in the absence of differential excitatory strengths prior to 
extinction, which must be partially responsible for the retardation in 


discrimination learning produced by presolution reversal. 


The performance of a group of Ss in dif- 
ferential instrumental conditioning exhibits 
an initial stage of increasing but nondif- 
ferential speeds to both reinforced (S+) 
and nonreinforced stimuli (S—), followed by 
a period of divergence characterized pri- 
marily by decreasing speeds to S—. This 
decrement suggests that the growth of inhibi- 
tion associated with S— is a gradually de- 
veloping cumulative process, but an examina- 
tion of individual records reveals that many 
Ss move from rapid and nondifferential 
speeds to S— to slow and differential speeds 
within two adjacent trials (Ison & Claiborn, 
1967). The gradually diverging group 
curve results artifactually because different 
Ss shift stages at different trial num- 
bers. These abrupt speed shifts in differen- 
tial conditioning are identical to the abrupt 
choice shifts observed in simultaneous dis- 
crimination problems; indeed in procedures 
using a combination of procedures both shifts 
occur simultaneously unless the choice per- 


1This research was supported. by National Science 
Foundation. Grant GB 4373 to J. R. Ison. 


formance is overlaid by a position. preference 
(Pennes & Ison, 1967). It might be: inferred 
that the initial stage of phenotypic nondif- 
ferential performance is descriptive of geno- 
typic nondifferential learning, an inference 
made by Lashley (1929) and further de- 
veloped by Krechevsky (1932). The abrupt- 
ness of the shift is also consistent with their 
position, as it is with more recent develop- 
ments in simple probabilistic learning models 
which specify a small number of possible 
values (0 and 1) for an associative link, 

The early studies of simultaneous discrim- 
ination performance indicated, however, by 
demonstrating that presolution reversal re- 
tarded subsequent choice behavior, that there 
is some learned differentiation in the absence 
of differential choice performance (McCul- 
loch & Pratt, 1934). This finding was €x- 
tended by the demonstration that presolution 
reversal retarded subsequent speed differen- 
tiation in differential conditioning (Edwards, 
1966) suggesting thereby that there is some 
learned differentiation in the absence of dif 
ferential speeds. 


— The present experiment specifies further 
characteristics of this early differentiation. 
tis noted that reversal performance involves 
“the extinction of a previously reinforced 
response (to the former S+) whereas non- 
reversal (ie., maintenance of the initial con- 
ditions) depends on the extinction of gen- 
eralized response strength accrued to S— 
in the presolution period. It might be that 
these two responses differ in their resistance 
to extinction, which would account for the 
reversal data. 

" Method.—Male albino rats (N —48) were 
obtained at approximately 90 days of age 
from the Holtzman Company, Madison, Wis- 
consin, Two died during the experiment. 

The differential conditioning box (de- 
scribed in detail in Pennes & Ison, 1967) pro- 
vided two straight alleys (black or white) to 
either of which a common gray start box 
‘could be attached. Two Hunter clocks, con- 
nected to three photocells placed on the start 
door and 12 in, and 24 in, down each alley, 
recorded start and run times. 

For about 1 wk. prior to running Ss were 
handled daily for several minutes and placed 
On a 23-hr. water deprivation schedule with 
food ad lib. They were randomly assigned 

to a group receiving reinforcement on the 

black stimulus (S+) and nonreinforcement 
on the white stimulus (S—), or a group re- 
ceiving the reverse schedule. The pattern 

Of six daily trials was either +—-—++— 
oo ~++——-+, Acquisition was continued 

for nine trials to each stimulus. Then the 

individual records were inspected and two 

Sets of Ss formed; one consisted of Ss (N = 

20) showing any minimal differentiation be- 

tween the stimuli and the second of those Ss 

(N=26) clearly responding nondifferen- 
tially. Any S showing a disruptive stop or 
Tecoil on S—, or slow speed to S— un- 
_ &Xpected from the range of adjacent speeds 

to S+ was excluded from the nondifferen- 
- tating group. Extinction training began on 
_ the next day and continued for 6 days under 
One of three conditions: Group W on both 
— Stimuli (three nonreinforced exposures to 
- each per day) ; Group B1 on the former S+; 

and Group B2 on the former S— (three ex- 

- PoSures to one stimulus per day). Approx- 
_ imately equal numbers of Ss within the dif- 

€rentiating and the nondifferentiating sets 
— Were randomly assigned to each condition. 

. the reinforcement was .2 cc water placed 
d a foodcup behind a baffle at the end of 
€ positive alley. On nonreinforced trials 
E Íood cup was absent and S detained in 

the goal region for 8 sec. An intertrial 
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Fic. 1. Response speed to S+ and S— in the 9 
acquisition trials and the 6 days of extinction for 
groups receiving both S-- — and S— — in extinction 
(Group W), or only S+—(Group B1) or S—— 
(Group B2). 
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interval of about 10 min. for Group W and 
20 min. for Groups B1 and B2 was obtained 
in extinction, 

Results—Performance of Ss which ex- 
hibited no initial differentiation is presented 
in Fig. 1, which depicts trial means in acqui- 
sition and day means on the last acquisition 
day and the six extinction days. These Ss 
responded slightly faster to S— than to S+ 
in acquisition. This is characteristic of early 
trials in differential instrumental condition- 
ing and reflects, in part at least, differences in 
the sequences of reinforcements and nonrein- 
forcements preceding S+ and S— presenta- 
tions (that is, S+ but not S— may be 
preceded by two consecutive nonreinforce- 
ments and S— but not S+ may be preceded 
by two consecutive reinforcements), Speeds 
to the former S+ (S+—) and to the con- 
tinued S— (S— —) separated in extinction 
with a greater divergence within Group W 
than between Group B1 and B2. Three sets 
of analyses of variance were performed on 
the extinction data. The first contrasted 
day mean speeds to the stimuli within Group 
W. This yielded a main effect for stimuli, 
F (1, 7) =29.63, p<.01 (start), F (1, 7) 
= 10.00, p<.05 (run); and a Days X Stim- 
ulus interaction, F (5, 35) =2.75, p<.05 
(start), F (5, 35) 2492, p<.01 (run). 
The second set of analyses contrasted 
day mean speeds to each of the stimuli be- 
tween Group Bl and Group B2. These 
yielded a Days X Groups interaction, F (5, 
80) = 1.85, p < .05 (start), F (5, 80) = 2.62, 
p € 05 (run), but no main effect attributable 
to Groups. Together these analyses indi- 
cated the greater strength of the response 
to S+— in extinction. 

Finally, speeds to S+— and to S— — in 
Group W were contrasted with speeds in 
Group Bl and Group B2, respectively. 
Analyses of speeds to S— — confirmed the 
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observation that Group W responded more 
slowly than Group B2, yielding a Groups 
main effect, F (1, 15) = 7.59, p < .05 (run); 
and a Groups X Days interaction, F (5, 
75) =2.04, p<.05 (start), F (5, 75)= 
2.65, p < .05 (run). Comparable analyses of 
speeds to S+ between Group W and Group 
B1 yielded only a Groups X Days interaction, 
F (5,75) —425, p € .01 (start). 

The Ss with minimal differentiation. in 
acquisition exhibited a similar divergence 
of speeds to S+— and to S— — in extinc- 
tion. Analysis of their data yielded a similar 
pattern of results except that Group W did 
not differ from Group B2. 

Discussion —The hypothesis suggested was 
that responses associated with different dis- 
criminanda may be differentially resistant to 
extinction even though not different in speed 
prior to extinction. This was clearly sup- 
ported in the present experiment, particularly 
in the group given nonreinforced exposure 
to both stimuli. Whatever is learned in the 
early stages of differential instrumental con- 
ditioning thus affects subsequent persistence 
but not concurrent speed or choice behavior 
in favor of S+. Within the context of the 
theory of frustrative nonreward (Amsel, 
1962) one might suggest that these early 
nonreinforcements produced a conditioned 
frustration response on S— which was sub- 
threshold for avoidance behavior. It fol- 
lows that the effect of subsequent nondif- 
ferential nonreinforcement would be first ap- 
parent on the approach to S— — because this 
stimulus was already partially conditioned 
to anticipatory frustration. Additionally, it 
may be noted that an anticipatory frustration 
response which was subthreshold for avoid- 
ance behavior would have purely energizing 
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properties. This provides an additional ex- 
planation of the faster speeds to S— in the 
early trials of acquisition, but only a careful 
analysis of different trial patterns will tell 
whether this explanation is necessary after 
the effects attributable to different reinforce- 
ment and nonreinforcement sequences are 
assessed. 

An earlier explanation of the retardation 
in discrimination learning produced by pre- 
solution reversal was that the differential 
excitation associated with the discriminanda 
was dominated by strong position habits 
(Spence, 1936). "The present account does 
not conflict with this explanation and is re- 
stricted to the early trials of discrimination 
learning in which excitatory strengths asso- 
ciated with each stimulus do not favor the re- 
sponse to the reinforced stimulus, 
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ON THE GENESIS OF ABSTRACT IDEAS! 


MICHAEL I. POSNER AND STEVEN W. KEELE 
University of Oregon 


Previous work has shown that Ss can learn to classify sets of patterns 
which are distortions of a prototype, even when they have not seen 
the prototype. In this paper it is shown that after learning a set 
of patterns, the prototype (schema) of that set is more easily classified 
than control patterns which are also within the learned category. As the 
variability among the memorized patterns increases, so does the ability 
of Ss to classify highly distorted new instances. These findings argue 
that information about the schema is abstracted from the stored instan- 
ces with very high efficiency. It is unclear whether the abstraction of in- 
formation involved in classifying the schema occurs while learning the 
original patterns or whether the abstraction process takes place at the 
time of the first presentation of the schema. 


When a man correctly recognizes an 
animal he has never seen before as a 
dog, he has manifested an ability to 
generalize from previous experience. 
What has he learned that allows him 
to make the classification success- 
fully? This question has been dis- 
cussed in various forms since Aris- 
totle. Some philosophers suggest a 
process of abstraction in which S 
builds up a representation of a figure 
(e.g., triangle )which is different from 
the instances he has seen. Others 
have denied the reality of such com- 


! This research was supported in part by 
National Science Foundation Grant GB 3939 
to the University of Oregon. A preliminary 
Version of Exp. I was included in a report 
Presented at the XVIIIth International Con- 
gress of Psychology, August 1966. The 
authors wish to thank Barbara Kerr, William 
Eichelman, and Stanley Sue for their help in 
conducting this research. 


posite representations or abstractions, 
For example, Bishop Berkeley pointed 
out that he could search his imagina- 
tion in vain for the abstraction of a 
triangle which was neither equilateral 
nor scalene but which represented 
both of these and all other triangles at 
once. The philosophical idea of ab- 
stract representations entered modern 
psychology from. clinical neurology 
through the work of Barlett (1932) on 
schema formation (see also Oldfield 
& Zangwill, 1942). 

In the areas of perception and pat- 
tern recognition, psychologists have 
studied questions related to schema 
formation. Attneave (1957) demon- 
strated that pretraining on the schema 
(prototype) of a set of patterns could 
facilitate later paired-associate learn- 
ing. Subsequently Hinsey (1963) 
showed that pretraining on the proto- 
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type pattern is superior to pretrain- 
ing on a peripheral pattern. How- 
ever, these studies suggest only that 
knowing the schema can aid later 
learning and do not reflect on whether 
S in fact abstracts information con- 
cerning the schema in the course of 
learning. 
Attneave's (1957) study, like most 
of the subsequent experiments, used 
stochastic distortion rules to obtain 
patterns which varied around a proto- 
type. Forrules of this type the proto- 
type represents a kind of average or 
central tendency of the distortions. 
Following this same line, Posner, 
Goldsmith, and Welton (1967) showed 
that the rate at which Ss learned to 
classify a list of patterns was a func- 
tion of the amount of distortion of the 
instances from their respective proto- 
types. As the amount of distortion 
increased, so did the variability among 
instances within a category. This 
increase in variability served to re- 
duce the rate at which the category 
was learned. Evans and Edmonds 
(1966) have developed much the same 
theme. They also showed that Ss 
could learn a discrimination between 
patterns generated from different pro- 
totypes without having seen the pro- 
totypes. This discrimination could 
be obtained with or withoutknowledge 
of results. These studies indicate very 
little, if anything, about the use of a 
schema. That Ss can learn to dis- 
criminate patterns without seeing a 
prototype does not indicate that ab- 
straction is involved or that the 
schema is itself being learned or used. 
The philosophical notion of abstract 
ideas is vague but it does suggest that 
information which is common to the 
individual instances is abstracted and 
stored in some form. In its strongest 
sense, this might be translated opera- 
tionally into the hypothesis that the 
commonalities among a set of patterns 
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are abstracted during learning and 
that they alone are stored. In the 
case of patterns obtained by statisti- 
cal distortion rules, this suggests that 
S abstracts the prototype. A less ex- 
treme hypothesis suggests that S stores 
the abstracted schema in addition to 
the individual instances. A still 
weaker interpretation is that Ss will 
recognize the schema better than pat- 
terns which are similar to the memor- 
ized instances but which are not their 
prototype. This last hypothesis 
would not necessarily require the ab- 
stracting process to take place during 
learning. 

The studies reported in this paper 
examine various transfer tasks in an 
effort to understand what S stores dur- 
ing the process of learning to classify 
distorted patterns. The stimuli are 
meaningful or nonsense dot patterns 
which can be distorted by statistical 
rules. In Exp. I and II, different 
groups of Ss learned to classify high 
and low variability distortions of the 
same prototype. They were then 
transferred to learning or recognition 
tasks which involved new distortions 
not previoulsy seen. In Exp. III all 
Ss learned to classify distortions of 
high variability. They were then 
transferred to the following patterns: 
old distortions just memorized, the 
schema of the memorized instances, 
and control patterns at varying dis- 
tances from the memorized patterns. 
Performance in these transfer tasks 
was used to infer the role of abstrac- 
tion and of pattern variability in recog- 
nition. 

EXPERIMENT I 

The original learning in this experi- 
ment involved instances of four differ- 
ent prototypes. One group had small 
distortions of the prototypes, while for 
the other group the distortions were 
large. After reaching critieron on the 
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original learning task, both groups 
were transferred to a list of patterns 
which were more highly distorted than 
those in either of the two original 
lists. Previous work (Posner et al., 
1967) demonstrated that the transfer 
list patterns were equal both in physi- 
cal and perceived distance from the 
patterns in the two original lists. 
Since these new patterns are equally 
similar to the two original lists, any 
differences between the groups in 
transfer must be due either to the dis- 
tance of the memorized patterns from 
their prototypes or to their distance 
from each other (variability). If a 
clearly defined schema was of primary 
importance in transfer, the small dis- 
tortion group should show better 
transfer. If variability is more impor- 
tant, the larger distortion group 
should show better transfer. A con- 
rol group with no original learning 
vas used to assess the direction of the 
transfer effects. 


Method 


Subjecis.—The Ss were 36 introductory 
Psychology students at the University of 
Wisconsin who received course points for 
participating in the experiment. 

Materials.—The prototype patterns con- 
sisted of a triangle, letters M and F, and a 
random pattern, all made from nine dots 
within a 30 X 30 matrix. Pictures of proto- 
types and some of the distortions were pre- 
viously published (Posner, Goldsmith, & 
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Welton, 1967). From each of the four origi- 
nals, six distortions were constructed at each 
of three different levels (1, 5, and 7.7 bits/ 
dot) The detailed statistical rules and dis- 
tance data have also been published (Posner 
et al, 1967). The six distortions were 
arbitrarily divided into two lists of three 
distortions each. Each list, therefore, con- 
sisted of 12 patterns in total, divided into 
three distortions of each of the four different 
prototypes (triangle, M, F, and random). 
Patterns were placed on 2 X 2 slides and each 
was duplicated three times, thus providing 
three independent orders for each list. 
Procedure.—The 36 Ss were randomly as- 
signed to one of three conditions and to one of 
the two lists within the condition. All lists 
consisted of 12 patterns of one particular level 
of distortion. The conditions were: learning 
of Level 1 patterns, learning of Level 5 
patterns, and no original learning (control). 
The exact procedure was reported previously 
(Posner et al, 1967). Briefly, a slide was 
presented and remained on until S pressed one 
of four buttons which represented his choice. 
Then a feedback light indicating the correct 
button for that slide came on and remained 
on during the 8-sec. interstimulus interval. 
The S continued through trials until he cor- 
rectly classified two complete lists in a row. 
After completing the original learning, the 
two experimental groups were transferred to 
a list of 7.7-bit distortions. The control group 
began its session with the 7.7-bit list. The 
transfer list was learned by all groups in the 
same way as the original list except that the 
learning was terminated at the end of six 


trials. 


Results 
Table 1 shows the basic results of 
the experiment. The two subgroups 


TABLE 1 


MEAN ERRORS TO CRITERION FOR ORIGINAL 


LEARNING AND TRANSFER TASKS, Exp. I 


, Original Learning 


Transfer 


Trial (Mean Errors) 


Ei MAW, 
trors to Criterion : » B 7 i e z 
oo RRA 
NI l 8.1 6.2 5.3 5.6 44 3.7 5.6 
Gi 
123 p 6.0 m5 4.5 3.7 3.2 3.0 4.3 
TEE i 8.3 7.8 7.6 6.0 5.5 5.8 6.8 
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within each condition were combined 
since the sublists were arbitrary sam- 
plings of the statistical rule which 
governed the distortions. As ex- 
pected, the group at Level 5 made 
more errors in original learning than 
did the group at Level 1. This re- 
plicated findings reported previously 
(Posner et al., 1967). 

During the transfer task Group 1 
made more errors on each of the first 
six trials than did Group 5. The con- 
trol group showed more errors on each 
trial than either of the two experimen- 
tal groups. Analyses of variance of 
both the first trial and of all six trials 
wererun. For the first trial the over- 
all effects of groups was significant, 
F (2, 33) = 10.6, p < .01. Subse- 
quent / tests showed that on the first 
trial Group 1 was significantly worse 
than Group 5 but not did differ from 
the control group. The analysis of all 
six trials also showed a significant ef- 
fect of groups, F (2, 33) = 32, p < 
01, Subsequent / tests showed that 
Group 1 was significantly worse than 
Group 5 and significantly better than 
the control. 


Conclusions 


The results of this study indicate that 
transfer from the broader (Level 5) 
concept was better. This occurs de- 
spite the fact that the average distance 
between corresponding dots and the per- 
ceived similarity of the patterns at 
Levels 1 and 5 to those at Level 7.7 are 
the same (Posner et al., 1967). More- 
over, the minimum distance from Level 5 
patterns to any of the new distortions 
is at least as great, on the average, as the 
minimum distance of the Level 1 pat- 
terns from the new distortions. Thus 
the superior performance of the groups 
at Level 5 cannot be due to perceived 
similarities or actual physical distance 
between the learned patterns and the new 
instances. In addition, Ss at Level 1 
uniformly reported the correct names of 
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the meaningful patterns, whereas Ss at 
Level 5 rarely did. Thus having the 
verbal label does not appear to help as 
much as practice in classifying patterns 
which had considerable noise or 
variability. 

The performance of the control group 
may have been suppressed somewhat 
due to lack of warm-up prior to the 
Level 7 list. However, it seems likely 
that both Level 1 and 5 are showing 
positive transfer due to their specific 
learning experience as well as generalized 
learning-to-learn. The unpaced nature 
of the learning situation would probably 
reduce learning-to-learn effects found in 
the usual anticipation methods. There 
are two serious objections which could 
be raised to the differences between 
groups at Levels 1 and 5. First, is the 
initial surprise which Ss at Level 1 had 
when confronted with highly distorted 
patterns. This is suggested by the 
finding that Level 1 is not superior tc 
the control group on the first trial. Eve] 
though Level 1 remains below Level / 
on each trial, it might be argued that a 
learning procedure confounds initial 
recognition with later performance. 
Second, is that Level 5 Ss took more 
trials to learn and it might be argued, 
therefore, that they have learned methods 
of how to deal with the storage of infor- 
mation from distorted patterns. Their 
superior performance would then be due 
to the appropriateness of the strategies 
they had previously learned to the new 
material. Experiment II was designed 
to eliminate some of these problems. 


ExPERIMENT II 


In Exp. II the transfer task was 
pattern recognition rather than learn- 
ing. It was not necessary for S to 
memorize the new material and thus 
storage strategies attained in original 
learning were not appropriate. Since 
24 different patterns were shown, per- 
formance on each slide was less depen- 
dent upon recognition of previous 
slides than it is in a learning situation 
In order to assess the relative 10- 


fluence of new learning during the 
transfer tasks, groups were run with 
and without feedback. 


C Subjects—The Ss were 32 students in 
"introductory psychology at the University of 
regon who received course points for their 
rticipation. 

Material.—The original learning lists were 
ntical to those used in Exp. I. The trans- 
er material consisted of a list of 24 different 
"slides. The 24 slides were six random samples 
‘of the 7.7-bit distortion rule for each of the 
four original patterns. 

T Procedure.—The learning procedure was 
the same as in Exp. I. The Ss were divided 
o two groups. Sixteen Ss learned a list 
Level 1 and 16 Ss learned a list at Level 5. 
ning was continued until two successful 
petitions of the list were completed. The 
were then given pattern recognition in- 
\\ structions. These instructions indicated that 
à should classify each successive slide as 
“¥apidly as possible into one of the four cate- 
ories that they had learned during the origi- 
l learning task. For half the Ss in each 
ondition feedback was given after each 
lassification. The other half received no 
feedback. The Ss were shown the transfer 
list twice in different random orders. The 
interslide interval was 9 sec. 


H 
Results 

- Table 2 shows the mean errors to 
riterion in original learning and the 
average error in the pattern recogni- 
tion tasks for all conditions for each 
‘block of four trials. As before, Ss in 


Level 5 took longer in original learning 
than those in Level 1. There was no 


ie 


} 
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significant difference in speed of 
learning between feedback conditions, 
Analysis of variance of errors in the 
pattern recognition task showed that 
the effect of level was significant, F 
(1, 28) = 9, p < .01, and the effect 
of feedback conditions was also sig- 
nificant, F (1, 28) = 4.8, p < .05. 
There were no significant interactions 
between level and feedback or be- 
tween either of the two main variables 
with successive blocks of 24 slides. 
Table 2 also shows the mean errors for 
successive blocks of four trials in the 
pattern recognition task. There is a 
nonsignificant trend for the differ- 
ences between Levels 1 and 5 to be 
reduced with practice particularly 
when feedback is present. 

A correlation coefficient was com- 
puted between groups at Levels 1 and 
5 over the particular slides to which 
errors were made during pattern rec- 
ognition. This correlation was .83 in- 
dicating that both groups tended to 
miss the same patterns. A rank 
order correlation of .97 between dis- 
tance from the prototype and errors 
indicated that patterns most distant 
from the prototype were more difficult 
to recognize. 


Conclusions 

The results of this study confirmed 
those obtained in Exp. 1, Once again, 
Ss who had been trained with the high 
variability patterns did better on transfer 


TABLE 2 
MEAN Errors IN ORIGINAL LEARNING AND PATTER! 


N Recoenition, Exe. I 


Block of Four Trials 


Original 
Learning 


Condition 


"Group 1 


Feed 2|16 17|15 | 13 | 16 | 13 | 12 11] 11] 11 | 15 | 10.8 
No Feedback it 16 19 | 15 | 14 | 17 | 21 | 17 | 16 11 |10 | 19 | 13 | 11.9 
2|17|10|14| 9| 8|15 16|12| 9 9| 9.1 

E 1ó tt 13 | 13 | 17 | 15 | 14 | 17 | 15 | 9 14 | 11 | 10.4 
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than those trained with the low varia- 
bility patterns. Moreover, they main- 
tained the advantage over the first 24 
slides, even though each slide was dif- 
ferent. Therefore, it is difficult to 
argue that the deficiency in transfer for 
Level 1 Ss was due to an initial startle at 
seeing patterns which were more dis- 
torted than those used in original 
learning. Moreover the transfer task 
reduced or eliminated the advantage of 
general learning strategies attained in 
the original task (learning-to-learn). 
However, it could still be argued that 
the advantage of Level 5 is primarily in 
the kinds of criterion which Ss set for ad- 
mission of a particular pattern into one 
of the meaningful categories. The use of 
three highly familiar categories and one 
nonsense pattern within the same list 
may have contributed to this. There is 
a strong tendency for Ss at Level 1 to 
classify patterns about which they were 
unsure into the random category. The 
percentage of random responses made 
during the 48 trials of pattern recognition 
were 25.5, 37.5, 33.5, and 42.9 for condi- 
tions: 5 feedback, 5 nonfeedback, 1 feed- 
back, and 1 nonfeedback, respectively. 
Table 3 shows a breakdown of the pro- 
portion of correct and false alarm re- 
sponses during pattern recognition for 
each of the prototypes. The false alarm 
rate is obtained by dividing the errors in 
a category by the number of possible 
errors, In the case of the three meaning- 
ful patterns, Level 5 Ss show a higher 
proportion of correct responses and 
about equal false alarms. For these 


TABLE 3 


PROPORTION OF CORRECT AND FALSE 
ALARM RESPONSES FoR EACH 


CATEGORY 
Category 
Triangl| M | F |Random | X 
Cond. 1 m mm 
Correct 59 |.49|.50| .53 |.53 


False Alarm | .10 |.13|.06| . 

Cond. 5 Fi d 
Correct -73 |.71|.60| .49 |.64 
False Alarm | .11 |.12|.05| .26 |.14 
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distortions, therefore, it is clear that . 
Level 5 Ss are showing better discrimi- 
nation than those trained on Level 1. 
For the random patterns, Level 1 Ss have 
both more correct responses and more 
false alarms. When these two measures 
are combined using a graphical method 
(Norman, 1964) the Level 5 Ss are. 
slightly superior in overall performance, - 
Thus even though Level 1 Ss have a 
higher proportion of correct responses 
with the random pattern, when false 
alarms are taken into consideration, they 

do not show better discriminability; 

The strong tendency of Ss in Level 1 — 
to use the random category suggests that 
they were somewhat reluctant to classify 
distorted instances into one of the mean: 
ingful categories. While in this study 
the main differences between groups. 
seem to be in the ability to discriminate: 
the categories, it would seem reasonable 
to explore changes in criterion particu- 
larly in studies where a forced choicd 
between categories is not required. 


EXPERIMENT III 


The previous two experiments have * 
indicated that Ss do learn something —— 
about the variability of instances that. — 
they have seen. In both of these ex- — } 
periments, Ss in each group had the gas 
same prototypes. If Ss had been 
storing only the schema, then Level T 
1 Ss should have shown better perfor- — f 
mance than Level 5 since it is easier tO 
define the prototype based on Level 1 — 
patterns than based on Level 5 pat 
terns. The results are in the opposite 
direction, indicating that Ss are 
learning some information about the 4 
individual patterns which they use 1 
their later judgment. In this experi- - 
ment an effort is made to determine 
directly whether Ss are also learning 
information about the prototype. 


Method 


Subjects—The Ss were 30 students Te 
cruited from the University of Oregon Em- 
ployment Service and paid $1.50 per hour for. 
their services. 1 
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Material.—There were two lists of original 
learning materials. Each list contained 12 
slides. The 12 slides were four distortions of 
three different prototypes. A set of distortions 
of the same prototype is called a “concept.” 
The prototypes were different for the two lists 
and were constructed by placing dots in nine 
randomly selected positions in a 30 X 30 
matrix. The four distortions of each proto- 
type were constructed using the same four 
random samples of a 7.7-bit distortion rule. 
Thus the distances from each of the proto- 
types to its four distortions were identical. 

The transfer material consisted of two lists 
of 24 slides. Three of the slides were the 
prototypes of the patterns in the learning 
lists. These represent the schema of each 
concept. Six slides, two from each of the 
concepts, were patterns memorized during the 
original learning (old distortions). Six slides, 
two from each of the concepts, were new 
7.7-bit distortions of the prototypes which 
had not been seen during learning. Six slides, 
two from each of the concepts, were new 5-bit 
distortions of the prototypes. Finally, three 
slides were new random patterns unrelated to 
any of the concepts which S had learned. 

In Table 4 the distances from the four 
memorized patterns of each concept to the 
respective transfer patterns are shown. The 
individual patterns are identified by a number 
orletter. In the case of the old distortions 
the two transfer patterns are identical to two 
of the stored patterns. These distance rela- 
tionships hold for all concepts in both lists, 
although the prototypes differ from one con- 
cept to another and between the two lists. 
The distances represent the sum of the vertical 
and horizontal distances from each dot in the 
Stored pattern to the corresponding dot in the 
transfer patterns. The numbers are in units 
of 1/20 of an inch. Previous results have 
shown that, for a given grain size, the loga- 
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rithm of this measure is linearly related to 
perceived distance (Posner et al., 1967). 
Procedure.—The 30 Ss were divided ran- 
domly into two equal groups and assigned to 
the original learning lists. Original learning 
proceeded as described in the previous experi- 
ments until completion of two correct classi- 
fications of thelists. After the original learn- 
ing was complete, Ss were given their respec- 
tive transfer lists in the pattern recognition 
procedure described in the last experiment. 
On the same day as the original learning they 
went through the transfer patterns twice, for 
a total of 48 patterns. Twenty-four hours 
later Ss returned to the laboratory and ran 
through the pattern recognition tasks four 
additional times. During the pattern recog- 
nition task, no feedback was provided. Both 
the classification chosen and the speed of 
classification were recorded. The Ss were 
instructed to respond accurately, but to try 
to respond as rapidly as they could when each 
new pattern was presented. Concepts were 
randomly assigned to switches for each S, 


Results 


Original learning required an aver- 
age of 41 and 34 errors to criterion, for 
List A and List B, respectively. This 
difference was not significant. The 
error and speed data for the pattern 
recognition task are shown in Table 5. 
Since the lists are replications of each 
other except for the use of different 
randomly selected original patterns 
and the results are similar for the two 
groups, all 30 Ss were combined in sub- 
sequent analyses. 

The analyses were performed by 


TABLE 4 
DisrANCES FROM SroRED EXEMPLAR PATTERN TO EACH TRANSFER PATTERN 
Schema Old Distortion New Level 7s New Level 5s 
Stored Pattern 

2 4 A B Ce D Eb 
1 66 37 | 89 49 | 48 51 
2 P 4 77 104 98 65 "n 59 
3 65 88 65 82 60 54 51 56 
4 65 71 0 83 87 51 62 55 
52 59 52 89 83 54 58 55 
14.7 42 39 26 28.5 19 


% Errorsin Transfer | 15 


* List B only, 
b List A only. 


360 MICHAEL I. POSNER AND STEVEN W. KEELE 
TABLE 5 
PERCENTAGE OF ERRORS AND SPEEDS (IN SEC.) FOR CLASSIFYING 
‘TRANSFER PATTERNS For Day 1 AND Day 2 
Day 1 Day 2 
Old |Schema| 5 7 New | Old |Schema| 5 7 New 
List A 
% Error 10 13:3 -| 28:3" [r35 — 9.7 | 144 | 24.1 — 
RT 2.04 | 2.19 | 2.36 | 2.52 | 2.88 1.86 | 1.88 | 2.03 2.51 
List B 
% Error 16.1 | 16.6 | 30.5 |41.7 — |158 | 16.1 |25.3 — 
RT 197| 2.37 | 2.71] 3.22 | 2.95 1.88| 2.06 | 2.12 2.35 
Average % Error | 13.0 | 14.9 |26.9 | 38.3 — ]|128 | 153 | 24.5 = 
RT 2.01 | 2.28 | 2.53} 2.87 | 2.91 1.87| 1.97 | 2.07 2.43 


sign tests because of the high correla- 
tion between successive experiences 
with the same pattern. Separate 
analyses were run for Day 1 and Day 
2. The results of the sign tests are 
shown in Table 6. 

On Day 1, it is clear that Ss show no 
significant differences in proportion of 
errors between the patterns which 
they had just finished learning and the 
prototypes which they had never seen. 
It is also clear that both the old distor- 
tion and the schema have a signifi- 
cantly lower error rate than any of the 
new distortions seen by S. The Level 
5 distortions showed significantly 
better recognition than the Level 7. 


No error data can be given for the new 
random patterns since there is not any 
correct classification for these patterns 
On Day 2, there is a slightly lower 
mean error for the old distortions than 
for the schemas. However, when the 
data are analysed by individual Ss, 16 
show a higher proportion of error on 


the old distortions and only 11 have a. 


higher proportion on the schema. 
This difference does not reach signifi- 
cance by a sign test. On Day 2, the 
new distortions all show significantly 
more error than either the old distor- 
tions or schema patterns. Overall, it 
is clear that the schema patterns show 
no greater error than the patterns 


TABLE 6 


NUMBER OF Ss with HIGHER AVERAGE Errors or LONGER AVERAGE TIMES 
IN SPECIFIED CONDITIONS OF TRANSFER 


Error 


Day 1 Day 2 
Sign Testa Sign Test 
Old Distort 9 |Schema 9|Tie 12 Old 3 | on 
j ie ns Distort. ^ 16 | Sch 11 | Ti ns 
Levis 23 |Schema 5|Tie 2| 91 |LevdS ^ 24|Schema 4|Tie 2| 0 
vel 7 19 | Level 5 Tie 5| (05 | Level 7 24|LevelS 5|Tie 1 01 
Reaction Time 
Old Distort. 11 |Schema 19|Tie 0 ns Old Di: i 
i istort. 14 | Schema 16 | Tie 0 ns 
Level 5 22 |Schema 8|Tie 0|  .05 | Level 5 23 | Schema 7 | Tie 0 01 
Level 7 21 |LevelS 8|Tie 1 105 | Level 7 22 | Level5 8|Tie 0 205 
ew Randoms 17 |Level7 13|Tie 0 ns New Randoms 18|Level7 12|Tie 0 ns 


a All sign test were two-tailed. 
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which S had actually seen and 
memorized. 

On Day 1 the old distortions show 
faster classification times than the 
schema patterns. This approaches 
but does not reach significance by a 
sign test. In every other respect the 
Day 1 speeds give the same picture as 
the error data. On Day 2 there is no 
significant difference between the old 
distortion and the schema patterns in 
speed. The other differences on Day 
2 are identical to those discussed pre- 
viously for errors. 

A trial by trial analysis of errors and 
speeds was performed for the schema 
vs. old distortions. On the very first 
trial, 21 Ss have longer RTs to the 
schema while 8 Ss have longer RTs to 
the old distortions. This is signifi- 
cant (p « .01) by sign test. By the 
second trial the distribution is 14 RTs 
longer with the old distortion and 16 
with the schema, and on no subse- 
quent trial do more Ss show longer 
times to the schema. The error data 
are similar. On the first trial, 13 
have a higher proportion of errors on 
the schema and 7 on the old distor- 
tion. This is not significant by sign 
test; however, this tendency disap- 
pears after the first trial. 

The transfer lists contain five gen- 
eral types of patterns. These are the 
old distortions, schema patterns, the 
new distortions at Level 7, the new 
distortions at Level 5, and new ran- 
dom patterns. As described earlier, 
Table 4 shows a breakdown of the 
Various patterns used in the transfer 
list and their distances from each of 
the patterns shown in the original 
list. The schema pattern and the 5- 
bit distortions have roughly the same 
mean distance from the four stored 
Patterns. Nonetheless, the schema 
pattern always shows better perfor- 
mance in terms of mean errors than 
the 5-bit distortions. The distances 
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from the stored patterns also differed 
among the three 5-bit distortion pat- 
terns used in the transfer lists. The 
performance on those distortions did 
not seem to be closely related to their 
mean distance from the stored pat- 
terns. Therefore, mean distance does 
not prove to be a particularly good 
predictor in the range of distances 
which include the schema and Level 
5 patterns. However, a comparison 
of Level 5 patterns with the new 
Level 7.7 patterns shows that the 
patterns which have the larger mean 
distance are recognized more poorly. 
In summary, the old distortions, 
schema, and new Level 5 patterns 
have nearly identical mean distances 
from the memorized patterns, but the 
old distortions and schema are better 
recognized than the Level 5s and are 
not different from each other. 


GENERAL CONCLUSIONS 


Abstraction.—In the introduction some 
operational statements of the old notion 
of abstract ideas were suggested. The 
data of the present experiments confirm 
that some form of this proposition is cor- 
rect. The weakest operational form of 
this proposition which is consistent with 
the present authors’ findings is that the 
prototype (schema) of the stored pat- 
terns has a higher probability of recog- 
nition than other new patterns contained 
within the concept. This is confirmed 
both by the finding that the schema is 
better recognized than transfer patterns 
with similar distance relationships (Level 
5) and by the finding that, after its first 
presentation, the schema is as well recog- 
nized as the patterns which have actually 
been memorized by Ss. This form of 
the proposition is consistent with but 
more explicit than the idea of stimulus 
generalization. It singles out the proto- 
type of the patterns as unique. In other 
words, it shows that the maximal gen- 
eralization for multidimensional patterns 
of this sort occurs at the prototype even 
though other patterns are nearly the 
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same average distance from the stored 
exemplars. Although other patterns 
may have nearly the same average dis- 
tance from the distortions, the prototype 
must share the most common properties 
with the set of patterns generated from 
it. This proposition is stronger than a 
generalization notion because the schema 
pattern is, on the whole, as well recog- 
nized as the exemplars from which it is 
abstracted. 

The first and second experiments allow 
the authors to reject the idea that only 
the abstracted prototype is stored. 
Clearly the information about the indi- 
vidual patterns must also be present in 
order for a loose concept (high varia- 
bility) to give better transfer than a tight 
concept (low variability). Moreover, the 
variability is of-sufficient importance to 
overcome whatever advantage the tight 
concept has from a more clearly defined 
central tendency. The beneficial effect 
of variability confirms results in other 
areas of problem solving (Morrisett & 
Hovland, 1959) and pattern recognition 
(Dukes & Bevan, 1967) which argue for 
the importance of variability during 
training. It is also consistent with 
Attneave's (1957) suggestion that part 
of the process of learning to recognize 
patterns involves acquaintance with the 
limits of variability. 

Time of abstraction.—It is possible to 
ask when the information is abstracted 
which allows the efficient recognition of 
the central tendency. One possibility is 
that the abstraction of this information 
takes place during the learning task. 
This is undoubtedly the notion which 
philosophers have implied in discussing 
the genesis of abstract ideas. The present 
authors cannot either confirm or deny 
this form of the proposition from the 
present data. It could be that informa- 
tion concerning the central tendency is 
stored during learning, but it also could 
be that the abstraction takes place when 
the schema pattern is first shown to S. 
That is, S may not recognize the schema 
on its first presentation in the direct 
way in which he identifies the old dis- 
tortions. Rather he may respond cor- 
rectly on the basis of a calculation from 


stored information concerning the ex- 
emplars. The finding that RT to the 
schema is longer than to the old dis- 
tortions on the first presentation of the 
transfer list may indicate that S is cal- 
culating on the basis of his stored infor- 
mation. However, it could also mean 
that he has stored abstracted informa- 
tion but that it is not as clearly or com- 
pletely defined as information concerning 
the individual exemplars. In either 
case, once he has seen the schema he 
recognizes it with the same efficiency as 
the memorized patterns. If the schema 
information is not abstracted , during 
learning, then upon its first presentation 
S must store it as a particularly good 
example of its concept and treat it on 
subsequent trials as equivalent to a 
memorized instance. One way to dem- 
onstrate that abstraction occurs during 
learning would be to find a situation in 
which the schema, when first introduced, 
is recognized as well as or better than the 
patterns memorized during the original 
learning. r 

What is abstracted—In the present 
study the authors have used the word 
idea in a neutral sense. It is not at all 
clear what Ss abstract in learning to 
recognize the transfer patterns. To say 
Ss learn the central tendency and the 
variability of the patterns does not tell in 
what type of a coding system such infor- 
mation is stored. For example, Ss might 
have an image or mental picture of the 
individual instances or of the abstracted 
central tendency. Or perhaps the ma- 
terial is in the form of verbal description, 
such as has been suggested by various 
theories of short-term memory (Glanzer 
& Clark, 1963; Sperling, 1963). 

The data obtained here give only 
a very incomplete answer to these ques- 
tions. Introspective reports were taken 
from 15 Ss run in a pilot study with 
materials identical to one list of Exp. III. 
These reports suggested that some Ss 
used verbal rules which related to the 
patterns. The rules tended to empha- 
size position of dots, center of gravity, 
overall orientation of figure, familiar 
subgroups, and association to familiar 
objects. The rules were highly idio- 


cratic and some Ss verbalized no 
s at all. These verbal reports sug- 
that some of the storage, at least, is 
way of rules which are related to the 
mon features of the patterns within 
concept. Whether these verbal codes 
ipresent all of the information storage 
“are used in conjunction with other 
orage codes cannot be determined from 
e present data. 
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PAIRED-ASSOCIATE ACQUISITION AS A FUNCTION 
OF ASSOCIATION VALUE, DEGREE, AND 
LOCATION OF SIMILARITY * 


DOUGLAS L. NELSON ? 


University of Wisconsin 


In Experiment I S acquired a list of 12 pairs by the anticipation 
method. Variations of trigram stimulus characteristics included 3 levels 
of association value, 3 levels of interstimulus similarity, and 2 condi- 
tions of locus of identity. Control lists having zero-identity letters 
were constructed at each level of association value. When similarity 
was located in the initial letter position and stimuli were medium or 
high in association value, difficulty increased from zero to complete 
identity. Location of similarity in the last letter position did not 
impair acquisition rate as severely. Under conditions of low associa- 
tion value difficulty was independent of similarity regardless of locus. 
In Experiment II S acquired a paired-associate list representing either 
low or high association value in which there was high similarity in 
both 1st and last letter locations. For conditions of low and high 
association value, identity in both the 1st and last letter positions 
increased difficulty significantly beyond that obtained when identity 


was only in the Ist position and only in the last position. 


Underwood (1963) has concluded 
that within the paired-associate task, Ss 
select minimally differentiating stimulus 
cues thereby producing a disparity be- 
tween the nominal stimulus (presented 
by E) and the functional stimulus (to 
which the response is associated). On 
the assumption that S attends to all 
letters as a unit when stimuli are highly 
meaningful and to individual letters 
when stimuli are of low meaningfulness, 
he hypothesizes an interaction between 
stimulus meaningfulness and formal in- 
terstimulus similarity. Increasing in- 
terstimulus similarity should produce a 
greater decrement in performance when 


1This paper is based upon a dissertation 
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Brogden and submitted in partial fulfillment 
of the requirements for the Degree of Doc- 
tor of Philosophy. The research was sup- 
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mittee of the Graduate School from funds 
provided by the Wisconsin Alumni Research 
Foundation. 

2 Now at the University of South Florida. 


meaningfulness is low than when it is 
high. Evidence consistent with this hy- 
pothesis is equivocal. McGhee (1961) 
reports supportive data using words vs. 
consonant syllables (Exp. II) but not 
when meaningfulness is defined by de- 
gree of trigram integration (Exp. I) or 
when consonant syllables from extreme 
ends of the Witmer scale are used 
(Exp III). In contrast Goss, Nodine, 
Gregory, Taub, and Kennedy (Exp. Ia 
& Ib, 1962) find that difficulty signifi- 
cantly increases under high but not 
under low association value as simi- 
larity is increased from low to high. 
Nodine (1963) reports similar results 
using a mixed list design. However, 
when a recall rather than the standard 
anticipation format is used, the associa- 
tion-value-similarity interaction is not 
significant, although the direction of 
the numerical differences is similar 
(Goss & Nodine, Exp. II & III, 1965; 
Levitt & Goss, 1961). 

In each of these studies similarity 


364 


PA ACQUISITION AS DETERMINED BY STIMULUS CHARACTER 365 


is varied unsystematically across posi- 
tion, reading the trigram from left to 
right. Recent evidence indicates that the 
location of a letter within a verbal stim- 
ulus affects the degree of association 
between that letter and the response 
(Jenkins, 1963; Postman & Green- 
bloom, 1967). Postman and Green- 
bloom (1967) have shown that the lo- 
cus effect is significantly greater for 
hard-to-pronounce than for easy-to- 
pronounce stimuli and that in the 
former condition single letter associa- 
tion is largely limited to letters in 
the initial positions. Assuming that 
pronounceability and association value 
are positively related. (Underwood & 
Schultz, 1960), the results of the Post- 
man and Greenbloom (1967) study 
suggest a potential interaction between 
association value and locus. How- 
ever, the first-letter-association result 
occurring with the hard-to-pronounce 
stimuli should be attenuated under con- 
ditions of high similarity within the 
initial letter positions. These data 
suggest potential interactions between 
association value and interstimulus 
similarity, association value and locus, 
and between association value, similar- 
ity, and locus. 

The purpose of Exp. I is to covary 
association. value, interstimulus simi- 
larity, and locus of similarity in an 
effort to evaluate their interactive ef- 
fects on rate of paired-associate acquisi- 
tion, Three levels of association value 
were combined with three levels of sim- 
ilarity and two conditions of locus (first 
vs. last letter). Control lists represent- 
ing zero similarity were also used. 
Following Exp. I two treatment condi- 
tions were added. For conditions of 
low and high association value, lists 
with high similarity within both first 
and last letter positions were COn- 
structed. These conditions are Te- 
ferred to as Exp. II. 


EXPERIMENT I 
Method 


Verbal material—Each S acquired a 
single list of 12 pairs. Twenty-one stimulus 
lists were constructed from CVC norms 
compiled by Archer (1960). Eighteen of 
the lists represented all combinations of 
three levels of association value (low, 
medium, and high), three levels of simi- 
larity (4, 8, and 12 identical letters), and 
two conditions of locus (first vs. last letter 
position). Three control lists (zero simi- 
larity) representing each association value 
treatment were also constructed. For low, 
medium, and high association value the re- 
spective means and standard deviations were 
12.5% (4.7), 56.5% (4.4), and 97.3% (22). 
Each vowel (including Y) occurred twice 
within each stimulus set. Response terms 
were two-digit numbers in which each 
single digit (2-9) was used three times, 
with no inversions of digit sequence (62 
and 26 could not occur). Assignment of 
numbers to stimuli was random with the 
exception that no numbers beginning or 
ending in the same digits were assigned to 
stimuli having the same vowel (e.g. pairs 
SAD-56, SAG-58 or SAD-76, SAG-56 
could not occur). 

Manipulations of degree and locus of 
identity within each association value treat- 
ment produce lists which begin or end with 
a different number of identical letters, For 
example, identity for first-letter locus was 
produced in the following manner: within 
Tdentity-Cond. 4, four items began with the 
same letter while the remaining items began 
with different letters. In Identity-Cond. 8, 
eight items each began with the same letter 
and another four items each with a different 
first letter. In Identity-Cond. 12, all items 
began with the same first letter. All stim- 
uli with different first letters were selected 
from the zero-identity control whenever pos- 
sible. When identity was manipulated with 
respect to first-letter locus, there was no 
repetition among last letters. Conditions of 
last-letter location of identity were con- 
structed in a similar manner. 

Procedure.—The paired-associate list was 
practiced by the anticipation method to a 
criterion of one trial without error. Stimuli 
were pronounced on each test presentation. 
Slides of each stimulus and each stimulus- 
response pair were automatically projected 
by a Kodak Carousel slide projector driven 
by an auxiliary timer at a 2:2 sec. rate. 


366 


The intertrial interval was 10 sec. Se- 
quences of pairs were presented in four un- 
systematically varied orders, with the re- 
strictions that no two successive pairs in 
the list have the same vowel. The sequence 
of each of the four orders was changed for 
each S in each condition. To insure under- 
standing of instructions all Ss learned a 
practice list of two pairs (names-verbs) be- 
fore undertaking the experimental task. 
This practice list was presented in a man- 
ner identical to that of the experimental 
lists. 

Subjects.—There were 21 groups of 12 Ss 
all of whom were naive to verbal learning 
experiments, The Ss were assigned to con- 
ditions in blocks of 21, with one S from each 
condition per block. The assignment within 
blocks was determined by a table of random 
numbers. 


Results 


The experimental design provides for 
a series of analyses of variance of er- 
rors (omission plus overt) to the ac- 
quisition criterion, Fisher LSD tests 
and orthogonal comparisons were used 
to evaluate differences between means 
and interaction components. The five % 
level was the criterion of significance 
for all statistical tests. 

Figure 1 shows errors as a function 
of the degree of similarity and associa- 
tion value for first-letter-locus condi- 
tions, Statistical analysis shows that 
main effects and the interaction of as- 
sociation value and similarity are sig- 
nificant sources of variation. An or- 


[7 4 8 [ 
NUMBER OF IDENTICAL. LETTERS 
FIRST LETTER LOCUS 


Fic. 1. Errors as function of number of 
identical letters and association value for 
first letter locus of identity (N — 12). 
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LAST LETTER LOCUS 


Fic. 2, Errors as a function of number 
of identical letters and association value for 
last letter locus of identity (N = 12). 


thogonal breakdown of this interaction 
indicates that it can be attributed to a 
combination of the high and medium 
conditions vs. the low-association-value 
condition, In general, difficulty of ac- 
quisition increases in a linear fashion 
for high-and-medium-association-value 
conditions as similarity increases. Rate 
of acquisition under conditions of low 
association value is independent of sim- 
ilarity, This description is substanti- © 
ated by tests on simple effects. Mean 
differences equal to or greater than 
30.00 are statistically reliable. 

Figure 2 presents the results of the 
association value and similarity condi- 
tions for last-letter locus. As in Fig. 1 
the zero-identity controls are presented 
in the figure and are included in the 
statistical analysis. The pattern of re- 
sults is different from that of first-let- 
ter-locus conditions. Only association 
value is significant, although the inter- 
action approaches significance (p < 
08). Degree of similarity is not as 
effective a variable for medium- and 
high-association-value conditions when 
the similarity is located in the last let- 
ter. However, the fact that the inter- 
action approaches significance suggests 
reduction in effectiveness rather than 
total ineffectiveness. This conclusion 
is substantiated by an orthogonal break- 
down of the association value by simi- 
larity interaction. As under first letter- 
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locus conditions, the combination of the 
high and medium conditions accounted 
for a significant proportion of the in- 
teraction variance. Mean differences 
which equaled or exceeded 19.35 were 
statistically reliable. 

Statistical evaluation of the effects of 
locus requires that the zero-identity 
conditions be excluded from the analy- 
sis. Association value, similarity, and 
locus as well as all second-order inter- 
actions of these variables are significant. 
The third-order interaction is not. The 
association value by locus interaction 
indicates that significantly more errors 
occur under conditions of medium and 
high, but not under low-association- 
value conditions when identity is lo- 
cated in the first relative to last-letter 
position, The similarity by locus in- 
teraction shows that as similarity is 
increased, difficulty of acquisition sig- 
nificantly increases only when the simi- 
larity is located in the first letter 
position, 

Within-S comparisons of the effects 
of identity were made for Identity- 
Cond. 4 and 8 since the stimuli of each 
of these conditions could be subdivided 
into subsets of stimuli having a letter 
in common (similar subset) and stim- 
uli having no letters in common (dis- 
similar subset). This evaluation of er- 
rors per item was made across condi- 
tions of association value and locus. 
Table 1 presents the means for the sig- 
nificant association value by locus by 
subset interaction. Any mean differ- 
ence greater than or equal to 1.50 is 
statistically reliable. The results of 
this within-S analysis are generally 
consistent with the between-S data in 
showing that the greatest number of 
errors occur on items having letters in 
common, however, one exception occurs 
under conditions of low-association 
value. In contrast to the between-S 
null-effect of identity, the within-S data 


TABLE 1 


Mean Errors As A FUNCTION OF 
ASSOCIATION VALUE, Locus, AND 


SussET TYPE 
Locus 
First Last 
Letter Letter 
Similar 16.37 15.28 
Low 
Dissimilar 11.76 11.64 
Similar 15.85 12.31 
Medium 
Dissimilar 847 8.79 
Similar 12.32 7.39. 
High 
issimilar 6.97 7.78 


show that significantly more errors per 
item occur on similar subset types than 
on dissimilar subset types. This differ- 
ence is statistically reliable within each 
locus condition. 


ExPERIMENT II 


Method 


Verbal material—Two — paired-associate 
lists were constructed representing low and 
high association value with respective means 
and standard deviations of 15.7% (9.2) and 
99,6% (.51). Stimuli were selected from 
the same source as in Exp. I and were sub- 
jected to similar restrictions. Identity in 
both first and last positions was produced 
by orthogonal combinations of two different 
first and two different last letters with three 
vowels. (A, E, & I). For low association 
value the two-first letters were F and Z 
and the two-last letters, J and Q; for high, 
the respective letters were B and P and N 
and T. The two-digit-number responses 
used in Exp. I were also used in Exp. II 
and were assigned to stimuli in a similar 
manner. 

Procedure.—The procedure in Exp. II 
was identical to that of Exp. I. The only 
known difference was that Ss were chosen 
from second rather than first semester 
courses in introductory psychology. 


Results 


Means of errors to criterion for low- 
association value for the identity-condi- 
tion of Exp. II and for the first only 
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and last only similarity conditions of 
Exp. I are 426.58, 161.42, and 155.17; 
for high association value the respective 
means are 224.50, 166.83, and 96.08. 
Statistical analysis showed that associa- 
tion value, locus, and the interaction of 
these two variables were significant 
sources of variation. Mean differences 
which equaled or exceeded 42.31 were 
statistically reliable. Within the low- 
association-value-treatments conditions 
of first only and last only, each differed 
from the first and last condition. 
Within the high association-value con- 
ditions each mean was statistically dif- 
ferent from each other mean. Compar- 
ison of the low- and high-association- 
value zero-similarity conditions of Exp. 
I to the similarity-conditions of Exp. II 
showed that the main effects and the 
association-value by similarity-interac- 
tion all represented significant sources 
of variation. Mean errors for the zero- 
identity conditions were 176.75 and 
64.00, respectively, for low and high 
association value. Differences exceed- 
ing or equaling 50.06 were significant. 


Discussion 


In Exp. I acquisition rate is independent 
of similarity when association value is 
low, regardless of the locus of the similar- 
ity. Under conditions of medium and 
high association value, increases in diffi- 
culty are concomitant with increases in 
similarity from zero to complete identity. 
Although this disruptive effect is ap- 
parently greater for initial than for ter- 
minal letter location, the difference is not 
great enough to produce a third order de- 
pendency. The direction of the associa- 
tion value by similarity interaction is op- 
posite to that hypothesized by Underwood 
(1963) and to that found by McGhee 
(1961), but is consistent with findings of 
Goss et al. (1962). However, when sim- 
ilarity is increased simultaneously in each 
letter locus (Exp. II) difficulty is propor- 
tionately greater under low than under 
high association value. These data are 
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consistent with Underwood's general claim 
of a disparity between the nominal and 
functional stimulus and further define the 
conditions which affect the direction of 
the association value-similarity interac- 
tion, Thus, difficulty is increased under 
medium and high but not under low asso- 
ciation value so long as similarity in- 
creases in a given locus. When similarity 
is increased simultaneously in each locus, 
difficulty increases to a greater extent 
under low than under high association 
value. 

In Exp. I when similarity is located in 
the initial letter position, there is a differ- 
ent terminal letter associated with each re- 
sponse. When similarity is located in the 
terminal letter position, initial letters are 
perfectly correlated with responses. These 
relations hold within each level of asso- 
ciation value. Thus, unique letters or 
combinations of unique letters and vowels 
might serve as cues to which responses 
could be associated. To the indeterminate 
extent that component parts of the stimuli 
serve as cues, the effectiveness of varia- 
tions in similarity should be systematically 
reduced. The results of Exp. I suggest 
that when association value is low unique 
letters in either first or last letter loci 
serve as effective cues. When association 
value is medium or high unique letters in 
the first letter positions are more effective 
cues than are unique letters in the last 
letter positions, 

The low vs. medium-high dichotomy of 
results can be interpreted as reflecting a 
stronger tendency to attend to individual 
letters than to the “stimulus-as-a-unit” 
when association value is low. This 
tendency must be strong enough to over- 
come the “unit” bias set by the instruc- 
tions which required pronunciation of 
each stimulus on each test presentation, 
The within-S data showing similarity to 
be an effective variation when association 
value is low can be interpreted as the 
result of initial attempts to associate the 
response to the stimulus as a unit. If Ss 
were not required to pronounce, attenuat- 
ing the unit bias, then difficulty should 
decrease as similarity in a given position 
increases, This interpretation also sug- 
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gests that elimination of the perfect cor- 
relation between first or last letters and 
responses will significantly increase diffi- 
culty of acquisition. Since only the unit 
stimulus is perfectly correlated with the 
response, considerable time must be spent 
in integrating the stimulus into a unit 
when association value is low. The low 
association value results of Exp. I and II 
are entirely consistent with this inter- 
pretation, 

Implicit in the interpretation of the low 
vs. medium-high dichotomy of results is 
the assumption that medium and high as- 
sociation value stimuli are treated as dis- 
crete units, However, the significant 
locus of similarity effect obtained in these 
conditions suggests that unique letters in 
initial positions are more effective cues 
than are unique letters in terminal letter 
positions, Comparison of the simulta- 
neous similarity condition of Exp. II to 
the first only and last only conditions of 
Exp. I suggests that, when association 
value is high, similarity in the terminal 
position increases difficulty of acquisition 
only if there is similarity in the initial 
position. These data suggest that, under 
the conditions of these experiments, there 
are strong habits of emphasizing the ini- 
tial letter components even when dealing 
with well integrated meaningful words. 
Alternatively, it might be assumed that 
these items are attended to as units and 
the locus difference reflects a serial stor- 
age process in which stimuli are encoded 
into and decoded out of memory in a left- 
right fashion. Given these conditions, lo- 
calizing similarity in the initial letter posi- 
tions would produce greater interference 
than localization in terminal positions. 
Additional research is required to deter- 


mine which one of these two alternatives 
gives the best account of the locus finding. 
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DETERMINANTS OF SHORT-TERM RETENTION * 
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2 groups of 24 Ss recalled 5- and 6-word units following 0-, 4-, and 


8-sec. retention intervals (RIs). 


the digits in 2-digit numbers wh: 
same numbers as high or low an 
nificantly more words were recall 


During the RIs, one group added 
ile the other group classified the 
d odd or even. Slightly but sig- 
ed from 6-word units following all 


RIs. Rate of forgetting was significantly greater for 6-word units 
between 0- and 4-sec. RIs, but no rate difference was found between 
4- and 8-sec. RIs. Neither of these effects of unit size interacted 
with interpolated-task difficulty. The results were considered to sup- 
port a rehearsal-capacity interpretation of short-term retention rather 
than an intraunit interference interpretation, which implies that 
words from 6-word units should become less available than words 
from 5-word units with increases in retention interval. 


Recent investigations of short-term 
memory (STM) employing the single- 
unit recall procedure (Peterson & 
Peterson, 1959) have shown that the 
rate of forgetting of verbal units varies 
directly with the number of elements 
contained in the units (Houston, 1965 ; 
Melton, 1963). Houston (1965) has 
also shown that these differential rates 
of forgetting are not attributable to un- 
controlled differences in degree of 
learning. Since comparable results 
have been found when elements are 
defined as either single words or single 
letters (Murdock, 1961), it appears 
that the number of chunks (Miller, 
1956) contained in a unit determines 
the rate at which it is forgotten. 

Melton (1963) and Houston (1965) 
have suggested that these increased 
rates of forgetting found when unit 
size is increased are due to greater 
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amounts of intraunit interference. Ac- 
cording to this interpretation, increases 
in the number of elements in a unit 
increase the interference among ele- 
ments. This interference decreases the 
relative availability of all elements, and 
its effect becomes greater with in- 
creases in retention interval. 

An alternative interpretation of the 
effect of unit size is suggested by the 
work of Posner and his associates 
(Posner, 1966; Posner & Konick, 
1966; Posner & Rossman, 1965). 
They have demonstrated that recall 
decreases systematically with increases 
in the size of the informational trans- 
forms required in order to perform 
the interpolated task (IT) presented 
during the retention interval. Dif- 
ficulty of the IT also varied directly 
with the size of the transformations. 
Assuming a limited capacity for proc- 
essing information, these data indicate 
that at least part of this capacity is 
needed to sustain STM. Increases in 
the size of the transforms presumably 
decrease the amount of capacity avail- 
able for processing (ie., rehearsing) 
the to-be-remembered material, Since 
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chunks are very likely the units for in- 
formation Processing (Miller, 1956) 
and the number of elements is prob- 
ably closely related to the number of 
chunks, this rehearsal-capacity inter- 
pretation implies that the same number 
of elements are rehearsed following 
presentation of any size unit as long 
as the number of elements approaches 
the maximum capacity. The avail- 
ability of the elements that are not re- 
hearsed, however, should decrease 
rapidly. Because a larger number of 
elements which cannot be rehearsed 
are presented in larger units, the rate 
of forgetting should be greater for 
these units, 

Both the intraunit interference and 
rehearsal-capacity interpretations pre- 
dict increased rates of forgetting for 
units with greater numbers of ele- 
ments. The two interpretations, how- 
ever, predict different relative levels 
of recall. performance for different 
Size units following increased reten- 
tion intervals. The intraunit inter- 
ference interpretation implies that the 
number of elements recalled from units 
containing larger numbers of elements 
Should decrease to lower levels with 
increases in retention interval than the 
recall of elements from smaller units, 
The rehearsal-capacity interpretation, 
however, implies that the recall of 
elements from larger units should 
never become lower than the recall of 
elements from smaller units. In other 
words, at least the same number of 
elements should always be recalled 
from larger units no matter how long 
the retention interval. 

The present Study was designed to 
test these implications of the two inter- 
Pretations of the effect of unit size on 
STM. o Tata manne similar to that 
followed by Houston ( 1965), the recall 
of five-word and six-word units was 
evaluated after several retention inter- 


vals, However, recall was measured 
in terms of the absolute number of 
words or elements recalled rather than 
the percentage-of-units-recalled and 
the percentage-of-words-recalled mea- 
sures used by Houston (1965). Per- 
centage measures must necessarily 
show lower recall for larger units even 
when the number of elements recalled 
from different size units is constant. 
For this reason, they are not approp- 
riate units of analysis for determining 
the relative merits of the two interpre- 
tations of the effect of unit Size on 
STM. The difficulty of the IT was 
also varied. This permitted an evalua- 
tion of the recall of five-word and six- 
word units under conditions which re- 
sult in different levels of performance 
and, presumably, allow different mag- 
nitudes of rehearsal, 


METHOD 


Design and materials.—The design was a 
2X2X83 factorial. Different groups of Ss 
performed one of two ITs. These tasks 
were similar to the addition and two-bit 
classification tasks developed by Posner and 
Rossman (1965) and required information 
reduction of 27 and 46 bits, respectively. 
Within each group, the recall of five- and 
six-word units was measured after reten- 
tion intervals of 0, 4, and 8 sec. Each 
S was given six trials under each within-Ss 
condition, and six different orders for pre- 
senting these conditions were used. These 
orders were determined by a random Latin 
square, Each order was used once for each 
S, and six sequences of orders determined 
by another random Latin Square were used 
for equal numbers of Ss in each group. 

The materials were selected randomly 
without replacement from the 300 most fre- 
quent four-letter words in the Thorndike- 
Lorge (1944) word count. Thirty-six six- 
word units were selected and two different 
presentation orders were determined ran- 
domly. The five-word units were formed 
by eliminating the sixth word in the six- 
word units. 

Procedure—Different Presentation times 
were used for the five- and six-word units 
in an attempt to equate the degree of learn- 
ing for each word. Pilot data indicated 
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that 2.25-sec. and 3.25-sec. presentations for 
the five- and six-word units, respectively, 
allowed Ss to read all the words and re- 
sulted in comparable levels of recall imme- 
diately following presentation. The word 
units were presented on a rear-projection 
screen by a Kodak Carousel projector which 
was controlled by a programmed tape re- 
cording coordinated with the projector by 
a Uher Dia-Pilot II programmer. 

On each trial a .25-sec. tone was pre- 
sented 2 sec. before presentation of the 
word unit, All words in a unit appeared 
in a single column and Ss read each word 
aloud starting at the top of the column. 
For the 4- and 8-sec. retention intervals, a 
slide with a matrix containing three rows of 
four two-digit numbers was presented imme- 
diately after the word unit. The Ss with 
the addition IT added the two digits in 
each number and called out the sum. For 
the classification IT, Ss classified each 
number as being high or low and odd or 
even. Both ITs were self paced, and in 
order to make them maximally effective, 
Ss were told that their score on each trial 
would be determined on the basis of both 
the number of words recalled and the num- 
ber of two-digit numbers added or classified. 
At the end of the retention interval, the 
slide containing the numbers was removed 
and a .25-sec. tone signalled the beginning 
of a 12-sec. recall period. The termination 
of the recall period was signalled by the 
tone which preceded the presentation of the 
next word unit. The same basic procedure 
was followed on all 36 trials except for the 
Q-sec. retention intervals where the tone 
signalling recall occurred when the slide 
containing the word unit was removed. 

Subjects.—The 24 Ss in each group were 
male and female students fulfilling a course 
requirement at the University of Waterloo. 
Assignment of Ss to groups occurred when 
they appeared in the laboratory for the ex- 
periment. It was done on the basis of a 
predetermined unsystematic order which in- 
sured that one S in each block of four was 
assigned to each word-unit presentation 
order within each group. 


RESULTS 


Recall performance.—Recall scores 
for each S were determined by sum- 
ming performance over the six trials 
under each condition. Two different 
scores based on correct recall were 
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computed. These were total recall, 
the total number of words recalled, and 
ordered recall, the number of words 
recalled in the correct order. Both re- 
call scores were analyzed by 2 X 2X 
2 x 3 analyses of variance to evaluate 
IT difficulty, word-unit presentation 
order, unit size, and retention interval. 
Since unit presentation order was not 
a significant source of variance and did 
not interact with any other variable in 
the analyses, the scores were collapsed 
across this variable and reanalyzed. 

The mean total-recall and ordered- 
recall scores for the five- and six-word 
units are presented in Fig. 1. It 
should be noted that the recall scores 
for both the five- and six-word units 
were somewhat below the maximum 
possible (30 and 36 for the five-word 
and six-word units) under the O-sec. 
retention interval. Also, an inspection 
of the total-recall scores revealed that 
the maximum score for five-word units 
was only obtained by eight Ss and only 
four Ss recalled the maximum possible 
for six-word units. Thus, it does not 
appear that the results were con- 
strained by a ceiling effect. 

The total-recall scores show that 
slightly more words from six-word 
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recall scores for five-word and six-word 
units after each retention interval. 
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units were recalled following all reten- 
tion intervals. The analysis of vari- 
ance revealed that this small difference 
in total recall between five-word and 
six-word units was significant, F (1, 
46) = 24.50, p < .001. Furthermore, 
the greater rate of forgetting between 
0 and 4 sec. for the six-word units 
produced a significant Unit-Size x Re- 
tention-Interval interaction, F (2, 92) 
—7.8, p<.001. As indicated in 
Fig. 1, there was a highly significant 
decrease in total recall with increases 
in the length of the retention interval, 
F (2, 92) = 568.50. Newman-Keuls 
comparisons revealed that recall was 
significantly different under the three 
retention intervals (p < .01). 

In contrast to the total-recall scores, 
the ordered-recall scores presented in 
Fig. 1 show that slightly more words 
were recalled from five- than six-word 
units. The analysis of variance re- 
vealed that this slight superiority in re- 
call for five-word units only ap- 
proached significance, F (1, 46) = 
285, p < .10. The analysis also indi- 
cated that the consistent decrease in 
recall with increases in the retention 
interval was highly significant, F (2, 
92) = 472.14. Subsequent Newman- 
Keuls comparisons showed that recall 
under all retention intervals differed 
significantly (p <.01). As indicated 
in Fig. 1, no Unit-Size X Retention- 
Interval interaction was found for the 
ordered-recall scores, F (2, 92) <1. 

The mean total-recall and ordered- 
recall scores for the addition IT and 
classification IT are shown in Fig. 2. 
Under the 4-sec. and the 8-sec. reten- 
tion intervals, both total and ordered 
recall were better following the addi- 
tion IT than the classification IT. The 
analyses revealed that the overall dif- 
ferences due to IT difficulty were sig- 
nificant for total recall, F (1, 46) = 
1095, » <.01, but only approached 
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recall scores under each retention interval 
for the Addition-IT and Classification-IT 
groups. 


significance for ordered recall, F (1, 
46) = 3.35, p < .08. Likewise, the IT- 
Difficulty X Retention-Interval inter- 
actions which are evident in Fig. 2 for 
both total and ordered recall only 
reached significance for the total-recall 
scores, F (2, 92) 2 821, p « .001. 
Both the total-recall and ordered-recall 
analyses revealed no differential effect 
of unit size under the different ITs, 
both Fs (1, 46) < 1. The magnitude 
of the IT-Difficulty x Retention-Inter- 
val interaction also did not change sig- 
nificantly as a function of unit size, 
both Fs (2, 92) <1. This absence 
of any difference in the effect of unit 
size under the two ITs indicates that 
the differential rates of forgetting for 
five- and six-word units remained rela- 
tively constant regardless of the dif- 
ficulty of the IT. 

Interpolated-task performance.—The 
total number of two-digit numbers 
added or classified during the 4-sec. 
and the 8-sec. retention intervals fol- 
lowing presentation of the five- and 
six-word units was determined for 
each S. No distinction was made be- 
tween correctly and incorrectly pro- 
cessed numbers. Overall, more numbers 
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were processed by Ss required to add 
(34.7) than those required to classify 
(24.4), F (1, 46), = 53.51, p < .001. 
More numbers were also processed 
under the 8-sec. (39.8) than under the 
4-sec. (19.3) retention interval, F (1, 
46) = 956.15. No significant differ- 
ence was found between five-word 
(29.2) and six-word (29.9) units, 
F (1, 46) — 249, and unit size did not 
interact with any other variable. 


Discussion 


The most important aspect of the re- 
sults is the almost identical recall of ele- 
ments from five-word and six-word units 
following the 4- and the 8-sec. retention 
intervals. Even though there was a sig- 
nificant decrease in recall from the 4- to 
the 8-sec. retention interval, there is no 
evidence, as demanded by the intraunit- 
interference interpretation, that the ele- 
ments from the six-word units were be- 
coming less available than those from 
five-word units, This similarity in recall, 
on the other hand, does offer strong sup- 
port for the rehearsal-capacity interpreta- 
tion of the effect of unit size. The data 
are completely compatible with the as- 
sumption that the same number of chunks 
are rehearsed during the retention interval 
following presentation of five- or six-word 
units. 

The greater rate of forgetting for six- 
word units predicted by both interpreta- 
tions is evident between the 0- and the 4- 
sec. retention intervals with total-recall 
but not ordered-recall scores. Since the 
total recall of elements from six-word 
units was facilitated at the 0-sec. interval, 
these data suggest that the memory span 
is limited by the requirement that the 
order of elements rather than the elements 
themselves must be remembered, Fur- 
thermore, the absence of any difference in 
the magnitude of this rate difference under 
the two levels of IT difficulty indicates 
that the decrease in the availability of un- 
rehearsed elements was extremely rapid. 
Presumably, the difference between the 
number of elements rehearsed and the 
number not rehearsed is greater with 
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more difficult ITs. "Thus it is reasonable to 
expect that differential rates of forgetting 
for five- and six-word units would be evi- 
dent over longer retention intervals with 
more difficult ITs. Since this did not oc- 
cur, it appears that under each level of IT 
difficulty either the same number of no 
unrehearsed elements from the five- and 
six-word units were available after the 
4-sec. interval. In order to detect possible 
rate differences attributable to IT diffi- 
culty, it would be necessary to use reten- 
tion intervals considerably shorter than 
4 sec. 

The results of the present study are 
consistent with those reported by Houston 
(1965). While Houston only used per- 
centage measures, the absolute number of 
words recalled under the various condi- 
tions can be estimated from his percent- 
age-of-words-recalled measure. When this 
conversion is made, Houston’s data show 
that more words were recalled from six- 
word than five-word units following all 
retention intervals. Furthermore, only at 
the 0-sec. retention interval was there a 
large superiority in favor of the six-word 
units, The absolute numbers of words 
recalled following each of the other reten- 
tion intervals were quite similar for the 
five-word and six-word units. Thus, the 
results of both the present study and 
Houston’s study appear to provide support 
for the rehearsal capacity rather than the 
intraunit interference interpretation of 
the effect of unit size in that there is no 
evidence in either study that elements 
from six-word units become less available 
with increases in retention interval than 
elements from five-word units. In fact, the 
results of both studies show that approxi- 
mately the same number of elements are 
recalled from five-word and six-word 
units following each retention interval 
except the 0-sec. interval. This indicates 
that the effect of unit size on the recall 
of elements is limited primarily to imme- 
diate recall or to recall following reten- 
tion intervals which are less than the 
shortest intervals used in either the Hous- 
ton study (1.5 sec.) or the present study 
(4 sec.). 

One aspect of the present results which 
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deserves brief mention is the consistently. 
superior total recall and poorer ordered 
recall for the six-word as compared with. 
five-word units. While there are several 
possible interpretations, the one which 
can most easily handle both differences is 
that there is a greater likelihood of two or 
more elements being grouped together to 
form single chunks when a greater num- 
ber of elements is presented. This would 
reduce memory load and enable Ss to re- 
call more elements. At the same time, it 
would lower ordered recall since elements 
which are chunked would not necessarily 
be adjacent and the correct position of 
these elements would be forgotten. 
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SENSORY TRACES VERSUS THE PSYCHOLOGICAL MOMENT 
IN THE TEMPORAL ORGANIZATION OF FORM 
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Visual stimuli were constructed so that any given stimulus by itself 
appeared to be a random collection of dots. However, when 2 cor- 
responding stimuli were superimposed by means of a 2-field tachisto- 
scope, a 3-letter nonsense syllable was perceived. When a temporal 
interval was introduced between the presentation of the correspond- 
ing stimulus halves, organization in perception was found to be a 
decreasing function of interstimulus interval over a range in excess 
of 100 msec. The applicability of a concept of a decaying sensory 
trace in accounting for the integration of form perception over time 
was investigated. Clear evidence of a decaying sensory trace was 
found; however, this concept was unable to account for all aspects 
of the data. The applicability of a psychological moment acting in 
conjunction with a decaying sensory trace was considered as was also 
the possibility of discontinuity detectors which inhibit or break up 


temporal organization. 


Evidence for a persisting sensory 
trace has come from experiments such 
as those of Sperling (1960) and Aver- 
bach and Coriell (1961) in which an 
overload of information is presented in 
a brief visual display. Some milli- 
seconds following termination of the 
display S is presented with a signal or 
indicator designating an element or a 
subset of elements of the display which 
he is to report. Accuracy of perform- 
ance is a decreasing function of delay 
of the indicator out to delays of the 
order of 200 to 300 msec. 

This result suggests the conclusion 
that the total information of the display 
is available for a brief period following 
termination of the display and if the 
indicator occurs before this informa- 
tion decays, S is able to encode the 
designated element or subset of the ele- 
ments from the display. 


1This investigation was supported by 
Public Health Service Research Grant 
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However, an alternative interpreta- 
tion of this result is possible for the 
visual system is considerably less than 
perfect in temporal resolution. Bloch's 
law is perhaps the best known example 
and recently many masking phenomena 
have seemed to be explainable in terms 
of the lack of temporal resolution in the 
visual system (Eriksen, 1966b; Erik- 
sen & Collins, 1965 ; Thompson, 1966). 

The applicability of a concept of a 
psychological moment (Stroud, 1956) 
to these temporal characteristics of 
vision has been noted by Boynton 
(1961) and by Eriksen and Collins 
(1965). The psychological moment 
has been variously conceived as having 
a duration of from 50 to approximately 
200  msec.  (Kristofferson, 1965; 
Stroud, 1956; White, 1963). Events 
separated by less than the duration of 
the moment have a probability of enter- 
ing the same moment and thus being 
perceived as simultaneous. The closer 
the two events are in time the higher 
the probability that they will both 
enter the same ongoing moment, but as 
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the time separation increases to the 
duration of a moment the probability 
becomes zero that the two events will 
be represented in the same moment, 

The moment interpretation of the 
Sperling (1960) and the Averbach and 
Coriell (1961) data would explain the 
decreasing performance with delay of 
the indicator as the result of the de- 
creasing probability that the indicator 
and the display are represented in the 
same psychological moment. The 
asymptote of the function is reached 
when the delay between termination of 
the display and the indicator is longer 
than the duration of the moment and 
therefore the probability that on any 
trial they would be represented as si- 
multaneous is zero. Since the data are 
based upon average performance over 
a large number of trials it is impossible 
to choose between a decayed trace in- 
terpretation and a psychological mo- 
ment. 

In a previous paper (Eriksen & Col- 
lins, 1967) the authors reported a tech- 
nique for studying form organization 
over time. In Fig. 1 the upper and 
middle patterns of dots appear to be 
essentially random, but when the two 
patterns are carefully aligned and 
superimposed in two fields of a tach- 
istoscope and presented simultane- 
ously, the embedded nonsense syllable 
shown in the bottom of Fig. 1 can be 
read. An S's ability to read the non- 
sense syllable can then be determined 
by introducing a variable delay be- 
tween the presentation of the two 
stimulus halves or dot patterns. The 
nature of the separate stimulus halves 
is such that it seems necessary that S 
has an image-like or sensory trace of the 
first half present in order to integrate 
it with the second half thus enabling 
S to read the nonsense syllable. The 
dot patterns do not seem to lend them- 
selves to any type of cognitive encod- 
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Fic, 1. The upper two dot patterns when 
superimposed result in the bottom stimulus 
pattern in which the nonsense syllable HOV 
can be read. 


ing or memory that would permit the 
summing of the information of the two 
halves in any but an image-like fashion. 

In the previous study (Eriksen & 
Collins, 1967) performance was found 
to decrease as a negatively accelerated 
function of the interstimulus interval 
(ISI) between the two stimulus 
halves. An asymptote was approached 
at ISI values between 100 and 200 
msec. where performance reached a 
level to be expected from guessing for 
single stimulus half presentations. 
This result could be attributed to a 
decay of the first stimulus half in time. 
If the second half of the pattern is 
presented before the decay of the first 
half has progressed too far, it is pos- 
sible to integrate the two patterns into 
the perception of the embedded non- 
sense syllable. This explanation 
assumes that integration becomes pro- 
gressively more difficult if the two 
stimulus halves are of different bright- 
ness. Thus the longer the delay the 
less bright is the first stimulus half 
which has to be integrated with the 
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now presented bright second half pat- 
tern, 

In the present experiment lumi- 
nances of 5, 2, and 1 mL. were used 
for the stimulus halves. The stimulus 
halves were presented either simultane- 
ously or at ISIs between termination 
of the first half and onset of the second 
that varied from 0 to 100 msec. Con- 
trol observations were made when both 
stimulus halves were presented simul- 
taneously at 5, 2, and 1 mL. Also as 
a control both stimulus halves were 
presented at 5 mL. at each of the ISI 
values. Of greatest experimental inte- 
rest were the conditions where the two 
stimulus halves were of unequal bright- 
ness, Here a 5 ml. half was presented 
with a 2 mL. half and with a 1 mL. 
half at each of the ISI values. At 
each of these ISI values the order of 
occurrence of the bright and dim half 
was systematically varied so that there 
were an equal number of experimental 
trials where the bright half had occur- 
red first followed by the dim half as 
for the reverse order. 

With this experimental design a 
concept of a decaying sensory trace 
would require certain expectations as 
to the results to be obtained. First the 
concept would require that recognition 
accurracy for the embedded nonsense 
syllable be decreased as the discrep- 
ancy in brightness of the two stimulus 
halves becomes greater even when both 
halves are simultaneously presented. 
Second, it would predict an interaction 
between order of occurrence of the 
bright and dim halves with ISI. If the 
bright half is presented first, then at 
any given ISI its trace, due to decay, 
will more nearly match the lower 
brightness of the second occurring 
stimulus half. On the other order, the 
dim half presented first will have a 
greater disparity in brightness with the 
brighter second half when it occurs 
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at any given ISI due to some decay in 
the brightness of the first half trace. 
Further, the longer the ISI up to some 
maximum the greater the superiority 
of performance for the order in which 
the bright half occurs first. A third 
expectation that follows from the sen- 
sory trace concept is that at some ISI 
value a 2 mL. stimulus half following 
a 5 mL. half will exceed the perform- 
ance that is obtained when both stim- 
ulus halves are 5 mL. Similarly a 1 
mL. stimulus half following a 5 mL. 
half will exceed at a greater ISI value 
than that for the 2 mL. half, the per- 
formance obtained when both halves 
are 5 mL. 


METHOD 


Subjects—Six graduate students (one fe- 
male) with normal or corrected to normal 
vision served as paid Ss. 

Procedure.—A. Scientific Prototype Model 
GA three-field tachistoscope in which Syl- 
vania FT 45 CWX lamps had been sub- 
stituted was used for stimulus presentation. 
The “complete” stimuli consisted of 20 three- 
letter nonsense syllables. These stimuli have 
been described in detail previously (Eriksen 
& Collins, 1967). In brief, they were simi- 
lar to the one shown in Fig. 1. They were 
made up of two stimulus halves which when 
superimposed and properly aligned permitted 
the reading of the embedded nonsense syl- 
lable. Originally constructed with India ink 
dots on white cardboard, they were photo- 
graphed with high contrast negatives and 
presented to S backlighted in the tachisto- 
scope (contrast approximately 98%). The 
two stimulus halves for a nonsense syllable 
were presented in separate fields of the 
tachistoscope. As presented they subtended 
2° of visual angle in length and 1° in 
height. The third tachistoscopic field was 
dark but contained a faintly luminous cross 
.3° of angle and positioned .5° below the 
center letter of the nonsense syllables. 
When an ISI occurred between the pre- 
sentation of the two stimulus halves this 
interval was also dark. 

The Ss were instructed to fixate the cross 
and to trigger a stimulus presentation when 
this fixation point appeared sharp and clear. 
An experimental trial consisted of the pre- 
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sentation of both stimulus halves either si- 
multaneously or at an experimentally varied 
ISI. The Ss had available in their dimly 
illuminated booth a list of the 20 possible 
nonsense syllables from which they selected 
their response following each trial. On the 
average a 5-sec. interval occurred between 
trials. 

Prior to beginning the experimental ses- 
sions each S' served two practice sessions 
during which the corresponding stimulus 
halves were presented simultaneously for a 
duration of 6 msec. and at a luminance of 
5 mL. All Ss were required to obtain an 
accuracy level of at least 8596 in order to 
participate in the experiment and several 
potential Ss were eliminated for failure to 
do so. In each of the following 20 experi- 
mental sessions S was run for five blocks of 
20 trials each under a particular luminance 
condition (dim stimulus half, 1 or 2 mL.) 
and order (bright half first or second). 
ISIs were varied within sessions over the 
values of simultaneous, 25, 50, 75, and 100 
msec, Each stimulus half was presented for 
a duration of 6 msec. A total of 80 trials 
were run under each particular combination 
of variables. Two control conditions were 
also run but were not included in the main 
analysis. One control was the condition 
where both stimulus halves were 5 mL. and 
were separated by the five ISIs used. The 
second control involved the presentation of 
both stimulus halves simultaneously at a 
luminance of 1 and of 2 mL. The various 
control and experimental conditions were 
counterbalanced over Ss throughout the 20 
experimental sessions. 


RESULTS AND DISCUSSION 


The percentage of correct nonsense 
syllable identifications were analyzed 
for the two experimental conditions 
(stimulus halves, 5 and 2 mL. or 5 and 
1 mL.; order, dim half first or bright 
half first; ISIs; and Ss). This four- 
way classification analysis of variance 
yielded significant (p< .01) main 
effects for all four variables. In addi- 
tion, the interaction of order with ISI 
and order with Ss was significant be- 
yond the .05 level. The Conditions X 
ISI interaction was also significant (P 
X 01), but none of the third order 
interactions approached significance. 


379 


In Fig. 2 the percentage of correct 
nonsense syllable identifications is 
shown as a function of condition, 
order, and ISI. Also plotted are the 
data from the control condition where 
the two stimulus halves were of equal 
luminance at 5 mL. 

The results in Fig. 2 are in agree- 
ment with our expectations from a sen- 
sory trace concept. The basic assump- 
tion that organization of the embedded 
nonsense syllable is more difficult if 
the test stimulus halves are of unequal 
energy is borne out by comparison of 
the data where the two stimulus halves 
are simultaneously presented at 5 mL., 
at 5 and 2 mL., and at 5 and 1 mL. 
Even though under these conditions 
both stimulus halves are simultaneously 
presented for the same duration, per- 
formance drops markedly as the dis- 
parity of luminance of the two halves 
becomes greater. That this perform- 
ance drop is due to the relative bright- 
ness difference in the two halves rather 
than just the lower brightness of one 
half can be seen by comparison with 
the performance obtained when both 
halves are reduced in' luminance. 
When both halves are presented si- 


PERCENT CORRECT 


ISI IN MILLISECONDS 


Fic. 2. Percentage of correct nonsense 
syllable identifications as a function of the 
luminance of the corresponding stimulus 
halves and the interstimulus interval. 
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of the Syllables is 88% and 80% when 
both halves are 1 mL. 

The significant order effect and 
Order x ISI interaction are both seen 
to be in the direction expected from 
the sensory trace concept. Perform- 
ance is superior if the bright stimulus 
half precedes the dim half and the dis- 
crepancy in performance for these two 
orders becomes greater as ISI in- 
creases. 

The third expectation following 
Írom the sensory trace concept is only 
partly confirmed. In the case where 
the 5-mL, half is followed by the 2-mL. 
half, performance exceeds that ob- 
tained for the control condition where 
both halves are 5 mL. somewhere be- 
tween an ISI of 20 and 50 msec. 
However when the first half is 5 mL. 
followed by a 1 mL. half, performance 
does not exceed that obtained when 
the two halves are 5 mL. within the 
range of 100-msec. ISI investigated. 
'The data do suggest, however, that had 
longer ISIs been examined, these two 
curves would have met before the 
asymptote of performance was reached. 

These results, taken as a whole, are 
certainly indicative that a decaying 
sensory trace does exist in the nervous 
system. Part of the shape of the func- 
tion relating performance to ISI ob- 
tained for the control condition where 
both stimulus halves were 5 mL. and 
in the previous experiment (Eriksen & 
Collins, 1967) can be accounted for in 
terms of a decaying trace and the in- 
creased difficulty in organizing a form 
from stimulus halves that are disparate 
in brightness. 

There are however indications in the 
present data that more is involved than 
the decaying trace. This can be seen 
if one considers the control observations 
where both halves were simultaneously 
presented either at 2 mL. or at 1 mL, 


oe piati at 2 mL., identification 
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As long as both halves are of equal 
luminance, reducing the luminance 
level of both halves has only a small 
effect upon identification performance. 
Thus the syllables are identified with 
92% accuracy when both halves are at 
5 mL., with 88% accuracy at 2 mL., 
and 80% accuracy at 1 mL. If per- 
formance decrement as a function of 
ISI was solely attributable to the de- 
cayed trace and therefore the greater 
disparity in brightness of the two 
halves, presenting a 5 mL. half fol- 
lowed by a 2 mL. half should at some 
ISI value equal the performance ob- 
tained when both halves are simultane- 
ously presented at 2 mL. Similarly, 5 
mL. presented first followed by 1 mL. 
should at some longer ISI value equal 
the performance obtained when both 
halves are simultaneously presented at 
lmL. Inspection of Fig. 2 shows that 
this performance level is not achieved 
by either of these two experimental 
conditions. For both, performance is 
consistently below that obtained for 
the respective control observations. 
The most that seems to be achieved by 
presenting a bright half first followed 
by a dimmer half is an attenuation in 
the rate of decay of the performance 
measure with increases in ISI.? 


?In the previous study (Eriksen & Col- 
lins, 1967) a similar attempt was made to 
test a sensory trace concept with equivocal 
results. In that study the importance of a 
mismatch in brightness of the two stimulus 
halves was found, but there was no inter- 
action of order with ISI nor a significant 
order effect. Hindsight suggests that this 
prior failure was attributable to the manip- 
ulation of intensity or luminance disparity 
via increased exposure time of one stimulus 
half in terms of the I X t — c relation. Per- 
haps more important was the use of black 
on white stimuli. Subsequent research in 
our laboratory has shown that the temporal 
interval over which black on white stimuli 
can be integrated or organized is appreci- 
ably shorter than that for the backlighted 
stimuli used in the present experiment. 
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It is possible that something like a 
psychological moment may also be in- 
volved in the form organization over 
time that occurs in the present task. 
The concept of a psychological moment 
(Stroud, 1956; White, 1963) is too 
unspecific with respect to such char- 
acteristics as whether incoming stimu- 
lation can enter an ongoing moment 
or whether it has to be stored for the 
following moment. And if stored, is 
there decay in the stored trace? A de- 
caying sensory trace concept and the 
concept of a psychological moment are 
not necessarily incompatible. Tf one 
posits that an incoming stimulation 
cannot enter an ongoing psychological 
moment, then two stimulations sepa- 
rated by less than the psychological 
moment but by a finite time period 
would both be stored until they could 
enter the next moment. But because 
of their differential time of arrival 
one would have decayed more than the 
other leading to, in the present experi- 
mental task, a mismatch in the bright- 
ness level of the two stimultations. 
Such a concept would account quite 
well for the present data in describing 
the organization of form over short 
temporal intervals. It would attribute 
the decrease in performance as a func- 
tion of ISI as due to two processes; 
first, to the differential decay of the 
two stimulus halves with the resulting 
mismatch in brightness, and second, 
to the decreasing probability that they 
would both enter the same psychologi- 
cal moment as ISI increased. 

A third possibility can also be en- 
tertained. For convenience this possi- 
bility may be termed discontinuity de- 
tection. A system that has the pro- 


Thus in the Eriksen and Collins (1967) ex- 
Periment it is possible that the time period 
Over which temporal organization could 
Occur was too short to demonstrate the 
order effect. 


nounced temporal integration that vis- 
ion does would also seem to require 
mechanisms capable of detecting or 
responding to discontinuity in stimula- 
tion. Such discontinuity detectors 
might be responsible for the breaking 
up of integration of incoming stimula- 
tion over time so as to permit discrete 
perceptions. Granit (1947) has sug- 
gested such a role for off receptors and 
more recently Barlow (1961) has 
speculated along these lines. 

If discontinuity detectors are respon- 
sible for interruption of temporal inte- 
gration the data of the present experi- 
ment for the 5-5 mL. control condi- 
tion would reflect the increasing prob- 
ability of discontinuity detection with 
lengthening ISI as well as the decay 
of the stimulus trace. Also the 
monotonic negatively accelerated for- 
ward and backward masking functions 
(Eriksen, 1966; Eriksen & Collins, 
1965) might be viewed in part as 
threshold functions for discontinuity 
detection. 
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AVERAGING OF SPACE AND NUMBER STIMULI 


WITH SIMULTANEOUS PRESENTATION? 


NORMAN H. ANDERSON 
University of California, San Diego 


Ss received sets of 3-digit numbers or linear arrays of dots with 
instructions to estimate a central tendency. Each set consisted of 
2 compact groups of stimuli, of varied size and separation. Responses 
were fairly accurate though there was a general tendency to over- 
weight the outlier group of numbers, and a partial tendency to under- : 
weight the outlier group of dots. The main purpose of the experiment 
was to test an elementary averaging model. This model showed 
promise for the Number data, but there were some moderate, 
systematic discrepancies for the Space data. A straightforward 
generalization of the model appeared to provide a good semiquantita- 


tive account of the Space data. 


Recent studies of information inte- 
gration have tended to support an aver- 
aging model for personality impressions 
(e.g., Anderson, 1962, 1965). In these 
experiments, Ss make some judgment 
based on the information in a set of 
stimulus items. Each stimulus is con- 
sidered to have a value, s, and a weight, 
w, relative to the dimension of judg- 
ment. The response is then formally a 
weighted sum or average, R = Zwist. 

The present experiments were de- 
signed to get preliminary information 
in an analogous, but similar task, with 
stimuli not primarily semantic in na- 
ture. Accordingly, Ss were asked to 
estimate either the average of a set of 
numbers, or the balance point of a 
linear array of dots. These stimuli 
are simpler than the trait-adjectives 
used in the personality-impression ex- 
periments. The dots themselves are 
perceptual in nature, and though the 
numbers are semantic, their meaning 


1 Experiment I was run at the University 
of California, Los Angeles. The author 
wishes to thank Wendell Jeffrey for making 
Space available and Anita Lampel and David 
Weiss who served as Es in Exp. I and II, 
respectively. This work was supported by 
soe Science Foundation Grant GB 


is presumably simple and uniform 
over Ss. In the model analysis, more- 
over, both stimuli could be assigned 
scale values on an a priori basis. As a 
consequence, these stimuli afforded po- 
tentially a clearer view of the process 
of information integration. 

Both of these tasks have been used 
previously with simultaneous presenta- 
tion (Beach & Swensson, 1966; 
Spencer, 1961, 1963).  Spencer’s 
papers give some interesting theoret- 
ical observations, but the primary con- 
cern of all these reports has been with 
the accuracy of the response. Here the 
primary interest was on the behavioral 
process rather than on accuracy, as in 
a previous report on number averaging 
with serial presentation (Anderson, 
1964). à 

The first objective of this work was 
to use stimulus sets in which the effect 
of a single element would be isolable. 
For this, an especially simple stimulus 
configuration was employed. Each 
stimulus set consisted of two compact 
groups of items that could be treated 
as units. The influence of each unit 
could then be varied by manipulation 
of simple experimental variables such 
as group size and intergroup distance, 


383 


384 NORMAN H. ANDERSON 


The second purpose, interrelated 
with the first, was to attempt to find a 
theoretically adequate description of the 
behavior using an averaging model. 
The centers of the two groups, denoted 
by X, and X,, were taken as the scale 
values of the groups. If there is no 
constant error, the model equation is 
simply, 


Paix w XOA R 


In this form, w is the importance 
or relative weight that S attaches to 
the outlier group centered at Xa, and 
(1—w) the weight attached to the 
main group centered at Xo. An arith- 
metically correct response would re- 
quire that w equal the size of the outlier 
group divided by the total size of both 
groups. Larger and smaller values of 
1 would represent overweighting and 
underweighting, respectively, of the 
outlier group. 

To the extent that S is averaging, 
w is the appropriate descriptive sta- 
tistic. These experiments studied how 
w depended on such experimental vari- 
ables as group size in order to assess 
the validity of this approach. 


METHOD 


The two experiments reported here were 
quite similar in nature and design. In effect, 
Ss estimated a central tendency of a set of 
numbers, or of a linear array of dots. Each 
set of dots or numbers consisted of two com- 
pact groups of variable size and separation. 

Two instruction conditions were used: the 
Average instructions referred to the simple 
arithmetic or. graphical mean as the standard 
of correctness; the Guess instructions sug- 
gested differential weighting by describing 
each stimulus set as a set of guesses about 
the number of marbles in a jar, S’s task 
being to “best guess” the correct number. 


Experiment I 


Stimulus design—Each S judged 58 stim- 
ulus cards, each constructed of two compact 
groups of dots. The main group had three, 
four, or six dots. The outlier group lay to 


the right of the main group and varied in 
size from one dot up to the size of the main 
group, There were thus 13 combinations of 
group sizes. For each of these, the distance 
between the centers of the main and outlier 
groups took the four values: 1.75, 3.50, 5.25, 
and 7.00 in. In addition, the 8.75-in. distance 
was used for outliers of size one and two 
for each size of main group. 

The dots had à in. diameter, and were 
stencilled in black ink on heavy, white 3 X 24 
in. cards. Within either the main or outlier 
groups, there was 4 in. between the edges of 
successive dots. 

Two complete sets of cards were con- 
structed, with the center of the main group 
either 7 or 9 in. from the left margin of the 
card. This variable was included as a factor 
in the complete design to assess possible 
position bias. Neither stimulus replication 
nor any of its interactions were significant 
so the data were pooled. 

Procedure.—One card at a time was placed 
on a stand in front of S. After 5 sec, E 
said "Time," and S had 5 sec. more to slide 
a spool-and-pointer along a rod just below 
the card. The response was read by E to 
the nearest 4 in. from a second pointer that 
lay against a yardstick at the rear of the 
stand. 

Aíter the instructions were read, 13 prac- 
tice sets were given. These were a repre- 
sentative sample from the 58 cards of the 
experimental series and were given in the 
same arranged order for each S. The 58 
experimental sets were given at a fairly uni- 
form rate that took between 15 and 20 sec. 
per card. They were shuffled and presented 
in a different order for each S. 

Additional number-averaging  condition.— 
Half of the full experiment was devoted to 
a number-averaging task for which the basic 
design was quite similar to the above space 
condition. These number-averaging data 
showed an apparent response bias and are 
not reported in detail. Instead, this condition 
was replicated with minor changes as Exp. TI 
below. 

Instructions and Ss.—Two sets of instruc- 
tions were also included as a factor in the 
full experiment. Under Average instruc- 
tions, S was told to imagine that each dot 
represented a 1 lb. weight on a board that 
extended from the leftmost to the rightmost 
weight. His task was then to move the 
pointer to the position where the board would 
balance on a fulcrum. 

Under Guess instructions, $ was told 
that each dot represented a guess about the 
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number of marbles in a jar, that the dots 
were spaced on the card in proportion to the 
numbers guessed, and that his task was to 
move the pointer to mark off his own best 
guess about the actual number of marbles in 
the jar based on the information contained 
in the other guesses. 

The Ss were 64 male, upper division, psy- 
chology students. Each was paid $1.50 for 
the 4-hr. session. They were randomly as- 
signed to the eight experimental conditions. 
Half of these Ss were in the number-aver- 
aging condition which is reported here only 
as noted. 


Experiment II 


Stimulus design.—Each S judged 90 stim- 
ulus cards, each constructed of two compact 
groups of three-digit numbers. The main 
group contained three, four, or six numbers. 
The outlier group varied in size from one 
number up to the size of the main group. 
There were thus 13 combinations of group 
sizes, just as with the spatial stimuli of 
Exp. I. 

In the primary experimental sets, the main 
group was centered at 113. The center of 
the outlier. group was greater by 16, 32, 48, 
and 64 for each combination of group sizes. 
In addition, the intercenter difference of 80 
was used with outliers of size one and two. 
These 58 sets were thus formally identical 
to the Space stimuli of Exp. I, and in fact 
had been used in the number-averaging con- 
dition there. 

Within each group, the numbers were 
spaced two units apart. Illustrative stimuli 
are: (111, 113, 115; 144, 146), (110, 112, 
114, 116; 177), and (108, 110, 112, 114, 116, 
118; 158, 160, 162, 164). 

To the above 58 sets were added 22 sets 
in which the main group was centered at 187, 
and the center of the outlier group was less 
by 16, 32, 48, 64, and 80. All 14 sets with 
main group of size three were replicated in 
this complementary form. The 8 sets with 
main and outlier groups both of size four and 
six were similarly replicated. These 22 sets 
were added in part to prevent response bias 
by extending the range of stimuli and re- 
sponses, in part to help assess the bias if it 
reappeared. 

Each set of numbers was centered and 
typed on a plain, 4 X 6 in. card in elite type. 
The numbers were arranged in increasing 
Order, with two spaces between successive 
numbers within each group, and four spaces 
between the two groups. 


Instructions and procedure—Two sets of 
instructions were used. Under Average in- 
structions, S was told to estimate the nu- 
merical average of the numbers. He was 
told not to try to work it out mathematically, 
but to "simply give your feeling or im- 
pression what the average is approximately." 

Under Guess instructions, § was told that 
each number on a card was the guess of a 
separate person about the number of marbles 
ina jar. He was instructed to consider these 
guesses and from them make a guess about 
how many marbles really were in the jar. 

In both conditions, S was told to respond 
to the nearest unit. After the instructions 
were read, 15 practice sets were given in 
fixed order, during which E answered ques- 
tions and further instructed S to avoid round 
number responses and direct arithmetic cal- 
culation whenever this seemed to be needed. 
The essentials of the instructions were then 
repeated, and the experimental cards were 
presented in shuffled order for each indi- 
vidual S. 

A trial started when E placed a card in 
front of S. After 8 sec, E said “Time,” 
and after S responded, E withdrew the card, 
placed the next card, restarted the timer, and 
recorded the previous response. 

Subjects.—Thirty-two students who were 
fulfilling a course requirement in introductory 
psychology served as Ss. They were ran- 
domly assigned to the two experimental con- 
ditions. 


REsuLTS 


There are two modes of data descrip- 
tion that are of interest: first, the re- 
sponses may be evaluated in terms of 
their accuracy ; and second, the trends 
in the weight given the outlier group 
may be studied. Both modes were 
used here, but the second has principal 
theoretical importance and therefore is 
presented in more detail. 

Formal statistical analysis was al- 
most completely restricted to the data 
from the main group of size six at the 
first four outlier distances. These data 
are discussed first in each experiment, 
and then compared with the data from 
the main groups of smaller size. For 
completeness, the w-values for the 
largest outlier distance are also given 
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even though these sets were included 
mainly as a precaution to minimize any 
end-response effects (Anderson, 1967) 
and were not considered to be strictly 
comparable to the other sets. 

Space stimuli—One way of viewing 
the data of Exp. I is by comparing the 
observed response with the true stim- 
ulus mean. Figure la gives this in- 
formation for the main group of size 

` six, with outlier size listed by each 
separate curve. The solid line repre- 
sents the geometrically correct re- 
sponse, and the triangles are data 
points. These data were pooled over 
the two instruction conditions since 
they yielded similar results. The ordi- 
nate of the graph has been scaled to 
agree with that of the Number stimuli 
of Exp. II in the lower panel; the unit 
on the ordinate is slightly less than 4 
in,, and the zero corresponds to the 
center of the main group. 

On the whole, Fig. la indicates that 
the mean response for the Space stim- 
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leftmost curve, this would be expected 
since, roughly speaking, just a bisection 
is required when the main and outlier 
groups have equal size. For this curve, 
the average magnitude of the difference 
between the observed and correct means 
is .06 in.; in comparison, the corre- 
sponding value is only .16 in. for the 
remaining five outlier sizes considered 
together. 

Nevertheless, some systematic in- 
accuracies in the Space data are ap- 
parent. The primary trend is a flatter 
slope in the observed than in the cor- 
rect curves. This trend is more aptly 
discussed in the following connection. 

The second way of viewing the 
Space data is through the w-values 
estimated from Equation 1. These 
values are simply the observed re- 
sponse expressed as a proportion of 
the distance between the centers of the 
main and outlier groups. 

Table 1 gives the mean w-values for 
all stimulus sets. These may be com- 
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DISTANCE BETWEEN MAIN AND OUTLIER GROUPS 


Fic. 1. Mean response as a function of outlier distance along the horizontal. 


(Outlier 


size is listed by each curve. Data for main group of size six, for Space stimuli of Exp. I, 


upper panel, and Number stimuli of Exp. 


II, lower panel. The straight lines with 


each data curve give the true arithmetic mean; points above and below the line represent 
overweighting and underweighting of the outlier, respectively, relative to this criterion of 


correctness.) 
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of w listed in the rightmost column. 
When the main and outlier groups are 
equal in size, the values of w lie near 
.50 as they should. Accordingly, the 
data from these three rows of the table 
will not be further discussed. 

The primary trend toward a flatter 
observed curve in Fig. la reappears 
more explicitly in Table 1 as a de- 
crease in w with increased outlier dis- 
tance. For the closest outlier, the ob- 
served w is as large or larger than the 
correct w; as outlier distance increases, 
w declines to values generally less than 
the correct w. 

Linear trend tests (Grant, 1956) 
taken over outlier distance were run 
on the w-values for the main groups of 
size six and for outlier groups of size 
one through five pooled. The overall 
trend was significant, F (1, 28) — 
16.18, and followup tests showed sig- 
nificant trend for the outlier groups of 
size one, two, three, and four, sepa- 
rately. 

An accurate response would give the 


TABLE 1 


Mean WEIGHT OF OUTLIER GROUP AS A 
Function oF SIZE or MAIN GROUP, 
SIZE oF OUTLIER GROUP, AND 
DISTANCE BETWEEN MAIN AND 
OUTLIER Groups: SPACE 
Strutt, Exe. I 


Distance Between Groups 


Outlier Group True 
Size Weight 
1 2 3 4 5 
Main Group 
Size 3 
1 286 | 267 | 227 | 224 | 186 | 250 
2 394 | 386 | 356 | 372 | 372 | 400 
us 488 | 532 | 502 | 492 500 
Main Group 
Size 4 
1 268 | 180 | 177 | 154 | 162 200 
2 380 | 329 | 313 | 291 | 262 | 333 
3 388 | 410 | 390 | 388 429 
4 505 | 492 | 512 | 488 500 
Main Group 
Size 6 
1 237 | 193 | 179 | 151 | 144 143 
2 337 | 260 | 232 | 239 | 212 | 250 
3 348 | 307 | 310 | 275 333 
4 400 | 354 | 344 | 332 400 
5 436 | 421 | 419 | 403 455 
6 497 | 507 | 512 | 512 500 


Note.—Entries are w-values, decimal points omitted. 


same weight to an outlier at all dis- 
tances, Thus, trend in w automatically 
implies inaccuracy in the response. 
However, additional tests of accuracy 
were made by comparing w averaged 
over the four distances with the correct 
w for each outlier size. These tests 
showed significant underweighting of 
outlier groups of size five and four, and 
significant overweighting of the outlier 
group of size one, F (1, 30) = 8.85, 
6.81, and 7.02, respectively. In con- 
trast, the main groups of size three and 
four produce an overall tendency to- 
ward underweighting even though the 
smallest outliers at the closest distance 
tend to be overweighted. 

Number stimuli—The lower panel 
of Fig. 1 plots the mean response for 
the Average and Guess conditions for 
the main group of size six in Exp. II 
in which Number stimuli were used. 
The format is similar to that of Fig. 
la, and all observations have been re- 
ferred to the center of the main group 
by subtracting 113. For the leftmost 
curve, the main and outlier groups have 
equal size. Accordingly, the close 
agreement of the data with the correct 
curve simply indicates no overall re- 
sponse bias. 

In the remaining five cases the Aver- 
age data lie above the correct curve, 
thus showing a uniform tendency to 
overweight the outlier. The magnitude 
of the error evidently increases with 
outlier distance. In contrast, the Guess 
condition, despite some irregularities, 
tends to be much more accurate in the 
mean response. 

A different view, more theoretically 
relevant, of the Number data is ob- 
tained through the w-values of Table 
2. For the Average instructions, the 
observed values are considerably 
greater than the true zw listed in the 
center column, particularly for the 
smaller outliers. Nevertheless, there 
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appears to be little or no trend in the 
w-values as a function of outlier dis- 
tance. This observed constancy in w 
supports the averaging model. 

'The Guess instructions, in contrast, 
show a general tendency for w to de- 
crease with outlier distance, at least 
for the smaller outliers. A large part 
of this trend appeared to be due to 
overweighting of the closest outlier and 
fairly accurate responses to the farther 
outliers, 

Linear trend tests were run on the 
data from the main group of size six, 
with the outlier groups of size one, two, 
and three, pooled for greater power. 
The larger outliers had shown little 
trend in the number-averaging data of 
Exp. I so these were not included since 
they would dilute the mean trend and 
decrease the power of the test. Of 
course, no trend would be expected for 
the outlier group of size six for which 
essentially a bisection was required. 
For analogous reasons, little trend 
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would be expected for outliers of size 
five and four, and this was in fact the 
case in both experiments. 

For the Average instructions, the 
linear trend in w did not approach sig- 
nificance, F (1, 15) = 1.49. This sup- 
ports the above observation that w is 
nearly constant over outlier distance. 
In contrast, the Guess instructions 
showed significant trend, F (1, 15) — 
21.92, and the difference in slope be- 
tween these two instruction conditions 
was also significant, F (1, 30) — 7.17. 

Essentially the same trends were 
found in the data from the smaller 
main groups of size three and four. 
Despite some irregularities in the data, 
w was roughly constant over outlier 
distance under Average instructions, 
but tended to decrease under Guess in- 
structions. 

Some supplementary information on 
accuracy was also obtained by com- 
paring correct and observed w-values 
averaged over the four outlier distances 


TABLE 2 


MEAN WEIGHT OF OUTLIER GROUP AS A FUNCTION or SIZE or MAIN 
Group, SIZE or OUTLIER Group, DISTANCE BETWEEN MAIN AND 
OUTLIER GROUPS, AND INSTRUCTIONS: NUMBER STIMULI, 


Exp. II 
Average Instructions Guess Instructions 
d Distance Between Groups True Distance Between Groups 
Outlier Group Size Weight 
1 2 3 4 5 1 2 3 4 5 
Main Group Size 3 
1 304 | 366 | 364 | 318 | 286 | 250 | 295 | 248 | 196 | 255 | 170 
2 406 | 462 | 428 | 444 | 383 | 400 | 420 | 436 | 398 | 423 | 401 
3 516 | 514 | 526 | 516 500 | 535 | 566 | 500 | 518 
Main Group Size 4 
1 309 | 283 | 320 | 306 | 306 | 200 | 262 | 209 | 185 | 232 | 136 
2 406 | 461 | 397 | 354 | 440 | 333 | 436 | 330 | 424 | 339 | 258 
3 453 | 500 | 456 | 496 429 | 508 | 432 | 486 | 469 
4 508 | 531 | 519 | 506 500 | 586 | 459 | 459 | 512 
Main Group Size 6 
1 278 | 293 | 324 | 290 | 237 | 143 | 254 | 149 | 172 | 177 |153 
2 391 | 340 | 369 | 314 | 301 | 250 | 382 | 295 | 225 | 191 | 218 
3 429 | 432 | 411 | 382 333 | 434 | 391 | 338 | 330 
4 496 | 412 | 414 | 470 400 | 434 | 408 | 476 | 484 
5 524 | 498 | 525 | 502 455 | 469 | 486 | 453 | 495 
6 519 | 508 | 518 | 464 500 | 543 | 531 | 488 | 512 


Note.—Entries are w-values, decimal points omitted. 
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for the main group of size six. The 
Average instructions showed highly 
significant overweighting for each of 
outlier sizes one through five; for the 
Guess instructions, no significant in- 
accuracy was observed in this average 
score. Of course, the trend in w does 
imply some real lack of accuracy in 
this condition. 


Experiments I and 1I.—There are a num- 
ber of subsidiary aspects of the data that 
also require comment. The first of these 
concerns the Number data of Exp. I which 
were not reported because they contained a 
response bias. For main and outlier groups 
of equal size, the w-values were consistently 
above .50, at least for the nearer outliers. 
Nevertheless, these data showed a tendency 
to overweight the outlier that seemed to be 
clearly greater than the prevailing response 
bias. In this respect, these data were con- 
sistent with the results of Exp. Il. 

This bias is evidently due partly to the 
asymmetrical battery of stimulus sets in Exp. 
I since it did not recur in Exp. IL. The 
occurrence of this bias points up the need 
for caution in stimulus construction, and the 
special necessity for control conditions such 
as the sets with groups of equal size. With- 
out such controls, interpretation of the data 
and comparison of the observed and correct 
weights may be hazardous. 

The present results may be compared with 
two previous reports that used both numbers 
and spatial stimuli. These used outliers of 
size one (Spencer, 1961) and skewed dis- 
tributions (Spencer, 1963). Both reports 
claim to find overweighting of the outlying 
numbers, but either underweighting or no 
misweighting of the spatial stimuli, Un- 
fortunately, these reports are difficult to 
evaluate, largely because so little of the 
supporting data or design details are pre- 
sented, but such results would be in general 
agreement with those obtained here. 

One minor discrepancy in the present data 
concerns the effect of the Average and Guess 
instructions in the two experiments. In Exp. 
I, the Guess instructions made the briefest 
reference to marble guessing. This had ap- 
peared adequate in pilot work, but failed to 
yield much difference from the Average in- 
structions except perhaps for the very largest 
outlier distance. In Exp. Il, the marble 
guessing aspect was worked into the instruc- 
tions more thoroughly. Even here, however, 


the discounting effect was sizeable only for 
the smallest outlier groups. 

Finally, the extent of individual differences 
needs a brief discussion. Since the w-values 
may be compared across different conditions. 
between-groups comparisons were accord- 
ingly made on MSs, from the main group 
of size six. The w-values were averaged 
over the four outlier distances, and over all 
outlier sizes for Exp. I, or over the first 
three outlier sizes as above for Exp. II. In 
Exp. I, individual differences were 1.8 times 
greater for Number than for Space stimuli, 
a result that agrees with Spencer (1961, 
1963). In Exp. II, MSs, was a little lower 
for Guess than for Average instructions, 
and in both slightly lower than the cor- 
responding value from Exp. I. 

Within a single condition, MSs. needs to 
be compared to some measure of within-Ss 
variability. The three-way interaction of Ss 
with size and distance of outlier was used 
for this purpose and yielded ratios that 
ranged from a low of 4.84 for the Guess 
condition of Exp. II to a high of 14.68 for 
the Space condition of Exp. I. These ratios, 
which indicated marked individual differ- 
ences, are conservative since the three-way 
interaction would generally overestimate 
within-Ss variability. Actually, the two were 
about equal here as indicated by separate 
analysis for the main group of size three in 
Exp. II where the complementary sets served 
as replicates. It was thus evident that indi- 
vidual differences were quite large, contrary 
to Spencer (1961, 1963) who reported no 
individual differences in a student population. 


Individual differences are relevant to 
the theoretical interpretation. Their 
very existence necessarily implies sys- 
tematic inaccuracies relative to any 
normative criterion such as the arith- 
metic mean. The present averaging 
model, however, allows for individual 
differences in the w-parameter. Never- 
theless, the validity of the model sub- 
sists only at the level of the single S, 
and the usefulness of group means 
should not obscure that they may aver- 
age out real, if idiosyncratic discrepan- 
cies from the model. Fully satisfactory 
tests will require extended sessions that 
provide enough data for statistical 
analysis of the single S (Anderson, 


1962). 
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Discussion 


The main focus of this discussion is on 
the relation of the data to the averaging 
model. The three main experimental 
conditions are considered in turn. 

The data from the Number stimuli, 
under the Average instructions, fit the 
model quite well. Each outlier group had 
about the same influence at all outlier dis- 
tances as shown by the lack of trend in 
w estimates, This approximate constancy 
of w supports the model assumption that 
the response is an average of the stimulus 
values, 

Of course, these subjective averages 
were not true arithmetic means since the 
observed w was generally greater than the 
arithmetically correct value. This reflects 
a tendency to place arithmetically dispro- 
portionate weight on the outlier. As 
might be expected, this tendency toward 
overweighting varied inversely with out- 
lier size. 

The second condition used the Guess 
instructions with Number stimuli, These 
instructions were designed to set up an 
inconsistency between the main group and 
the outlier, and the primary purpose of 
this condition was to test whether this 
would produce a discounting effect anal- 
ogous to that found in personality im- 
pressions (Anderson & Jacobson, 1965). 
That such a discounting effect was ob- 
tained is shown by the lower w-values for 
Guess than for Average instructions. 
Moreover, the amount of discounting in- 
creased with outlier distance. 

This last result, however, makes it 
open to question whether an averaging 
model can apply to these data. Equation 
1 will necessarily fit if w is allowed to 
vary freely, but this leaves no basis for 
a test of goodness of fit. Ordinarily, at 
least, the adequacy of any model is based 
on some constancy in its parameters, as in 
the Number Average condition just con- 
sidered. On the other hand, it is evident 
that discounting will necessarily depend 
on outlier distance; at near distances, the 

outlier will become part of the main group 
and not be discounted at all, Accordingly, 
the trend in w-values does not invalidate 
an averaging formulation. Of course, the 
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dependence of w on outlier distance would 
need to be accounted for within the model 
before it could be considered adequate. 
Whether, and how this could be done is 
uncertain. It may be noted, however, 
that most of the trend in the Guess condi- 
tion occurred from first to second outlier 
positions. 

It had been hoped that the Space stim- 
uli would show the same pattern as the 
Number stimuli, but this evidently did 
not occur. For comparative purposes, it 
is appropriate to consider only the Aver- 
age instructions. In contrast to the Num- 
ber data, then, the Space data showed a 
general tendency to underweight the 
outlier. Underweighting occurred only 
in certain conditions, however, and ap- 
parent overweighting occurred at certain 
points. More seriously, perhaps, the 
Space data also showed a trend toward 
lower w with increases in outlier distance. 
These results make it questionable that 
the process underlying the behavior is the 
same for the two kinds of stimuli, They 
also raise the more immediate problem of 
evaluating the model, and this requires 
more detailed examination. 

If the criterion of constancy in w is 
maintained, then the model is inadequate 
for the Space data. However, it is pos- 
sible that S is estimating some other mea- 
sure of central tendency than that assumed 
in Equation 1. The median and midpoint, 
both considered by Spencer (1963), are 
obviously incorrect for the present data. 
A weighted midpoint is more reasonable 
but it also turns out to be incorrect as will 
be seen in the course of the following 
analysis, 

Rather than one alternative at a time, it 
is useful to take a large class at once. The 
simplest general formulation would take the 
response as a linear function of the stimulus 
values. The present task and data require 
additionally that this linear function be a 
weighted average, though with different 
weights allowed for each single stimulus 
element of the set. These weights, con- 
sidered separately for the main and outlier 
groups, define weighted means or centers 
that will be denoted by X', and X^. re- 
spectively. It is then straightforward to 
show that the response can be expressed 
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as a weighted average of these weighted 


centers : 
R= (1-—w') Xo +wX".. (2) 


Equation 2 is formally identical to Equa- 
tion 1. Thus, the generalized averaging 
model is formally the same as the original 
model, the only difference being that each 
group of stimuli is represented by its 
weighted center rather than its unweighted 
center as in Equation 1. A closer relation 
between the two models can be obtained if 
Equation 2 is rewritten as, 


R—(1—w)X t wX.-4 C, (3) 


where Xo and X, are the unweighted centers 
of the main and outlier groups, as in Equa- 
tion 1. The only formal difference between 
the general model and the model of Equation 
1, therefore, is the additive constant, C. In 
particular, Equation 3 specifies that the re- 
sponse is a straight-line function of outlier 
distance, Xs. The data generally agree with 
this, though there is some suggestion of real 
nonlinearity at the farthest outlier. 
The expression for C is, 


Cz (1—w') (X-X) Hw (X'—X«).. (4) 


This is just a weighted average of the differ- 
ences between weighted and unweighted cen- 
ters for the main and outlier groups. 

If this formulation is to fit the data, it 
must satisfy two conditions that can be seen 
by reference to the upper panel of Fig. 1. 
First, w’, the weight of the outlier, must be 
a litle less than its arithmetically correct 
value. From Equation 3, w is the theoretical 
slope of the data curves; for the actual data, 
these are less than the slope of the arith- 
metical reference curve in each case. This 
slope difference was also found for the main 
groups of smaller size. 

The second condition is that C be positive, 
decreasing to zero as the size of the outlier 
approaches the size of the main group. This 
follows in part from the first condition. If 
C were zero or. negative, then the slope con- 
dition would require that the data lie below 
the arithmetically correct curve at every 
outlier distance, contrary to fact. 

To see the implications of the condition 
that C be nonnegative, it is necessary to re- 
call the stimulus layout. The main group 
lay on the left, and so was lower, in numer- 
ical terms, than the outlier group on the 
right. The above analysis then implies at 
once that S cannot be estimating a weighted 


midpoint. This would correspond to weights 
of 1 — w' and w’, respectively, for the small- 
est and largest stimulus elements, that is, 
for the leftmost point of the main group and 
the rightmost point of the outlier. These 
points would in fact be the weighted centers 
of the main and outlier groups. But then 
(X',— Xo) in Equation 4 would be negative 
and so also C itself. This illustrates that the 
generalized averaging model need not fit the 
data. 

If the second condition is to be satisfied, 
it is in fact necessary for the two groups to 
be centered toward their interior endpoints. 
That is, X^ must lie to the right of Xo, and 
X', must lie to the left of Xa. Then (X's — 
Xo) and hence C will be nonnegative. At 
the same time, (X',— X.) will be non- 
positive, and C will decrease to zero as the 
outlier approaches the main group in size. 

This accords with the general trend of 
over- and underestimation in Fig. la. The 
value of C will be largest when the main 
group is largest and the outlier group is 
smallest, Thus, the amount of overestima- 
tion should be greatest for the outlier of 
size one as is indeed the case. Less over- 
weighting is found for the outlier of size 
two, and the remaining outliers show under- 
weighting except at the closest outlier dis- 
tance, all in qualitative agreement with the 
given implications. The data for the main 
groups of size three and four also generally 
agree with this analysis. 

To this qualitative agreement, it can be 
added that the value of C would be of the 
right general size. An upper bound on C is 
obtained by taking X's and X", at the interior 
endpoints, and setting w’ equal to its arith- 
metically correct value. For example, this 
yields C = 3.50 for the outlier group of size 
two, expressed in terms of the ordinate scale 
of Fig. 1. This is near the value that these 
data would require as can be seen by pro- 
jecting both curves leftward one unit to the 
origin. 

To sum up, this interpretation of the 
Space data still rests on the assumption 
that S is averaging the stimuli. In con- 
trast to the original model, however, sep- 
arate weights are allowed for the individ- 
ual stimuli within either group of the 
stimulus array. As a consequence, the 
subjective and objective centers of the 
stimulus groups will not in general be 
equal. Instead, the subjective center of 
either group is displaced toward the in- 
terior endpoint of that group, that is, in 
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the direction of the other group. The 
response is then simply a weighted aver- 
age of these subjective centers, with the 
main group receiving a little more than 
its objective share of the weight. 

On intuitive grounds, it is not im- 
plausible that S would in effect center 
both groups of a linear array at their in- 
terior endpoints. From the theoretical 
view, the averaging model developed on 
this assumption is a straightforward gen- 
eralization of the initial model. It pro- 
vides a qualitative, even semiquantitative 
account of some moderately complex data. 
Despite its speculative, post hoc character, 
therefore, it deserves further consideration. 

A strict test of goodness of fit could 
be employed in further work although 
this could present difficulties unless the 
subjective center were exactly at the end- 
point. However, there is an experimental 
analysis that has several advantages. If 
the basic idea is correct, then varying the 
spacing of the dots within one or both 
groups will have predictable effects on the 
pattern of over- and underestimation. The 
more compact each group, the better the 
original model of Equation 1 should be 
satisfied. It would also be of interest to 
use three groups in the stimulus array 
although that would require the introduc- 
tion of a second weight parameter. 

Finally, the general approach taken here 
is of interest from a joint theoretical- 
methodological standpoint. Spencer (1961, 
1963) and Beach and Swensson (1966) 
have placed primary emphasis on response 
accuracy, evaluated against some numer- 
ically "correct" criterion such as mean or 
midpoint. Indeed, Beach and Swensson 
consider high accuracy to be the most im- 
portant result that had been obtained. 
Such results can be quite interesting, of 


course, but they can obscure the basic 
theoretical problem. 

In the present data, the greatest inac- 
curacies were with Number stimuli under 
Average instructions, yet it was this con- 
dition that followed the model best. Under 
Guess instructions, mean accuracy was 
much greater. If the discounting inter- 
pretation is correct, however, this greater 
accuracy is fortuitous and has no sub- 
stantive significance. Under such circum- 
stances, emphasis on a normative criterion 
may retard the development of an ade- 
quate theory. 
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Rats were conditioned to a performance asymptote with either 12- 
pellet or 1-pellet rewards in both black and white runways. Half 
of the 1-pellet Ss continued under the same conditions while the rest 
of the Ss were rewarded with 12 pellets in 1 runway and 1 pellet 
in the other. The Ss trained on 12 pellets showed large negative- 
contrast effects in the 1-pellet runway while similar effects for the 
Ss trained with 1 pellet were much smaller and limited to the early 
sections of the runway. These results were discussed as supporting 
a frustration theory in which the onset of stimuli associated with 
reward reduction elicit an unconditioned emotional response rather 


than an anticipatory response. 


It is well established that shifts from 
larger to smaller magnitudes of a food 
reward produce an initially lower level 
of performance than is the case with Ss 
exposed to only the smaller magnitude. 
Such phenomena, frequently referred to 
as negative contrast effects (NCEs), 
have been extensively reviewed by 
Pubols (1960). A presumably related 
phenomenon has been observed in dif- 
. ferential magnitude of reward condi- 
tioning (Bower, 1961; Ludvigson & 
Gay, 1967) where speed of running in 
a runway associated with a smaller re- 
ward has been found to be slower for 
Ss receiving a larger magnitude in an- 
other runway than for those receiving 
the smaller magnitude in both. Spear 
and Spitzner (1966) have referred to 
the former type of NCE as successive 
(SucNCE) and to the latter as simul- 
taneous (SimNCE). 

Spear and Spitzner (Exp. III) used 
a differential magnitude-of-reward par- 
adigm to investigate the effects of 
SucNCE on subsequent SimNCEs. 
Specifically, rats were ‘trained to run 


1 This study was designed by Kenneth W. 
Spence and run by the second author. The 
paper was written by the first author. 


in black and white runways with either 
1-pellet rewards in both, or a 12-pellet 
reward in one and 1 pellet in the other. 
After a SimNCE was obtained in this 
phase of the study, all Ss received 1 
pellet in both runways. In this second 
phase there was some tendency for Ss 
with prior 12-pellet experience to run 
slower in the formerly large magnitude 
runway, but the effect was small and 
significant only when the larger mag- 
nitude had been given in the black run- 
way. 

From a frustration theory (Amsel, 
1958, 1962; Spence, 1960), the small- 
ness of Spear and Spitzner's (1966) 
SucNCE might be expected on the 
basis of running (Rz) having been con- 
ditioned to the stimulus consequences 
(st) of anticipatory frustration (r) in 
either or both runways during differ- 
ential conditioning. A more likely 
technique for obtaining a SucNCE- 
SimNCE interaction might be to ob- 
serve the magnitude of SimNCEs as a 
function of various successive magni- 
tude shifts. Thus the study to be re- 
ported here involved training Ss with 
the same large or small magnitude of 
reward in both black and white run- 
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ways, followed by differential training 
with the large magnitude in one and the 
small in the other. The primary con- 
cern was whether SimNCEs in the dif- 
ferential phase would be related to the 
predifferential magnitude of reward. 


METHOD 


Subjects.—The Ss were 24 male hooded 
rats, approximately 100 days old, which were 
obtained from Simmonson Laboratories. 

Apparatus—The apparatus consisted of 
a black and a white wooden runway, each 
43 in. long, 34 in. wide, and 4 in. deep. A 
single midgray start box, 9X2*X4 in, 
served both runways. The start box was 
separated from the runways by a midgray, 
inverted-guillotine-type door. The last 12 
in. of each runway was set off by a conven- 
tional guillotine-type door to form a goal 
box which contained a small glass furniture 
coaster for presenting .045-gm. Noyes pel- 
lets, Response times were recorded by 
Hunter Photo-relays and Klockounters from 
the opening of the start-box to a point 1 ft. 
into the runway (start), over the next 6 in. 
(run), and in a section 6 in. on either side 
of the goal-box door (goal). The entire 
apparatus was covered by Plexiglas lids 
and surrounded by black curtains hung from 
a translucent acetate sky suspended 3 ft. 
above the apparatus. A low level of homo- 
geneous illumination was provided by a large 
number of 7-w. light bulbs mounted 1 ft. 
above the sky and controlled in series by an 
autotransformer. 

Procedure.—Two weeks before and 
throughout the study Ss were maintained on 
a 12 gm/day ration, including food obtained 
in the experimental apparatus. The adjusted 
ration was always given in the home cages, 
15 min. after S’s experimental session. The 
Ss were run rotationally in squads of six 
and had free access to water in the carrying 
cages in which they spent the 5-min. inter- 
trial intervals. The Ss were handled for 
4 days and on the fifth day were allowed 
to explore the unbaited apparatus for 2 min. 
in each runway. During these 5 preliminary. 
days Ss were also fed 10 Noyes .045-gm. 
pellets/day in their home cages. The 24 
Ss were then assigned randomly and equally 
to three groups. 

In Phase I each S received 156 trials. 
The first 6 trials were given at the rate of 
3/day and the remainder at 6/day. Three 
of these daily trials were in the white run- 


way and 3 in the black For Group 
DL all trials terminated in a 12-pellet re- 
ward while Groups DS and C always re- 
ceived 1 pellet. Throughout the experiment 
two 6-trial orders of alleys were used: 
abbaab and baabba. These orders were run 
in double alternation across days and for 
half of the Ss in each group "a" was the 
white alley and "b" the black while the re- 
verse was true for the rest of the Ss. Phase 
II consisted of 168 additional trials. Group 
C continued in the same manner as in Phase 
I, but Groups DL and DS always received 12 
pellets in one alley and 1 in the other, with 
the pairings of alley color and reward mag- 
nitude being counterbalanced within each 
group. 

On any trial in the experiment S was re- 
moved from its carrying cage and placed 
in the start box. When S had been oriented 
to the start-box door for 1 sec., the door 
was dropped allowing traversal of the run- 
way and consumption of the reward, imme- 
diately after which $ was returned to the 
carrying cage. 


RESULTS AND DISCUSSION 


Phase I.—The mean start, run, and 
goal speeds for 1-pellet (C and DS) 
and 12-pellet (DL) Ss in successive 
12-trial blocks of Phase I are shown in 
Fig. 1. Separate Groups X Blocks 
analyses of variance for each of the 
three measures all yielded F-ratios sig- 
nificant beyond at least the .025 level 
for both main effects and their inter- 
action. The significant Groups effects 
reflected the consistently faster running 
of Group DL. 

Phase II —Mean start, run, and goal 
speeds in successive blocks of 12 trials 
to each stimulus for Groups DL and 
DS are shown in Fig. 2. Also shown 
are the speeds of Group C averaged 
over both runways. Separate Groups 
X Stimuli X Blocks analyses of vari- 
ance yielded Stimuli and Groups X 
Stimuli effects which were significant 
beyond the .001 level for each of the 
three response measures. With this 
result as evidence that discrimination 
had occurred, the data for positive (12- 
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Fic. 1. Mean start, run, and goal speeds 


for the groups in Phase I. 


pellet) and negative (1-pellet) stimuli 
were separated. 

Analyses of variance of Groups DL 
and DS over blocks of 12 trials to the 
positive stimulus yielded significant 
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Fic. 2. Mean start, run, and goal speeds 
on positive and negative trials for the groups 
in Phase II. 
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Groups effects in the start, F (1, 14) 
= 5.29, p < .05, and goal, F (1, 14) = 
6.38, p < .025. There was also a sig- 
nificant Blocks effect in the goal, F (6, 
84) 21142, p < .001. All other in- 
teractions and main effects failed to 
reach the .05 level. 

Analyses of variance of all three 
groups over blocks of 12 trials to the 
negative stimulus showed highly sig- 
nificant Groups, Blocks, and Groups X 
Blocks effects in all three measures. 
Orthogonal comparisons within these 
Groups effects under the negative stim- 
ulus indicated that the slower running 
of Group DL, as compared to DS and 
C, was statistically significant beyond 
the .01 level in all three sections of 
the runway. In no case, however, was 
there a reliable difference between 
Groups DS and C. These results may 
be taken to indicate that a SimNCE 
occurred in Group DL and that it was 
larger than any similar effect for DS. 
That Group DS did show some Sim- 
NCE early in the runway is indicated 
by a significant difference between the 
mean start speeds of Groups DS and 
C over the last five blocks of trials to 
the negative stimulus, t(14) = 243, p 
<.05. Similar analyses for the run 
and goal measures failed to yield sig- 
nificant ts. 

Trend analyses of the Blocks effects 
for all groups under the negative stim- 
ulus yielded quadratic trends which 
were significant beyond the .001 level 
in all three sections of the runway. 
These trends may be taken as verifying 
the reliability of the eventual decrement 
of SimNCEs which is apparent in Fig. 
2. It should also be noted that Fig. 2 
offers some evidence that maximal 
SimNCE eventually occurred in the 
early sections of the runway. This re- 
sult is indicated in Group DL by the 
obvious fact that SimNCEs decre- 
mented most rapidly in the goal section 
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while being most persistent in the start. 
A similar finding is represented by the 
previously mentioned fact that only the 
start measure yielded statistically sig- 
nificant SimNCE in Group DS. 

The fact that the magnitude of Sim- 
NCEs was found to be an increasing 
function of predifferential reward mag- 
nitude in the present study suggests 
that a SucNCE-SimNCE interaction is 
easily obtainable. Within a condi- 
tioned-frustration theory (Amsel, 
1962) this result would be expected 
since the large expectancy (fg) con- 
ditioned in Phase I for Group DL 
should elicit greater amounts of frustra- 
tion (Rp) in the negative goal box dur- 
ing Phase II. 

While the SucNCE-SimNCE inter- 
action obtained in the present study 
was in agreement with a conditioned 
frustration theory, the eventual occur- 
rence of maximal inhibition early in 
the runway raises a serious question 
concerning this interpretation. Accord- 
ing to a conditioned frustration theory, 
Ry elicited in the goal box occurs only 
in the fractional, anticipatory form (rr) 
earlier in the runway as a result of 
stimulus generalization and higher- 
order classical conditioning. In the 
absence of complicating phenomena, 
maximum inhibition would thus be ex- 
pected to occur nearest the goal box. 
The present results are comparable to 
those of Ludvigson and Gay (1966) 
and Peckham and Amsel (1967) in 
their disconfirmation of this prediction. 

In investigating these early intratrial 
maximums of NCEs, Ludvigson and 
Gay (1967) found that the use of dif- 
ferential start boxes further displaced 
maximum inhibition from start speeds 
to a measure of the latency of start-box 
orientation. On the basis of these re- 
sults they concluded that Rp might oc- 
cur shortly after presentation of the 
negative stimulus, in response to a sud- 


den decrement in rz. The Ry, it was 
speculated, might then slowly dissipate 
so that performance later in the runway 
would eventually reach a level deter- 
mined by the rg conditioned by the 
small reward. Implicit in this theo- 
rizing, of course, is the assumption that 
any motivational enhancement (FE) 
produced in locomotor behavior by Ry, 
will not be detectable until competing 
responses elicited by Sy have subsided. 
The results of the present study are 
obviously in agreement with such a 
primary-frustration interpretation. 
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MOTOR PERFORMANCE ON TEMPORAL TASKS AS A 


FUNCTION OF SEQUENCE LENGTH 
AND COHERENCE * 
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2 experiments on pursuit tracking of temporal patterns are given in 
which the effects of the coherence and the length of the patterns are 
described, both for outcome performance scores and indexes of 
response organization. In the lst experiment, both eye movements 
and manual control movements were recorded continuously for se- 
lected trials. Integrated error scores showed little improvement rela- 
tive to a random pattern except for the shorter and most coherent 
patterns, Eye and hand movements were highly coordinated in time 
but both showed a range effect, leading targets of long duration 
while lagging short-duration targets. Experiment II provided further 
evidence on the length-difficulty relation, supporting the evidence from 
verbal learning that increases in length do not disproportionately 


increase difficulty. 


In earlier reports on the role of task 
variables in serial-motor learning, 
major emphasis was placed on the co- 
herence, or predictability, of stimulus 
patterns in terms of the spatial location 
of successive events (Cross, 1966; 
Noble, Trumbo, Ulrich, & Cross, 1966; 
Trumbo, Noble; Cross & Ulrich, 
1965), and with the number of discrete 


events in these spatial patterns (Swink, 


Trumbo, & Noble, 1967; Trumbo, 
Noble, & Ulrich, 1965). 

With one exception (Cross, 1966), 
these studies involved series of events 
with spatial uncertainty but without 
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temporal uncertainty; event durations 
were held constant at 1 sec. That is, 
there was uncertainty as to where the 
events would occur and S’s task was 
to learn the sequence of spatial loca- 
tions of the events. 

Under these conditions it was 
shown, with respect to task coherence, 
that relatively small proportions of un- 
predictable events in an otherwise pre- 
dictable series disproportionately in- 
creased tracking error. Thus, with as 
few as 4 of the events unpredictable, 
error scores were no better than with 
a completely random sequence, On 
the other hand, as the number of events 
in a sequence of events was increased 
from 5-8 to 10-12, there was little ef- 
fect either on rate of improvement or 
on terminal performance level, but 
further increases in length resulted in 
performance decrements. Finally, long- 
term forgetting was related more 
clearly to changes in the timing of re- 
sponses than to changes in spatial accu- 
racy of responses. However, it seems 
likely that changes in timing accuracy 
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did not represent a loss of timing ability 
per se, but instead, having forgotten 
the spatial locations, Ss were unable to 
accurately time their movements in 
tasks which require anticipatory re- 
sponses which must be accurate both 
spatially and temporally. 

The argument concerning perform- 
ance decrements (involving changes in 
timing) generated interest in the pres- 
ent experiments in which coherence was 
varied in the temporal rather than in 
the spatial domain and sequence length 
was defined in terms of the number of 
events (durations) in a series of tar- 
get events. The major purpose, there- 
fore, was to evaluate the roles of task 
coherence and task length on perform- 
ance in tasks involving sequences of 
events of varied durations. 

The question of task length in the 
present tasks is analogous to the clas- 
sical length-difficulty question in verbal 
learning. However, rather than dis- 
crete measures of performance, the 
present tasks have the advantages of 
continuous error scores (Bahrick, 
1964; 1965), Furthermore, the use of 
a continuous-response tracking task 
permitted fine grained analyses of the 
temporal-spatial organization of be- 
havior in the face of temporal un- 
certainty. 


EXPERIMENT I 


The first experiment was designed 
primarily to evaluate the effects of co- 
herence of input sequence on perform- 
ance in a temporal tracking task. In 
addition, two conditions of sequence 
length provided opportunity to examine 
the effect of this second task variable. 
Finally, both manual and visual track- 
ing data were obtained, making it pos- 
sible to compare response patterns of 
the eye and the hand under the various 
ask conditions. 
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Method 


Subjects.—Sixty right-handed male uni- 
versity students between 17 and 25 yr. of 
age served as Ss. None of the Ss wore 
glasses nor reported any known visual de- 
fects. All Ss received class credit and/or 
were paid for their services. 

Apparatus.—The VETA tracking appara- 
tus, which has been described in detail else- 
where (Trumbo, Eslinger, Noble, & Cross, 
1963) was used in both experiments. The 
display consisted of a 4-in. vertical target 
and cursor hairlines displaced along the x- 
axis of a 5-in. oscilloscope. The position 
and duration of the stimulus (target) line 
was controlled by a punched paper-tape pro- 
gram, read out by a tape reader. The posi- 
tion of the cursor line was controlled by an 
arm control. The control, which was fitted 
to the right side of a steel chair, consisted 
of a lateral arm rest, pivoted at the elbow, 
with a hand grip adjustable to the length of 
the forearm. The control was positional and 
freemoving with a control-to-display ratio 
of 11° arc to 1 in. 

An operational amplifier manifold was 
used to obtain the absolute momentary 
error, a voltage value directly proportional 
to the distance between target and cursor 
lines, and to integrate this value with respect 
to time, yielding an integrated absolute error 
score. The latter score was recorded from 
a voltmeter after each trial and served as 
the principal performance criterion. 

Additional apparatus was included to re- 
cord the lateral eye movements of Ss. The 
basic system was the Model F-20 Eye-Move- 
ment Sensor developed by George Kelk, 
Ltd. This system consisted of a head band, 
on which was mounted a light source and 
photocell unit, and a control unit which con- 
tained the operating controls and a stabi- 
lized power supply for the lamp and photo- 
cell. In operation, the light source was posi- 
tioned to illuminate the eye at the margin 
between the sclera and iris when S' was 
fixating upon the center of the field. Lateral 
eye movements exposed greater or lesser 
portions of the sclera with a corresponding 
change in reflectance to the photocell and, 
consequently, a change in the voltage out- 
put of the system. 

A bite board was used to restrict head 
movements. The display was located 22 in, 
in front of the bite board and slightly above 
eye level for the average S. 

Tasks—All Ss received stimuli in the 
form of a step-function input of regular 
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amplitude (31.125 in. from the center of the 
scope, or 5.86^ visual angle), but the event 
durations, or dwell times, varied from .3 
sec. to 1.8 sec. in .3-sec. intervals. All trials 
consisted of 54 target events and were ap- 
proximately 58 sec. long. 

Experimental conditions,.—The design in- 
cluded seven experimental conditions, These 
included five values of sequence coherence 
and two sequence lengths in an incomplete 
design. Coherence was defined in terms of 
the proportions of the sequences of target 
durations which were fixed and system- 
atically repeated on each repetition of the 
sequence. Thus the completely coherent, 
or fixed, tasks consisted of six (Task F-6) 
or nine (Task F-9) target events the dura- 
tions of which, once selected randomly, were 
fixed and repeated within and throughout all 
trials. For Task F-6 the sequence of these 
durations was: .6, 9, 12, 9, 18, and .3 
sec., and for Task F-9 it was: .6, 12, .6, 
3, 1.5, 12, 1.8, .9, and .6 sec. 

Intermediate degrees of coherence were 
obtained by varying the proportions of tar- 
get durations which were selected anew 
and randomly (and which events were, 
therefore, unpredictable) on each repetition 
of the otherwise fixed sequence. Thus, be- 
ginning with the F-6 sequence, in one case 
the duration of the third target event was 
chosen randomly (Task I-%) and, in the 
other case, the durations of the second and 
fourth events were chosen randomly on each 
repetition of the six events (Task I-76). 
Similarly, beginning with the F-9 sequence, 
the third, fifth, and eighth event durations 
in one case (Task I-%) and the third and 
sixth event durations in the other case 
(Task I-%) were selected randomly for 
each repetition of the nine-event sequence. 
Thus, Tasks I-34 and 1-56 contained the 
same proportions of unpredictable durations, 
but differed in sequence length. The seventh 
task (Task R) was a random series of the 
six event durations, 270 targets, or five 
trials, long. 

Three control groups received Tasks F-9, 
1-8, and R, but without eye-movement ap- 
paratus or bite board. They were included 
to evaluate main effects and possible inter- 
action effects of eye-movement apparatus 
and task conditions. 

Procedures.—The Ss were assigned ran- 
domly, six each, to the 10 task conditions. 
All Ss were given 15 and 25 trials, respec- 
tively, on two successive days. On Day 1 
detailed instructions included a description 
of the tracking task, and the error score cri- 


terion as well as a demonstration of the 
value of rapid anticipatory movements for 
minimizing error scores. Repetitions of the 
basic sequences within trials were separated 
only by a brief jump of the target to the 
center of the scope. This cue was explained 
to Ss, and they were told that their error 
scores would be fed back aíter each trial. 
Instructions were followed by a three-point 
calibration of the eye-movement sensor. 
On Day 2, Ss were told only that the task 
remained the same. 

Target, control output, and eye-sensor 
output were recorded via an F-M tape sys- 
tem for 12 selected trials for subsequent 
readout and hand scoring. 


Results 


Integrated error.—lntegrated abso- 
lute error data, in arbitrary voltage 
units, are presented in Fig. 1 for the 
seven experimental conditions. Data 
for the three control conditions are not 
shown, since a 2 X 3 analysis of vari- 
ance on the last block of trials failed to 
show any difference between Experi- 
mentals (E) and Controls (C), F (1, 
30) = 1.55, p > .05, or any interaction 
of E and C with the three common task 
conditions, F (2, 30) = 1.33, p > .05. 
Thus, the eye-movement apparatus had 
no appreciable effect on criterion per- 
formance. 

Inspection of Fig. 1 indicated that 
only the shorter of the two fixed se- 
quences (F-6) and the most redundant 
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Fic. 1, Integrated error scores for Exp. 
I. (Data are for 5-trial blocks with Trial 
1 plotted separately.) 


400 TRUMBO, NOBLE, FOWLER, AND PORTERFIELD 


of the *I" conditions (1-6) showed 
marked improvement in error scores 
relative to the random (R) condition. 
This observation was supported by an 
F test for Block 8 data, which showed 
significant differences among condi- 
tions, F (6, 35) = 475, p < 01, but 
the only reliable differences were be- 
tween Tasks F-6 and I-% and the 
remaining conditions (p< .05), as 
shown by Duncan's range test. Thus, 
while the two fixed conditions of var- 
ied length (F-6 and F-9) differed sig- 
nificantly, comparable I conditions 
(1-% and 1-%) did not differ, indi- 
cating that sequence length may have 
relatively more effect on performance 
with completely coherent than with 
partially coherent sequences. 

Timing — performance.—Continuous 
oscillographic records were obtained 
on Trials 1, 15, and 35 for the three 
Ss in each task with the middle error 
scores on the last block of five trials. 
These records were scored for lead-lag 
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of manual control movements and lead- 
lag of eye movements, both relative to 
target movements. Leads or lags were 
measured to the nearest 50 msec. as 
the time between target displacement 
and the initiation of the primary pur- 
suit movement, and means were com- 
puted for the 54 events per trial. 

Figure 2 summarizes the lead-lag 
data for both eye movements and man- 
ual control movements and their differ- 
ences for all experimental groups. On 
Trial 1 manual control movements 
lagged target displacements for all 
groups, especially for the experi- 
mentals, by about 220 msec. By Trial 
15, all groups had reduced their mean 
lags and Ss in Tasks F-6 and I-% 
were, on the average, leading the tar- 
get displacements. By Trial 35 lead 
scores tended to be inversely related to 
error scores (with the exception of 
Task F-9) and all groups led or had 
essentially zero lead-lag means by the 
end of training. 
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Fic. 2. Lead-lag data for eye (left) and arm (center) movement relative to target 


displacements, Exp. I. 


(Differences between visual and manual are shown on the right. 


Data are for three median Ss from each task condition.) 
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TRIAL 15 


TRIAL 35 
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Fic. 3. Envelopes showing median and range of lead-lag scores for eye (upper) and 


arm (lower) at each target duration. 
groups, Exp. I.) 


The eye-movement data closely paral- 
leled that for manual control move- 
ments. Not only was the overall con- 
figuration of the data highly similar, 
but also the curves for the individual 
task conditions were nearly identical. 
This correspondence of eye and hand 
movements was most evident in the 
difference data, shown in the right- 
hand portion of Fig. 2. Lead-lag 
Scores for the two responses differed, 
on the whole, by less than 100 msec., 
with the hand leading the eye by as 
much as 50 msec. on Trial 1, but with 
the eye tending to lead the hand on 
later trials. The amount by which the 
eye led the hand does not appear to 
be systematically related to integrated 
error scores, sequence length, or task 
coherence in these data. 


(Envelopes include means for all experimental 


A second timing analysis compared 
lead-lag performance separately for 
each target duration. Leads and lags 
were combined in terms of the dura- 
tions of the targets from which the 
responses were made. The results are 
summarized in Fig. 3 as envelopes of 
the data, showing medians and ranges 
for the means of the groups. Presenta- 
tion of the curves for the individual 
task conditions did not add clarity to 
Fig. 3, but, generally, the means for the 
more predictable tasks tended to be 
near the top of these envelopes. 

Again, the eye and hand data were 
highly similar, both from target dura- 
tion to target duration and from trial 
to trial. On Trial 1, all targets were 
lagged by about 250 msec., with only 
a slight tendency for greater lags to be 
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associated with the shorter target dura- 
tions, However, by Trial 15 the 
shortest target durations were lagged 
while the longest durations were led, 
and by Trial 35 this was generally true 
of all groups. Leads for the task con- 
ditions represented at the top of the 
envelopes reached values of over 750 
msec, for the 1.5- and 1.8-sec. target 
durations. The net result illustrated in 
Fig. 3 was a pronounced range effect, 
with the mean response very near the 
mean target duration of 1.05 sec. Thus, 
Ss appeared to smooth out the tem- 
poral pattern and to respond with 
much less variability of duration than 
is present in the target events. This 
tendency appeared to be of roughly the 
same magnitude regardless of sequence 
length or task coherence. That is, Ss 
in the more coherent, more highly 
repetitive tasks failed to show appre- 
ciably more differential responding to 
the intervals than those in the less 
coherent and random tasks. 


EXPERIMENT II 


The results of Exp. I showed that 
the shorter sequences led to better per- 
formance than the longer sequences, 
just as with irregular step-function se- 
quences with constant dwell time 
(Swink et al, 1967; Trumbo, Noble, 
& Ulrich, 1965). The second experi- 
ment was designed to describe in more 
detail the length-difficulty relationship 
in a temporal task, to provide more 
extensive practice, and to study long- 
term retention of temporal sequences. 


Method 


Subjects—A total of 100 right-handed 
male university students were assigned, 10 
each, to 10 experimental conditions. All Ss 
received research participation credit and/or 
pay for their services. 

Apparatus.—The tracking apparatus was 
essentially identical with that in Exp. I, but 
no eye-movement apparatus or bite board 
was used. 
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Task conditions.—Tasks were similar to 
those in Exp. I except that target durations 
were from .5 to 1.75 sec. in .25-sec. intervals. 
Eight of the 10 tasks constituted a 2X4 de- 
sign with two conditions of task coherence 
(fixed and I-44) and four conditions of se- 
quence length (4, 8, 12, and 16 targets). 
The remaining tasks were: (a) random, 
as defined in Exp. 1, and (b) a fixed task 
six targets long to provide additional data 
on the length-difficulty relationship for fixed 
sequence. The partially coherent (I-M) 
tasks were programmed with every fourth 
target selected anew on each repetition and 
designated: Tasks I-M, I-2& I-M», and 
I-o. 

Procedures.—Procedures, including in- 
structions and feedback, were those used in 
Exp. I, except that conditions related to 
eye movement recording were not included. 
The Ss were scheduled for 15, 20, 20, and 
25 trials in four daily %4-hr. sessions, One- 
half of each group returned after 1 wk., 
the other half after 1 mo., for 20 retention 
trials. All Ss were randomly assigned to 
conditions and to the training schedule. 


Results 


Integrated error.—Integrated error 
data are shown in Fig. 4, with fixed 
task conditions on the left and I task 
conditions on the right. Data for Task 
R are presented in both halves of Fig. 
4 for comparison purposes. The re- 
sults of the 20 retention trials are 
shown only as a single data point repre- 
senting the combined 1-wk. and 1-mo. 
results for the first block of five trials, 
since data for the two retention sessions 
did not differ, nor were there any sig- 
nificant losses at retention. 

The data in Fig. 4 indicated that per- 
formance on the fixed sequences was 
somewhat superior to that on the I 
sequences for series of the same length, 
but this was true only for the shortest 
sequences. With sequence lengths of 
12 and 16 intervals, fixed and I-} tasks 
do not appear to differ from one an- 
other or from the random task (R). 

An F test of the Day 4 means yielded 
a highly significant F (9, 90) = 7.47, p 
< 001. A separate 2 X 10 analysis of 
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Fic. 4. Integrated error data for Fixed (left) and "I" (right) conditions, Exp. II. 
(Retention data points, Ret, indicate means for the initial five trials for combined 1 


wk. and 1 mo. tests. 


the retention data indicated differences 
among tasks, F (9, 80) — 803, p< 
001, but not between retention inter- 
vals, F (1, 80) = 222, p > .10, or for 
the Task-Interval interaction, F (9, 
80) «1.00. A repeated measures 
analysis of means for last acquisition 
and first retention blocks (pooled over 
1 wk. and 1 mo.) indicated no signifi- 
cant sources of variance other than 
among tasks. The between-trials F (1, 
90) < 1.00 in the latter analysis sup- 
ported the observation that no change 
occurred as a function of the retention 
intervals. 

A final analysis of variance was per- 
formed on the means for the last block 
of acquisition trials for the four se- 
quence length conditions common to 
the two levels of coherence to test the 
main effects of coherence and length 
and their interaction. Thus, the analy- 
sis was a 2 X 4 factorial with two con- 
ditions of coherence (Fixed and I-1) 
and four sequence lengths (4, 8, 12, 
16). The results showed that both task 
coherence, F (1, 72) = 412, p < .05, 
and sequence length, F (3, 72) = 9.95, 


Trial 1 of acquisition is plotted separately.) 


p <.001, were significant sources of 
variance at the end of training, but that 
their interaction was not significant (F 
« 1.00). 

Thus, the error data indicated sig- 
nificant variance among the 10 experi- 
mental groups at the end of training, 
significant portions of which could be 
attributed to sequence length and to 
task coherence, but not to their inter- 
action. There were no significant 
losses at retention and scores after 1 
wk. and 1 mo. retention intervals did 
not differ. 

Sequence length and amount of prac- 
tice—A line drawn vertically at any 
point on Fig. 4 defines equal practice 
time for all task conditions. The func- 
tions relating sequence length to per- 
formance, using successively the means 
for Blocks 2, 4, 6, and 8, do not indi- 
cate that error increases disproportion- 
ately with increases in length. In fact, 
for Block 8 a fourfold increase in 
length (F-4 to F-16) is associated with 
less than 100% increase in error, while 
doubling length (F-8 to F-16) is ac- 
companied by less than 2096 increase 
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inerror. Similar results were found at 
the earlier training blocks. 

The length-difficulty relation. —The 
sequence length-task difficulty relation 
is more clearly indicated by a second 
comparison. A line drawn horizontally 
at any point across Fig. 4 defines an 
equal performance criterion for all 
groups. Taking an arbitrary value of 
8.5-v. error, which includes all experi- 
mental groups, then estimating from 
the abscissa the number of trials which 
each group required to reach this cri- 
terion, and multiplying by the number 
of repetitions per trial for each task 
condition, it was possible to describe 
task difficulty in terms of the number 
of repetitions required to reach cri- 
terion performance. The results are 
shown for both Fixed and I groups in 
the left hand portion of Fig. 5. Again, 
there is no evidence that difficulty in- 
creases disproportionately with in- 
creases in sequence length. In fact, a 
four-fold increase in length (4-16 
events) resulted in only a 60% in- 
crease in the number of repetitions re- 
quired to reach the 8.5-v. criterion. 
Thus, the disproportionality is in the 
opposite direction : longer sequences re- 
quire disproportionately fewer repeti- 
tions than shorter lists. Or, stated dif- 
ferently, the number of repetitions per 


Repetitions to Criterion 
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Fic. 5. The length-difficulty relationship. 
(Estimated total number of sequence repe- 
titions. to a performance criterion of 8.5 v., 
left, and estimated repetitions per target, 
right, as a function of sequence length.) 
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item, or event, decreases as the number 
of events increases. This is also shown 
in the right-hand portion of Fig. 5, 
where the ordinate represents the num- 
ber of repetitions required to reach the 
criterion, divided by the number of 
items in the sequence. The fixed se- 
quence of 16 targets, eg., required 
about 170 repetitions, or the equivalent 
of 7.5 repetitions for every item in the 
sequence, whereas the F-4 sequence re- 
quired about 75 repetitions, or nearly 
19 repetitions for every item. 


Discussion 


Both experiments indicated that diffi- 
culty, as defined by error scores, does not 
increase disproportionately with increases 
in sequence length under conditions of 
equal practice time. In fact, increases in 
length appear to be associated with rela- 
tively small increases in error. This is 
generally true for any amount of practice, 
after the first few trials, and for both the 
completely coherent and the partially co- 
herent sequences. 

The relation of sequence length and task 
difficulty in the present task is highly sim- 
ilar to that found in verbal serial learning 
(Deese, 1958; Thurstone, 1930). The 
analysis summarized in the right hand 
portion of Fig. 5 is analogous to that per- 
formed by Deese (1958) on verbal data 
which showed repetitions per item to de- 
crease with increases in number of items. 

The analogy with verbal learning may 
be closer than would appear at first 
glance. In order to reduce error, Ss in 
our tasks had to move in anticipation of 
target events in a manner not unlike the 
verbal anticipation of items in the serial 
list. One difference, and one that repre- 
sents some advantage to the present 
method, is that the authors had a con- 
tinuous measure of error, whereas with 
the verbal lists responses must be scored 
categorically as correct or incorrect. 

In Exp. I, only the shortest fixed se- 
quence and the most coherent of the I 
sequences showed appreciably more im- 
provement than the completely random 
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task after 40 trials, This was generally 
true in the second experiment, also, al- 
though the additional practice resulted in 
somewhat greater differential perform- 
ance among the longer sequence condi- 
tions, particularly for the complete co- 
herent sequences. 

The fine grained analyses in Exp. 
I revealed, first, that integrated error 
scores were rather consistently and nega- 
tively related to average lead time, a rela- 
tionship which was expected on the basis 
of our prior studies and from analysis of 
the task requirements. Of greater inter- 
est was the relationship between manual 
and eye movements, which was one of 
generally high correspondence, but with 
evidence that hand movements were ini- 
tiated before eye movements early in prac- 
tice, followed by the eye leading the hand 
on later trials. It seems likely that this 
change in the eye-hand relationship comes 
about when the temporal pattern has been 
reasonably well learned, that is, when 
there is perceptual anticipation of the 
series of temporal intervals. Before that 
point, S may respond sooner with the 
arm simply because of the response set in- 
duced by the task requirements and the 
instructions. 

The lead-lag analysis by target duration 
indicated a range effect for all task condi- 
tions, with extremely long and nonoptimal 
leads, especially for the more coherent 
tasks, yet these latter tasks showed the 
greatest reduction in integrated error 
scores. However, since lags are very 
costly of error, it may be that the non- 
optimal leads on the long durations were 
more than compensated for by the shorter 
lag times on the short duration events by 
Ss on the more coherent tasks. That is, 
while these Ss overanticipated the long 
durations, their timing performance was 
more nearly optimal on the short dura- 
tion events, 

The range effect is a rather universal 
phenomenon and, on the one hand, would 
be predicted at least early in practice on 
empirical as well as theoretical (Helson, 
1964) grounds. On the other hand, under 
the conditions of the present studies it 
might be expected that with sufficient 
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practice finer differentiation of temporal 
intervals would occur. Whether this 
would occur with additional practice re- 
mains to be determined. In any event, 
with the level of practice provided, S's 
approached a fixed-frequency response 
pattern, rather than one which more 
closely approximated the input pattern. 

Another finding which recurred with 
both fixed and I patterns was an apparent 
discontinuity in the length-difficulty rela- 
tion in the neighborhood of eight-item 
sequences, Both, when integrated error 
is plotted as a function of sequence length 
and when the number of repetitions to 
criteria is plotted as a function of se- 
quence length, the eight-item sequences 
appeared disproportionately more difficult 
than either shorter or longer sequences. 
This might have gone unnoticed except 
for the fact that a similar phenomenon 
was observed in the earlier studies using 
spatial, rather than temporal patterns 
(Swink et al, 1967; Trumbo, Noble, & 
Ulrich, 1965). It may be that the in- 
formation processing strategies are differ- 
ent for shorter than for longer sequences, 
in which case there could be some critical 
length near the upper limit of the strategy 
for shorter sequences which would lead to 
poorer performance. The fact that the 
discontinuity occurs near the immediate 
memory span may provide a clue to the 
strategies involved. For example, it may 
be that shorter sequences are handled as a 
single "chunk," whereas longer sequences 
are organized into smaller chunks. At 
some point, possibly in the range of eight 
items, the single chunk approach may be 
an attractive, but ineffective, information 
processing strategy. Further research is 
needed to test these possibilities. 

Finally, with respect to the question 
raised in the introduction regarding the 
forgetting of skill, the data from Exp. IT 
indicate that the losses observed in earlier 
studies involving spatial patterns reflect 
forgetting of the spatial pattern rather 
than losses of timing. This is indicated, 
despite the fact that in the earlier studies 
increased error was correlated with timing 
changes, since, when in Exp. II no spatial 
pattern had to be remembered and the 
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task was therefore primarily one of tim- 
ing, no losses occurred at retention. 


REFERENCES 


Bauricx, H. P. Retention curves: Facts 
or artifacts? Psychological Bulletin, 1964, 
61, 188-194. 

Bauricx, H. P. The ebb of retention. 
Psychological Review, 1965, 72, 60-73. 
Cross, K. Discrete tracking proficiency as 
a function of temporal, directional, and 
spatial predictability. Unpublished doc- 
toral dissertation, Kansas State Univer- 

sity, 1966. 

Drresg, J. The psychology of learning. 
(2nd ed.) New York: McGraw-Hill, 
1958. 

Hexson, H. Adaptation-level theory. New 
York: Harper & Row, 1964. 

Noste, M., Trumso, D. UrmcH, L, & 
Cross, K. Task predictability and the 
development of tracking skill under ex- 
tended practice. Journal of Experimental 
Psychology, 1966, 72, 85-94. 


TRUMBO, NOBLE, FOWLER, AND PORTERFIELD 


Swink, J., TRuMBo, D., & Nose, M. On 
the length-difficulty relation in skill per- 
formance. Journal of Experimental Psy- 
chology, 1967, 74, 356-362. 

Tuurstone, L. L. The relation between 
learning time and length of task. Psy- 
chological Review, 1930, 16, 44-53. 

Trumso, D. EsrriNcER, R., Nose, M., & 
Cross, K. A versatile electronic track- 
ing apparatus (VETA). Perceptual & 
Motor Skills, 1963, 16, 649-656. 

Trumpo, D., Nos, M., Cross, K, & 
UrnmrcH, L. Task predictability in the 
organization, acquisition, and retention of 
tracking skill Journal of Experimental 
Psychology, 1965, 70, 252-263. 

Trumpo, D. Nose, M. & UrmcH, L. 
Number of alternatives and sequence 
length in acquisition of a step-function 
tracking task. Perceptual & Motor Skills, 
1965, 21, 563-569. 


(Received June 26, 1967) 


Journal of Experimental Psycholo 
1968, Vol. 77, No. 3, 407-414 


EFFECT OF ABSOLUTE AMOUNT, MEAN AMOUNT, AND 
PATTERN OF REINFORCEMENT ON ACQUISITION 
AND EXTINCTION ' 


GEORGE C. JERNSTEDT ? 
Johns Hopkins University 


In this study the separate effects of absolute volume (V) and mean 
volume (M) of reinforcement, ratio of reinforcement (P), and num- 
ber of successive nonreinforcements (n) were examined. 84 rats 
made 1,024 instrumentally conditioned licks per day for either large 
or small volumes of saccharin solution on reinforced responses. 4 
different ratios of reinforcement were used, with n occurring in blocks 
of either 8 or 16. 5 values of M were produced by the design. The 
p results were that: (a) larger V in acquisition led to decreased 
" resistance to extinction (Rn), (b) P closer to 1/1 led to increased 
rate of responding in acquisition (Rt) and decreased Rn, (c) larger 
M led to increased Rt and decreased Rn, and (d) n did not affect 
Rt or Rn. The implications of the results for theories of partial 


reinforcement were discussed. 


Low percentages of reinforcement 
during acquisition generally lead to 
greater resistance to extinction than 
high percentages (or ratios) of rein- 
forcement (Lewis, 1960). Although 
the majority of reported investigations 
of this partial-reinforcement effect 
(PRE) have considered ratio of rein- 
forcement as the critical parameter, the 
specification of a particular ratio of 
reinforcement specifies neither the 
actual numbers of reinforcements or 
nonreinforcements, nor their distribu- 
tions. 

Actually, the number of nonrein- 
forcements (N) experienced by an 
organism appears to be a better pre- 
dictor of the PRE than the number of 
reinforcements (R) or the ratio of re- 
inforcement (P), (Notterman, 1951; 


1 This report is part of a dissertation sub- 
mitted to the Johns Hopkins University in 
conformity with the requirements for the 
degree of Doctor of Philosophy. The author 
is indebted to Stewart H. Hulse, the thesis 
adviser, for his most helpful counsel. The 
research was supported in part by National 
Science Foundation Research Grant GB- 
2406 to Stewart H. Hulse. 

? Now at Dartmouth College. 
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Uhl & Young, 1967). In particular, 
for a given number of nonreinforce- 
ments, the PRE appears to be a mono- 
tonic function of the number of suc- 
cessively occurring nonreinforcements 
without an intervening reinforcement 
(n), (Capaldi, 1964; Gonzalez & Bit- 
terman, 1964). Capaldi (1966) has 
advanced a sequential effects hypothe- 
sis to account for the PRE, in which he 
proposes that resistance to extinction is 
a function of three factors: the size of 
n, the number of different sizes of n, 
and the number of occurrences of each 
size of n which the organism experi- 
ences. 

There is, however, a suggestion in the 
literature that the effect of partial rein- 
forcement is best predicted as a joint 
function of absolute amount of rein- 
forcement (V), the physical amount of 
reinforcement received on reinforced 
trials, and ratio of reinforcement, i.e. 
mean amount of reinforcement per re- 
sponse (M), (Logan, Beier, & Kin- 
caid, 1956; Yamaguchi, 1961). Un- 
fortunately, in nearly all partial 
reinforcement experiments M has been 
completely confounded with either P or 
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V. Yet with certain values of each 
variable, it is possible to examine the 
results of variations in M, V, and P 
separately. "Thus the effect of M may 
be separated from the effect of P and 
the effect of V, if each given value of 
M is produced by a combination of 
several values of V and P. For ex- 
ample, the situation where there are 
two units of reinforcement on 3 of the 
trials and the situation where there 
are four units of reinforcement on 4 
of the trials produces an average of one 
unit of reinforcement per trial. Since 
the same M (one unit) results from 
two different values of V (two and 
four units) and two different values of 
P (3 and 4), neither V nor P are com- 
pletely confounded with M. 

The present study employed the in- 
strumentally conditioned licking re- 
sponse in rats to examine the effect of 
absolute volume of reinforcement (V), 
ratio of reinforcement (P) produced 
by varying the total number of nonre- 
inforcements within a constant total 
number of responses, and number of 
successive — nonreinforcements (n). 
The particular values of V and P used 
in the experiment were chosen to pro- 
duce different mean amounts of rein- 
forcement (M) which were not com- 
pletely confounded with either V or P. 

Evidence in the literature also sug- 
gests that the strength-of-response 
measures which are used in- most 
studies of partial reinforcement—rate 
of response in acquisition and number 
of responses in extinction—obscure the 
actual changes occurring in the be- 
havior under observation. Pereboom 
and Crawford (1958), using alley run- 
ning, and Bindra (1963), using bar 
pressing, found that changes in speed 
of response did not result from changes 
in speed when the animal was actually 
making the response, but resulted from 
changes in the amount of activity the 


animal engaged in which was not di- 
rectly part of the behavior leading to 
reinforcement. To test for similar 
effects with the instrumental licking 
response, the present study measured 
the effects of the experimental varia- 
bles on a number of components of 
licking behavior, such as duration of 
licks and duration of interresponse 
times, which were presumed to con- 
tribute to average rate of response in 
acquisition and number of responses 
in extinction. 


METHOD 


Subjects—The Ss were 84 female albino 
rats of the Sprague-Dawley strain, obtained 
from Sprague-Dawley, Madison, Wisconsin. 
The Ss were 80 days old when received and 
had no prior experience drinking from a 
tube. 

Apparatus—The apparatus consisted of a 
46 X 42 X 38-cm. box with Plexiglas sides 
and top and a stainless steel rod floor. At 
one end of the box a 1X3-cm. vertical 
slot for the drinking tube was centered in 
the wall 7 cm. from the floor. The box 
and drinking tube were placed in a larger 
insulated chest where a 7.5-w. bulb pro- 
vided illumination and a speaker provided a 
masking level of white noise, 

The drinking tube consisted of a 1.25-cm. 
diameter Plexiglas cylinder with a .2-cm, 
fluid duct through its center. Contact of S's 
tongue with the tube face was detected by 
a small flush brass contact slightly below 
the fluid duct opening and connected to a 
transistor relay. When a lick which was 
to be reinforced was detected by the relay, 
an infusion pump was operated which forced 
a measured drop of fluid to the tip of the 
drinking tube just as S's tongue passed the 
fluid duct (Hulse, 1960). On nonreinforced 
licks the fluid remained retracted within the 
tube. 

An electronic system operated all appa- 
ratus. The data were stored on clocks, 
counters, and a moving pen recorder. 

Design.—The experiment was carried out 
with three separate but complete replications 
of the basic design with two Ss randomly as- 
signed to each group in each replication ; 
Table 1 summarizes the basic design. Part 
of the design was a factorial combination 
of two absolute volumes of reinforcement, 
-0033 and .0067 cc per reinforced response 
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TABLE 1 
EXPERIMENTAL DESIGN 
Ratio of Absolute Mean Amount | Number of 
Groupitdhe Reinforcement Rewroreenents Reinforcement" Nonrelnforee- 
ments 
P1/1-V6 1/1 .0067 .0067 0 
P1/1-V3 1/1 .0033 .0033 0 
P1/2-V6-n8 1/2 .0067 .0033 8 
P1/2-V6-n16 1/2 .0067 .0033 16 
P1/2-V3-n8 1/2 .0033 .0017 8 
P1/2-V3-n16 1/2 .0033 .0017 16 
P1/4-V6-n8 1/4 .0067 .0017 8 
P1/4-V6-n16 1/4 .0067 .0017 16 
P1/4-V3-n8 1/4 .0033 .0008 8 
P1/4-V3-n16 1/4 .0033 .0008 16 
P1/8-V6-n8 1/8 .0067 .0008 8 
P1/8-V6-n16 1/8 .0067 .0008 16 
P1/8-V3-n8 1/8 .0033 .0004 8 
P1/8-V3-n16 1/8 .0033 .0004 16 


Note.—N = 6 in each group. 
* Expressed in cubic centimeters per lick. 


(labeled V3 and V6), three ratios of rein- 
forcement, 3, 4, and 3 (labeled P1/2, P1/4, 
and P1/8), and two numbers of successively 
occurring nonreinforcements, 8 and 16 (la- 
beled n8 and n16). Two additional groups 
were included in the design, one for each V, 
which received continuous reinforcement 
(labeled P1/1). All Ss made 1,024 re- 
sponses on all but the first 3 days of acquisi- 
tion. The blocks of n were arranged 
throughout the 1,024 responses so that num- 
ber of reinforcements between n varied ran- 
domly. Throughout acquisition each S ex- 
perienced only one V and after being shaped 
to its final schedule only one P and one n. 
The values of P and V used in the design 
were chosen to produce groups that received 
different mean amounts of reinforcement, 
where M was not confounded with either V 
or P (cf. Table 1). The resultant Ms were 
.0067, .0033, .0017, .0008, and .0004 cc per 
Mont (labeled M67, M33, M17, M8, and 

4). 

Procedure.—For 10 days after arrival from 
the supplier, Ss were maintained on ad-lib 
water (from cups) and Purina Lab Chow. 
During the next 10 days food and water 
intake of all Ss was limited until Ss reached 
80% of ad-lib weight. For the remainder 
. of the experiment each S was weighed every 

day and fed sufficient food mixed with 15 cc 
of water to maintain its weight at 80% of 
ad-lib weight. The Ss that received fluid 


in an experimental session were given a 
correspondingly smaller water ration. For 
the next 14 days after weight reduction Ss 
were freely handled and allowed to explore 
a large wooden box in groups of six for 20 
min. each day. 

Acquisition began the day after taming 
ended. Reinforcement consisted of the 
proper size drop of 1.5 gm./liter saccharin 
solution. On Days 1, 2, and 3 Ss were 
trained to drink from the Plexiglas tube 
placed behind the drinking slot and made 
512 continuously reinforced licks per day. 
Beginning on Day 4 Ss were shaped to 
their final schedule of reinforcement and for 
the remainder of the experiment made 1,024 
licks per day with the drinking tube .5 cm. 
behind the slot in the chamber wall. Shaping 
was accomplished by changing P and n each 
day until each S' reached the particular val- 
ues of P and n called for by the design. 
All Ss had reached their final schedule by 
Day 12 and continued on this schedule 
through Day 32. 

During extinction, which began on Day 33 
and lasted four days, conditions were the 
same as in acquisition with the exception that 
the pump was disconnected so that no licks 
were reinforced. Each S' remained in the 
experimental chamber for 5 min. which, with 
the exception of six Ss, was the longest 
time required in acquisition for Ss to emit 
their 1,024 licks. 
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Response measures included the mean and 
total duration and the total number of licks, 
time between licks (IRT), and interrup- 
tions in licking. Since a preliminary study 
suggested that with this apparatus IRTs 
<.5 sec. only occurred while Ss were con- 
tinuously licking and IRTs >.5 sec. only 
occurred when Ss stopped licking and left 
the drinking tube, all IRTs 2.5 sec. were 
recorded as interruptions in licking Two 
additional calculated response measures were 
burst licking rate (total number of licks/ 
total time in chamber less total duration of 
interruptions) and average licking rate (total 
numbers of licks/total time in chamber). 


RESULTS 


Acquisition.—The data in Fig. 1 in- 
dicate that for P closer to 1/1, average 
rate of responding is higher. The 
effect of P causes greater differences in 
performance for Cond. V3 than for 
Cond. V6. Other data showed that n 
did not affect performance. 

Since an overall analysis of variance 
of the data for the last four days re- 
vealed no significant interactions with 
replications and no significant main 
effect of days or interactions with days, 
the data were pooled across replications 
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Fic. 1. Average licking rate in acquisi- 
tion as a function of ratio of reinforcement 
and absolute volume of reinforcement. (Data 
points are means of the last four days of 
acquisition.) 


and days. An analysis of variance of 
the data from the partially reinforced 
groups (all groups except P1/1) re- 
vealed no significant main effects or 
interactions for P, V, or n. An analy- 
sis of variance of the data pooled across 
n and including the P1/1 groups 
showed significant effects for P and 
Px V, F (3, 76) = 4.54, p < 01, F 
(3, 76) = 21.15, p< .001, respec- 
tively. The use of F tests on the dif- 
ferences between group means showed 
that at Cond. V3 the P1/1 group dif- 
fered significantly from all partial 
groups, for P1/1 — P1/2, F (1, 76) = 
13.73, p < .001, and among the partial 
groups the P1/8 group differed from 
the others, for P1/4 — P1/8, F (1, 76) 
=7.70, p<.01. At Cond. V6 the 
P1/1 group differed significantly from 
the P1/4 and P1/8 groups, for P1/1 
— P1/4, F (1, 76) = 420, p « .05. 

The left side of Fig. 2 illustrates that 
average licking rate in acquisition was 
an increasing function of M. An 
analysis of variance of the data grouped 
according to mean amount (cf. Table 
1) revealed a significant main effect 
for M, F (4, 79) = 13888, p < .001. 
Comparisons of individual means 
showed that Group M4 differed sig- 
nificantly from all others, for M4—M8, 
F (1, 79) 213.08, p<.001, and 
Groups M33 and M67 differed from all 
others, for M17-M33, F (1, 79) = 
6.55, p < .05. 

Examination of the data also showed 
that licking occurred as a series of 
bursts (continuous licking) and inter- 
ruptions (pauses between bursts), 
and that average rate of licking was 
wholly a function of the total duration 
of interruptions.* Analyses of variance 
revealed that burst licking rate did not 
vary significantly across treatments, 

3A complete analysis of all measured 


components of the licking behavior appears 
in Jernstedt (1967). 
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tior and mean number of licks per day in 
extinction as a function of acquisition mean 
volume of reinforcement. (The acquisition 
data points are means across the last four 
days of acquisition. The scale of average 
licking rate is also valid for the extinction 
data points.) 


but both average licking rate and total 
duration of interruptions did, F (13, 
70) = 5.69, p < .001, F (13, 70) — 
4.36, p « .001, respectively. The cor- 
relation coefficient between average 
licking rate and total duration of inter- 
ruptions was r= —.96. 

Extinction.—The data in Fig. 3 in- 
dicate that P closer to 1/1 and larger 
V in acquisition led to fewer responses 
in extinction. Other data showed that 
n in acquisition did not affect extinction 
performance. 

Since an overall analysis of variance 
found no significant interactions with 
replications, the data were pooled 
across replications. In addition, an 
analysis of variance showed no signif- 
icant main effect or interactions for n 
among partially reinforced groups. An 
analysis of variance of the data pooled 
across n and including the P1/1 groups 
revealed significant effects for P, V, 
Days (D), and P x D, where F (3, 
76) = 49.33, p < 001, F (1, 76) = 
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22.62, p < .001, F (3, 228) = 88.70, 
p < 001, F (9, 228) = 4.96, p < .001, 
respectively. Comparisons of individ- 
ual means showed that on Day 1 the 
P1/1 groups differed significantly from 
the partial groups, for P1/1 — P1/2, 
F (1, 76) = 5.50, p < .05, and among 
partial groups the P1/8 groups differed 
from the rest, for P1/4 — P1/8, F (1, 
76) = 35.67, p < .001. On all other 
days the only significant differences 
were between the P1/8 groups and all 
others, for P1/4 — P1/8, F (1, 76) — 
2781, p < .001, F (1, 76) = 14.58, 
p<.001, F (1, 76) 24.09, p < .05, 
respectively. 

As indicated in Fig. 2 Ss receiving 
smaller M in acquisition made more 
licks in extinction. An analysis of 
variance of the data grouped by M 
revealed significant effects for M, D, 
and M x D, F (4, 79) = 26.14, p< 
.001, F (3, 273) = 12263, p < .001, 
and F (12, 273) — 7.01, p «.001, re- 
spectively. Comparisons of individual 
means showed that on Days 1, 2, and 3, 
Group M4 differed significantly from 
Group M8, F (1, 76) = 1645, p< 
001, F (1, 76) — 7.51, p < 0l, F (1, 
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Fic. 3. Mean number of licks on Day 1 
of extinction as a function of ratio of rein- 
forcement and absolute volume of reinforce- 
ment during acquisition. 
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76) = 5.60, p < .05, respectively, and 
on Days 1 and 2, Group M8 differed 
from Group M17, F (1, 76) —4.37, 
p < .05, F (1, 76) = 574, p < .05, re- 
spectively. The correlation coefficient 
between average licking rate on the last 
day of acquisition and number of licks 
on Day 1 of extinction was r = —.95. 

As in acquisition, licking in extinc- 
tion occurred as a series of bursts and 
interruptions. Analyses of variance 
showed that burst licking rate was not 
significantly affected by treatments, but 
both total number of responses and 
total duration of interruptions were sig- 
nificantly affected, F (13, 70), = 8.93, 
p < 001, F (13, 70) = 999, p < .001, 
respectively. Thus total number of 
licks in extinction could be predicted 
entirely from total duration of inter- 
ruptions. On Day 1 the correlation 
coefficient between total number of 
licks and total duration of interruptions 
was r= —.99, 


Discussion 


The major findings of this study were 
that: (a) ratios of reinforcement closer 
to 1/1 in acquisition led to higher average 
rates of response in acquisition (Rt) and 
lower resistance to extinction (Rn); (b) 
increased absolute volume of reinforce- 
ment in acquisition did not significantly 
affect Rt but lowered Rn; (c) changes in 
the number of successive nonreinforce- 
ments did not affect Rt or Rn; and (d) 
increased mean volume of reinforcement 
in acquisition increased Rt and reduced 
Rn. Furthermore acquisition and extinc- 
tion performance could be predicted en- 
tirely from the time Ss spent not licking. 

These results pose problems for many 
of the active theories of partial reinforce- 
ment. Cognitive dissonance theory (Law- 
rence & Festinger, 1962) and conditioned 
frustration theory (Amsel, 1958, 1962), 
applied to the present situation, would 
predict that the P1/1-V3 animals would 
exhibit greater Rn than the P1/1-V6 ani- 
mals. However, both theories would also 
predict, contrary to the obtained results, 
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that all groups with P greater than 1/1 
would show greater Rn after V6 than 
after V3. 

The fact that number of successive non- 
reinforcements did not affect the PRE is 
contrary to the predictions of Capaldi’s 
(1966) sequential effects hypothesis. Since 
there was no significant variation in re- 
sponse rate over the last four days of 
acquisition, it may be assumed that stable, 
asymptotic levels of behavior had been 
reached at the end of acquisition, Given 
this condition, the sequential effects hy- 
pothesis predicts that the nl6 groups 
should exhibit a greater PRE than the 
n8 groups. Moreover, since the partially 
reinforced groups should not differ across 
the P condition, groups which experi- 
enced P values close to 1/1 and high n 
values (such as P1/4-n16) should show a 
greater PRE than groups which experi- 
enced P values farther from 1/1 and low 
n values (such as P1/8-n8). No com- 
parisons among experimental groups ex- 
hibited this effect. 

The inability of the cognitive disso- 
nance, conditioned frustration, and sequen- 
tial effects theories to predict these data 
may be due to a common characteristic. 
All three theories stress specific events 
which happen on specific trials or series 
of trials. But these results appear to be 
most efficiently described as functions of 
average events across trials, Two the- 
ories which consequently fare better in 
predicting these results are the pooling 
model of adaptation level theory, which 
does stress the average of events across 
trials, and discrimination theory, which 
may be extended to stress the average of 
events, 

The pooling model of adaptation level 
theory (Bevan, 1963), while it does not 
account for the acquisition results, clearly 
predicts all of the extinction results. Ac- 
cording to the pooling model, performance 
is in part a function of the apparent mag- 
nitude of reinforcement, that is, a function 
of the difference between an internal aver- 
age of past reinforcements (internal 
norm) and the absolute magnitude of the 
present reinforcement. Thus during ex- 
tinction the apparent magnitude of rein- 
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forcement is a negative quantity equal to 
the difference between the internal norm 
resulting from acquisition and the zero 
amount of reinforcement present on ex- 
tinction trials. Performance deteriorates 
more rapidly in extinction for organisms 
that experience a larger negative apparent 
magnitude of reinforcement. The pooling 
model therefore predicts that Rn is a 
decreasing function of V and P (as P 
approaches 1/1) and independent of n, or 
that Rn is most clearly predicted as a 
function of mean amount of reinforcement. 
All of these predictions are clearly con- 
firmed by the results of this study. 

Similarly, discrimination theory (Lewis, 
1960) may be interpreted to predict the 
PRE as a function of the average of 
events in acquisition. Any condition 
which results in smaller mean amounts of 
reinforcement in acquisition should make 
it more difficult to discriminate the condi- 
tions of extinction from those of acquisi- 
tion. Either increasing mean amount or 
absolute amount of reinforcement, or let- 
ting ratios of reinforcement approach 1/1 
in acquisition should lead to decreased re- 
sistance to extinction, as these data con- 
firmed. Unlike the pooling model, dis- 
crimination theory would also predict in- 
creased resistance to extinction from 
larger values of n, which was not con- 
firmed here. 

There are two unique aspects of licking 
behavior which might explain why these 
data are best predicted by theories stress- 
ing average effects of variables. First, 
Hulse (1967) has presented evidence to 
suggest that licking behavior has both 
elicited and emitted components. Hulse 
suggests that behavior within a burst of 
licks is elicited (and thus reflexive) and 
that behavior between bursts, such as ap- 
proaches to the drinking tube and inter- 
ruptions in licking, is emitted. If this 
suggestion is correct, then one would ex- 
pect the interruptions in licking and there- 
fore Rt and Rn to be affected by the aver- 
age of conditions present during bursts of 
licking, and to be relatively unaffected 
by a particular condition present for spe- 
cific individual licks. This suggestion 
should not limit the general relevance of 
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these data since naive infant rats in a 
more natural environment also lick in a 
series of bursts and interruptions, ( Schaef- 
fer & Premack, 1961), and since the re- 
ported response measures in both acquisi- 
tion and extinction are nearly perfectly 
correlated with interruptions in licking, 
the instrumental component of the be- 
havior. 

Second, licking behavior is different in 
that, of the more commonly studied re- 
sponses, neither alley runs nor bar presses 
occur at the rate of response of licking. 
If the number of successively occurring 
nonreinforcements is in some way regis- 
tered within an organism as, for example, 
a function of the duration of time between 
reinforcements, then one would expect 
that at higher rates of response longer 
lengths of n would be required to equal 
the effect of shorter n at lower rates of 
response, At low response rates an organ- 
ism may attend to and record each rein- 
forcement and nonreinforcement as an 
independent event, whereas at higher re- 
sponse rates the organism may only be 
able to attend to the average conditions of 
reinforcement. Again, this should not 
reduce the general relevance of these data 
since some instrumental responses, such 
as the key peck in pigeons, do occur at 
rates comparable to those of licking. More 
importantly, in discrete trial arrangements 
many commonly used responses can occur 
at widely different response rates and 
thus may be subject to an effect of rate of 
response on such parameters as absolute 
amount of reinforcement and number of 
successive nonreinforcements. 
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USE OF INSTRUCTIONS AND HYPNOSIS TO 
MINIMIZE ANCHOR EFFECTS ' 


B. JACK WHITE, RICHARD D. ALTER, MARK E. SNOW, 
AND D. EUGENE THORNE 


University of Utah 


In an effort to determine whether Ss could successfully ignore a 
remote anchor Ss judged a series of 6 weights during a training ses- 
sion, then judged the same series in the presence of the anchor which 
appeared on alternate lifts, Ss judged the anchor but were instructed 
to “ignore the heaviest weight, [the anchor]; ie, don't let it 
affect your judgments of the other weights.” Compared to a con- 
trol group, only 2 of the 4 groups which were instructed to ignore 
the anchor were successful; viz, a group which was hypnotized 
before being given the instructions and a group which was given a 
2-hr. lecture on anchor effects before the instructions. The un- 
successful groups included a group of hypnotically susceptible Ss who 
were not hypnotized and a group of unselected Ss, both of which were 


simply instructed to ignore the anchor. 


The growing body of evidence indi- 
cates that the effects of supraliminal 
anchors on judgment are heavily deter- 
mined by conceptual functioning in the 
individual. For example, a number of 
studies have indicated that the maxi- 
mum effect of an anchor occurs when S 
defines the anchor as relevant to the 
stimulus series being judged; other- 
wise, it may have little or no effect 
(Bevan & Pritchard, 1963, 1964; 
Brown, 1953; Gumenik & Weiss, 1967 ; 
Tresselt, 1965). Rogers (1941), in 
his classic study of the effects of an- 
chors on absolute judgments of a series 
of weights, noted that one of his Os did 
not show the expected contrast effects 
for the five most remote of the eight 
anchors which were presented to him. 
According to Rogers (1941, p. 31), 
O “reported that he could no longer 
judge the stimuli in relation to the 
‘SIX’ defined by the anchoring stim- 
ulus, and was judging them ‘against 
each other’.” Thus, it seems reason- 


1 Portions of this paper were presented 
at the Rocky Mountain Psychological Asso- 
ciation meetings, May 1967. 


able to infer that when the category 
assigned to the anchor was grossly in- 
appropriate in the eyes of O, he chose 
to ignore the anchor. Inasmuch as this 
O was a graduate student in psychology 
and had had experience in a study of 
judgment of lifted weights before he 
served in Rogers’ experiment, this out- 
come is not surprising, especially in 
view of Guilford and Park’s (1931), 
and Engen and Tulunay’s (1957) ob- 
servations that the judgments of sophis- 
ticated Os (psychologists) are less af- 
fected by anchor stimuli and context, 
respectively, than the judgments of 
naive Os. Engen and Tulunay’s re- 
sults led them to propose that it might 
be possible, by providing experience 
with sources of bias in judgments, to 
calibrate Os to avoid biases such as 
those produced by interpolated anchors, 
context, etc. 

Because there have been, to the 
writers’ knowledge, no systematic in- 
vestigations of Ss’ ability to deliberately 
ignore remote anchors, the intent of 
the present study was to calibrate Ss 
to do this. The design of the study 
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provided comparisons between a con- 
trol group which received no anchor 
and five experimental groups which 
were expected to vary in the extent to 
which they were able to avoid the ef- 
fects of a remote anchor. If Engen and 
Tulunay (1957) were correct in their 
speculations about the possibility of 
calibrating Ss, one would expect that a 
lecture which provides a thorough 
knowledge of the effects of remote an- 
chors would enable Ss to avoid the ef- 
fects if, prior to judging a series of 
lifted weights, they were requested to 
ignore the interpolated anchor. Since 
hypnotized Ss appear to be able to 
ignore stimuli on command, it seems 
reasonable to expect that they, too, 
would successfully ignore the anchor 
when instructed to do so, even when 
given no information about anchor ef- 
fects. Thus, neither the lectured group 
(Group 4) nor the hypnotized group 
(Group 5) was expected to differ from 
the control group (Group 6). 

The other three groups were ex- 
pected to differ from the control group. 
Group 1 was not instructed to ignore 
the anchor; thus, it was expected to 
show the usual contrast effects which 
are produced by remote anchors. If 
there is validity in Engen and Tulu- 
nay's (1957) implicit assumption that 
sophisticated Ss would avoid biases in 
judgment by compensating for them, it 
follows that Ss in Group 2 who were 
instructed to ignore the anchor but 
given no information about anchor ef- 
fects would show contrast since they 
would have no means of compensating 
for unknown effects. Finally, one 
could formulate contradictory hypoth- 
eses concerning the performance of 
hypnotically susceptible Ss (Group 3) 
who were simply instructed to ignore 
the anchor but were not hypnotized. 
On the one hand, there is reason to 
debate whether it is susceptibility to 


hypnosis or hypnosis itself which dif- 
ferentiates the performance of the hyp- 
notized S from that of the unsusceptible 
S (eg. see London, Conant, & Davi- 
son, 1966), but if susceptibility is the 
crucial variable, in the present study 
one might expect waking susceptible 
Ss to be as successful in ignoring the 
anchor as the hypnotized Ss. On the 
other hand, if the rationale concerning 
Ss' inability to compensate for un- 
known effects is correct, one would 
expect that the waking susceptible Ss 
in Group 3 would show contrast effects 
and, thus, differ from the control group. 

One could also formulate an hypoth- 
esis concerning the relative magnitudes 
of contrast effects which would occur 
in the various groups; viz. that the 
rank order of the magnitude of contrast 
would range from most to least, re- 
spectively, in Groups 1-6. Obviously, 
Group 1 should show the most contrast 
since it was not instructed to ignore 
the anchor, and Group 6 should show 
the least since it was not exposed to an 
anchor. If one assumes that measures 
of susceptibility to hypnosis are largely 
measures of willingness to follow in- 
structions, Group 3 should follow the 
same instructions as those given to 
Group 2 somewhat better than the ran- 
domly selected Ss in Group 2, but not 
as well as Ss who have a thorough 
understanding of contrast effects, or 
who are hypnotized. Finally, if hyp- 
nosis makes it possible for S to ignore 
stimuli, Group 5 should be more suc- 
cessful than Group 4 which, presum- 
ably, would attempt to compensate for 
the anchor rather than ignore it. 

In summary, it was anticipated that 
Groups 1, 2, and 3 would differ from 
Group 6, that Groups 4 and 5 would 
not differ from 6, and that the magni- 
tude of contrast effects would be rank 
ordered from most to least in Groups 
1-6, respectively. 


MINIMIZING ANCHORING BY HYPNOSIS 


METHOD 


Subjects.—All of the groups except 
Group 4 were composed of students who 
were enrolled in introductory psychology 
classes at the University of Utah. Group 
4 was obtained from a laboratory section 
of a beginning course in experimental psy- 
chology which is usually taken immediately 
after introductory. psychology. The seven 
Ss in each group were given extra credit 
toward their course grades, with the ex- 
ception of Ss in Group 4 for whom the ex- 
periment was integrated into the laboratory 
schedule. The Ss in Groups 3 and 5 were 
chosen for their susceptibility to hypnosis 
as indicated by a score of nine or above on 
Form A of the Harvard Group Scale of 
Hypnotic Susceptibility. The number of 
women in each group of seven Ss was, re- 
spectively, for Groups 1-6; 1, 3, 3, 3, 2, 3. 

Apparatus —The stimuli which were used 
in the training session were the same as 
those used by Rogers (1941); specifically, 
the weights were of 55, 75, 93, 109, 125, and 
141 gm. (hereafter referred to as stimuli 
1-6, respectively, or as the original series). 
The anchor weighed 219 gm. 

Since a complete description of the appa- 
ratus which presented the stimuli is avail- 
able elsewhere (White & Larimer, 1963), 
only a brief account of its functions will be 
given. The basic unit of the apparatus con- 
sists of a large motor-driven platform which 
revolved within a circular enclosure. As 
many as seven Ss who were seated around 
the circumference of the apparatus put their 
left hands into the enclosure and, at 8-sec. 
intervals, directly lifted one of the 60 
weights which were arranged along the 
edge of the platform. The apparatus timed 
the stopping and starting of the rotation of 
the platform and initiated and terminated a 
l-min. rest interval after each series of 50 
judgments. 

Procedure in the training session—The 
Ss were told that they were to judge a series 
of weights using a response scale of 1-6. 
The lightest weight was to be called 1, the 
next lightest 2, and so on to the heaviest 
weight, which was to be called 6. They 
were instructed to lift with their left hand, 
using a uniform wrist motion, and to re- 
cord their judgments on their answer sheets 
with their right hand. Following a demon- 
Stration of the operation of the apparatus 
and 20 practice trials, Ss made 120 judg- 
ments of the series (20 judgments of each 
weight). The stimuli were ordered within 
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blocks of six in such a way that each 
weight was preceded by each other weight 
the same number of times; no weight ap- 
peared twice in succession. 

A week before, and again 24 hr. before 
the training session, Ss in Group 5 were 
given an hour of hypnotic depth training 
and a posthypnotic suggestion which would 
facilitate rapid trance induction at the time 
of the training session. They were, of 
course, hynotized before beginning their 
training session, a procedure which, accord- 
ing to Bevan and Pritchard (1964), has no 
effect on indifference points (IPs) for lifted 
weights. At the end of the session the hyp- 
notic trance was removed, 

Procedure in the anchoring session—At 
the conclusion of the training session, Ss 
left the room for 10 min. during which the 
weights were rearranged in the apparatus 
such that the anchor appeared on alternate 
trials and weights in the original series, 
randomized within blocks of six, appeared 
on the remaining trials. Following the 
instructions mentioned below, 240 judgments 
were obtained from each S in the experi- 
mental groups (20 judgments of each of 
the original series and 120 judgments of 
the anchor). 

Groups 1-5 were instructed to consider the 
heavy weight (remote anchor) which would 
appear on alternate trials as belonging to 
Category 12, to record a judgment of 12 
for it after lifting, and to continue using 
categories 1-6 for their judgments of the 
other weights. Group 1 received no instruc- 
tions beyond these. Groups 2 and 3 were 
told, in addition, “ignore the heaviest 
weight [the anchor]; that is, don’t let it 
affect your judgments of the other weights." 
Group 4 was given a detailed 2-hr. lecture 
on contrast effects in judgment, then this 
“sophisticated” group was brought to the 
experimental room and given the same in- 
structions as Groups 2 and 3. Group 5 
was hypnotized again before beginning its 
anchoring session and was then given the 
same instructions as Groups 2 and 3. Fi- 
nally, the sixth group served as a control 
inasmuch as they did not receive an anchor 
in their equivalent of the anchoring session; 
instead, the training session instructions 
were repeated at the start of their second 
session. 


RESULTS 


Since anchors exert maximum effect 
on judgments of stimuli which are 
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adjacent to the anchor (Magaro, 1966), Stimulus 6 in the training and anchor- 
difference scores which reflect differ- ing sessions were used as one measure 
ences between mean judgments of of anchor effects. A one-way analysis 
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of variance on the difference scores in- 
dicated that the treatment effect was 
significant, F (5, 36) = 4.04, p < .01. 
Dunnett's ¢ tests indicated, as expected, 
that neither the hypnotized group nor 
the group which received the lecture 
showed more change than the control 
group, ¢ (36) = 165, k=6, p> .05 
for each of the two comparisons. Also 
in line with expectations, Groups 1 and 
2 did show significantly more change 
(contrast effect) than the control 
group, t (36) = 4.04, k = 6, p < .005; 
t (36) = 3.34, k = 6, p < .005, respec- 
tively. Contrary to expectations, 
Group 3 did not differ from the con- 
trol group, although the difference did 
closely approach significance, t (36) 
= 2.4, k = 6, p>.05. A series of t 
tests to determine whether there were 
significant changes within groups indi- 
cated that there was no significant 
change in the control group, ¢ (6) = 
-18, p > .80, or in the hypnotized group, 
t (6) = 1.57, p > .10; however, there 
was significant contrast in the group 
which received the lecture, £ (6) = 
3.17, p < .02, in Group 1, t (6) = 3.94, 
p < 01, Group 2, t (6) = 432, p< 
-01, and in Group 3, t (6) = 293, p < 
05. Changes in the frequency of use 
of response Category 6, a measure 
Which, of course, is highly related to 
the one just discussed, showed pre- 
cisely the same pattern of results as 
those thus far reported, except that 
Group 3 differed from the control 
group, t (36) = 2.95, k = 6, p < .025, 
in this analysis. 

An overview of the results is pre- 
sented in Fig. 1. Since there appears 
to be an inconsistency between the re- 
sults of the statistical tests on magni- 
tude of change within Groups 4 and 5 
and the data presented in the figures 
for these groups, a few words of ex- 
planation are in order. As it is ap- 
parent in the figures, the magnitude of 
change in mean judgments of Stimulus 
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6 in Groups 4 and 5 was identical (viz. 
—.34); the fact that the change in 
Group 4 was significant but that the 
same magnitude of change in Group 5 
was not significant can be accounted 
for by the greater variance in the 
change scores for Group 5. On the 
basis of the figures, one would be led 
to conclude that, if anything, Group 5 
showed greater overall change than 
Group 4. In view of the foregoing, and 
the fact that neither group differed from 
the control group, the writers are in- 
clined to minimize the importance of 
the fact that Group 4 showed significant 
contrast and Group 5 did not. 

One of the hypotheses was concerned 
with the rank order of the magnitude 
of change in the six groups. The mean 
changes in mean judgments of Stim- 
ulus 6 (anchoring session minus train- 
ing session) for Groups 1-6 were, re- 
spectively: —.94, —.76, —49, —.34, 
—.34, and .09. Comparable scores for 
mean changes in the frequency of use 
of Category 6 were: —15.86, —20.43, 
—15.14, —7.29, —4.29, and 1.14. Thus, 
the rank order of the magnitude of con- 
trast effects produced in the various 
groups approximates that which was 
expected rather closely, but not per- 
fectly. The correlation between the ob- 
served rank order of mean changes in 
mean judgments of Stimulus 6 and the 
expected rank order was high (rho — 
.99, p < .01) as was the correlation be- 
tween the observed rank order of mean 
changes in the frequency of use of 
Category 6 and the expected rank order 
(rho = .94, p « .01). 


Discussion 


A recent study by Clark (1966) in- 
dicated that while instructions do not af- 
fect Ss’ sensitivity thresholds, they can be 
used to manipulate response biases. 
More specifically, he found that instruc- 
tions which produced “a facilitating 
set, in contrast to an inhibiting set, in- 
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creased the proportion of flicker responses 
to both a physically intermittent light . . . 
and to a continuous light . . ." [p. 358]. 
The results of the present study indicate, 
in accordance with Engen and Tulunay's 
(1957) expectations, that it is also 
possible, through instruction, to calibrate 
Ss such that they are not subject to cer- 
tain biases in judgment. It is interesting 
to note that this was done as effectively 
with a few words to the hypnotized group 
as with the 2-hr. lecture to the normal 
waking group. In view of the earlier 
mentioned controversy concerning the ef- 
fects of susceptibility to hypnosis vs. hyp- 
nosis itself, it is also of interest to note 
that the waking susceptible Ss did show 
contrast on two measures and did differ 
from the control group on one, whereas 
the hypnotized group did not show con- 
trast and did not differ from the control 
group on either of the two measures. 

One might raise the question whether 
Ss were really ignoring the anchor or 
whether they were simply compensating 
for what they knew to be the effects of the 
anchor. If the common assumption that 
hypnotized Ss can tune out sensory inputs 
is true, then it seems reasonable to con- 
clude that the hypnotized group in the 
present study was ignoring the anchor. 
It seems unlikely that they compensated 
for the effects of the anchor since they 
had no more information about them than 
Group 2 or Group 3, yet they showed no 
contrast while Groups 2 and 3 did. 

It is difficult to determine whether the 
lectured group was attempting to ignore 
the anchor or attempting to compensate 
for its effects. Some light is shed on this 
question by observations of the writers 
which were obtained before the data for 
the experiment were analyzed. The 
writers, who might be considered as 
roughly comparable to the lectured group, 
conducted the experiment. on themselves 
in an effort to determine what transpires, 
psychologically, when one is trying to 
ignore an anchor. Introspective accounts 
indicated that each was trying to avoid 
contrast effects but also trying to avoid 
ovércompensating for the effects which 
were expected. Thus, it was not possible 
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for the writers to really ignore the anchor 
and it seems unlikely that the lectured 
group was any more successful in doing 
so. This is not meant to deny the fact, as 
everyday observations tell us, that it is 
possible to tune out a sensory input while 
in the waking state; rather, it is meant to 
imply that when an individual is in- 
structed to avoid specified effects of a 
stimulus, he is more likely to do so by 
compensating for the effects, if he knows 
what they are, than by ignoring the 
stimulus. 

In conclusion, it seems appropriate to 
discuss the apparent inconsistency between 
the results of the present study and those 
recently reported by Gumenik and Weiss 
(1967). Using an interpolated weight 
(200 gm.) between the presentation of a 
standard weight (100 gm.) and one of five 
comparison weights (80, 90, 100, 110, or 
120 gm.), they found that Ss who were 
instructed to judge the comparison weight 
against both the standard and the inter- 
polated weight (i.e. Ss for whom the in- 
terpolated weight was "relevant") showed 
essentially the same mean point of sub- 
jective equality (PSE) as did a "neutral" 
group which lifted the interpolated weight 
but was only instructed to judge the com- 
parison weight against the standard. A 
nonrelevant group which was told "don't 
judge it" before each presentation of the 
interpolated weight yielded a mean PSE 
comparable to that of a control group 
which was not exposed to the interpolated 
weight; moreover, both of these groups 
evidenced significantly lower mean PSEs 
than the first two groups mentioned above. 
Gumenik and Weiss (1967, p. 118) con- 
cluded that, "it appears that explicit non- 
relevance completely removes the effect 
of the interpolated weight since PSEs ob- 
tained under the non-relevant condition 
did not differ from those obtained under 
the control condition." 

The fact that Gumenik and Weiss' 
(1967) nonrelevant group did not differ 
from the control group and the fact that 
in this study Group 2, which was simply 
instructed to ignore the anchor, did differ 
from the control group poses a contradic- 
tion worthy of comment. The procedural 
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differences between their study and the 
present one are many but the explanation 
may, in part, lie in the fact that their Ss 
were repeatedly instructed "don't judge 
it" when the interpolated weight was pre- 
sented and Ss in the present study were 
only instructed at the outset of the 
anchoring session to, "ignore the heaviest 
weight; that is, don't let it affect your 
judgments of the other weights." Since 
Gumenik and Weiss repeatedly reminded 
Ss to ignore the interpolated weight, 
whereas Ss in this study were not so 
reminded, this difference may account for 
the differences between the results of the 
two studies, especially, when this is 
coupled with the fact that their Ss did 
not judge the anchor and the present Ss 
did. Gumenik and Weiss found no differ- 
ence between the effects of judged (rele- 
vant) and nonjudged (neutral) interpo- 
lated weights; however, they were using 
the method of comparative judgment, 
whereas studies using the method used in 
the present study (eg. Brown, 1953; 
Magaro, 1966) have found that judged 
anchors produce larger effects than non- 
judged anchors. 
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96 rats learned and later relearned a 3-choice visual discrimination. 
Variation in interpolated learning (IL) included whether Ss received 
only extinction of the original choice, only acquisition of a new choice, 
or both. These 3 conditions were combined with factorial combinations 
of the interval between IL and relearning (RL) and the number of 
IL choices learned. The major finding was that large and significant 
recovery between immediate and 48-hr. tests occurred for Ss given 
counterconditioning during IL. “Pure” extinction procedures were 
relatively ineffective in reducing OL choices, as compared with larger 
effects of "pure" competition from a new habit or the still larger 
effects of combining both procedures in counterconditioning. The 
recovery findings tend to remove certain formal objections to inter- 
ference theories of extinction and forgetting. 


Counterconditioning is a procedure 
where extinction of a learned response 
is accompanied by acquisition of some 
new and incompatible response. The 
generalization has occasionally been 
made that unlike "regular" extinction 
procedures, counterconditioning is not 
followed by spontaneous recovery of 
the original response (Hall, 1966; Hil- 
gard & Marquis, 1940). Since the in- 
terference theory of extinction (Liber- 
man, 1944, Miller & Stevenson, 1936; 
Razran, 1956) holds that countercon- 
ditioning is the prototype for all ex- 
tinction, empirical failures to get re- 
covery after deliberate countercondi- 
tioning would pose a serious theoretical 
problem. Indeed, both Hall (1966) 
and Hilgard and Marquis (1940) in- 
clude the failure of recovery as a prom- 
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inent argument against the interference 
position. 

The interference theory of extinction 
is not to be confused with the inter- 
ference theory of memory (Keppel, 
1968; Postman, 1961). However, it 
turns out that absence of recovery fol- 
lowing counterconditioning is just as 
inconvenient for the latter as for the 
former. This inconvenience is obvious 
when it is noted that (a) the prediction 
of proactive inhibition (PI) depends 
theoretically on the spontaneous re- 
covery of unlearned (extinguished) as- 
sociations following interpolated learn- 
ing, and (5) the relation of original to 
interpolated learning (A-B, A-C) fits 
the counterconditioning paradigm and 
not the simple extinction paradigm 
(Barnes & Underwood, 1959; Keppel, 
1968). 

The evidence that there is no recov- 
ery following counterconditioning is 
found in a single study by Kellogg and 
Wolf (1939). These investigators 
trained dogs to flex a forepaw in re- 
sponse to a buzzer so as to avoid shock. 
The Ss received (a) right-paw train- 
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ing, (b) right-paw extinction, (c) 
right-paw reconditioning, and (d) left- 
paw training (counterconditioning), in 
that order. All training was carried to 
a stringent criterion of 19 out of 20 
correct responses in a daily session of 
20 trials. After Step d in training, 
approximately four times as many rein- 
forced trials had occurred with right- 
paw flexion as with left; nevertheless, 
all Ss, when tested from 1-6 mo. later, 
showed a marked preference for left- 
paw CRs. The conclusion was that 
recency had overcome a strong fre- 
quency bias and that an animal will 
perform whatever response he last 
learned in a situation. Since the level 
of right-paw response in the last coun- 
terconditioning session was (neces- 
sarily) 596 or less, any increase over 
this value in the retention test would 
have been recovery. However, all but 
one S emitted no right-paw CRs at all. 

This experiment is not conclusive 
with regard to the generalization it 
spawned. From the Kellogg-Wolf 
(1939) data, it can be determined that 
the last training phase lasted between 
3-5 days at a minimum. Since such 
alternating sessions of extinction and 
rest are believed to reduce the amount 
of recovery obtained after the last ses- 
sion, it is not clear that recovery would 
have been expected following even 
regular extinction (which possibility 
was not tested) with their training 
schedules. Thus, one purpose of the 
present experiment was to determine 
whether recovery could be found fol- 
lowing counterconditioning of a visual 
discrimination. 

The second purpose of the present 
study was to test the hypothesis ( Post- 
man, 1965) that if a fiixed number 
of interpolated-learning (countercondi- 
tioning) trials are distributed over two 
tasks there will be more extinction of 
the original task than if only one inter- 
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polated task is used. This hypothesis 
assumes that an original association 
will tend to be elicited more to the 
extent that there is not a single, domi- 
nant competing association. While the 
proposition has been subjected to in- 
direct verification in verbal learning 
(Postman, 1965), properly chosen 
animal-learning situations would per- 
mit a direct test. 

Also impossible to test directly in 
human Ss is the relative importance of 
nonreinforcement and new learning in 
negative transfer paradigms. Attempts 
to impose conditions of "pure extinc- 
tion" in verbal learning (Richardson & 
Gropper, 1964) have led to improved 
performance, if anything. Nor is it 
possible to rule out the occurrence of 
nonreinforced old responses in new 
learning, even when procedures are 
undertaken to minimize such covert 
intrusions (Goggin, 1967). The de- 
sign of the present study, however, 
allowed the separate examination of 
extinction and competition processes in 
transfer by virtue of the fact that E 
could control the presence or absence 
of the relevant stimuli. 

It needs to be said that the present 
authors have no illusions about the 
direct applicability of animal studies 
to theories based on and designed for 
human verbal learning. Incompatibil- 
ities between findings in these two 
areas are inconvenient and nothing 
more. The hope is that behavioral 
laws may be found with generality in 
both areas, ultimately, and the present 
research has been designed to provoke 
inferences in this direction. 


METHOD 


Experimental design—Twelve groups of 
eight rats each learned to approach a striped 
door in a three-choice visual discrimination 
task and later relearned (RL) the same dis- 
crimination, after various conditions of inter- 
polated learning (IL) and delay. The de- 
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sign was a 3X2X2 factorial. The three 
major conditions of IL were: (a) Free 
groups where reinforcement was removed 
Írom the striped door and placed behind 
either black or white doors during IL, (b) 
Yoked-Extinction groups where only the 
originally correct striped stimulus was pres- 
ent during IL (ie, all three choices were 
striped) and where Ss were given forced 
extinction trials according to the schedule of 
errors made by matched Free Ss, and (c) 
Yoked-Competition groups where only the 
correct IL stimulus (black or white) was 
available during IL and Ss were given trials 
according to the schedules of correct IL re- 
sponses by matched Free Ss. For example, 
if a particular Free S chose the (formerly 
correct) striped stimulus on the first three 
IL trials, the (incorrect for IL) black stim- 
ulus on Trial 4, and the (correct for IL) 
white stimulus on Trial 5, the two partners 
of this S in Yoked groups would have been 
treated as follows: The Yoked Extinction 
S paired with this Free S' would have been 
given three trials with unbaited striped stim- 
uli in each goal box, then for Trials 4 and 5 
he would have been detained until such time 
as corresponded with another trial to the 
striped stimulus by his Free partner. The 

Yoked Competition $ paired with the same 

Free S would have been detained for a pe- 
riod corresponding to the first four trials, 
then, on Trial 5, he would have been given 
a trial where all three goal boxes were 
baited and colored white. The second factor 
was the number of IL tasks; half of the Free 
Ss had 60 IL trials all to either black or 
white doors while the other half had 30 
trials to one stimulus and 30 to the other. 
The distinction between the two levels of 
this variable with the Yoked groups de- 
pended on the individual matching of Ss. 
The Yoked-Extinction Ss, of course, never 
saw the black or white stimuli during IL; 
however, any effects the number of IL tasks 
had on error frequencies or distributions 
with Free Ss would have been reflected in 
the treatment of Yoked-Extinction groups. 
The Yoked-Competition groups followed the 
schedules of correct IL choices for Free Ss. 
The last factor was retention interval. Half 
of each group was tested for relearning 
(RL) immediately after IL and half was 
tested 48 hr. later. 

Apparatus.—The three-choice maze used is 
shown in Fig. 1. The goal boxes varied 
with respect to color and the type of flooring. 
The three types of goal boxes were: (a) 
white with wire mesh floor; (b) black with 
ribbed-rubber flooring; and (c) i-in. alter- 
nating black and white stripes with smooth 


R. G. CROWDER, M. COLE, AND R. BOUCHER 


paper flooring. Three goal boxes of each 
type were necessary to produce the appropri- 
ate yoked conditions. 

A microswitch operated by the opening of 
the start-box door started a Standard Elec- 
tric timer. Interruption of any one of the 
photocell beams located in the goal boxes 2 
in. in front of the food cups stopped the 
clock. The photocells and lights were 
mounted in the goal-box sleeves and the light 
beam shone through matching holes in the 
goal boxes. 

Procedure.—The Ss were put on a depriva- 
tion schedule of 12.5 gm. daily of Purina Lab 
Chow 1 wk. prior to the start of training. 
Water was available at all times. On the 
day prior to training, Ss received one half 
of the daily ration. 

Pretraining began when Ss were put on 
the deprivation schedule. For the first 4 
days, each S was handled for about 5 min. 


Fic. 1. Experimental apparatus. 


The goal boxes were removable. Attached 
to the base board at each position was a 
sleeve with a Plexiglas top to guide the goal 
box into place and enclose it. The guillotine 
doors at the entrance to the goal boxes were 
operated with clear nylon cords attached 
near the start-box doors. 
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with the .045 mm. Noyes pellets that would 
later serve as reinforcement available on the 
table top. Three days prior to training, 
groups of four Ss were allowed to roam 
freely in the maze with food available in all 
goal boxes. Two days prior to training, 
closing the doors after goal-box entry was 
added to the routine of the previous day. On 
the day prior to training, each S was given 
four trials in the maze with food available 
in all goal boxes. 

All Ss were placed in a three-choice 
visual-tactile discrimination situation during 
original learning (OL) and RL. The posi- 
tion of each distinct goal box in the maze— 
left, right, or center—was changed randomly 
before each trial, with the restriction that the 
correct goal box not be in the same position 
twice in a row. 

The Ss were generally run in squads of 
three with a Free S run first and the two 
Yoked Ss run next with the appropriate 
spacing of either extinction or competition 
trials during IL. Two .045 gm. Noyes pel- 
lets were used as the reinforcement for a 
correct choice. A moncorrection procedure 
was implemented by closing the guillotine 
doors once S entered the goal box. 

Each S received 40 trials during acquisi- 
tion. A criterion of six correct choices dur- 
ing the last 10 trials was adopted to insure 
that all Ss entering the later phases of the 
experiment had shown some learning of the 
initial discrimination. Fifteen Ss who did 
not meet this criterion were dropped from 
the experiment and replaced. 

During the 60 trials of IL Ss were treated 
the same as in OL, except for changes in 
the discriminanda and reward contingencies 
appropriate to the various groups described 
in the section on experimental design. 

The RL procedure was identical to that 
during OL. The Immediate Retention Ss 
were given 40 trials immediately after IL. 
The 48-hr. Retention Ss were returned to 
their cages following IL and fed 7 gm. of 
lab chow. They were fed their normal ration 
on the following day. On the day of the 
RL session, approximately 48 hr. following 
IL, they were fed 7 gm. of lab chow. Ap- 
proximately 4 hr. later these Ss were given 
their 40 trials under conditions identical to 
those for the Immediate Retention Ss. (It 
should be noted that this a priori plan for 
equating drive levels of immediate and 48- 
hr. groups turned out to have been less 
than ideal. Figure 2 reveals that the average 
Free S with one IL task received approxi- 
mately 58 rewarded trials before RL; since 
two 0.45 gm. pellets were given as reward, 
this amounts to the equivalent of around 
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5.2 gm. Perfect performance would have 
led to 100 2X 0.45 —9 gm. Thus the 7 
gm. given 48-hr. Ss before RL overestimated 
by something less than 2 gm. the ideal 
amount. For Ss in other IL conditions, this 
overfeeding in 48-hr. groups was still more 
pronounced. However, a 4-hr. delay was 
imposed between this feeding and RL, which 
would have tended to attenuate the confound- 
ing. Furthermore, reduced drive for 48-hr. 
Ss would have biased the results against 
recovery.) 

As a rule, two Immediate Retention and 
two 48-hr. Retention Ss were run in each 
session. The exact schedule for the Yoked- 
Extinction and Yoked-Competition Ss was 
determined by responses of the Free S with 
which they were yoked. 

Both choice responses and running times 
were recorded on each trial The running 
time was measured, from the time the animal 
oriented properly in the start box to the 
moment he crossed a photocell beam. 


RESULTS 


Original learning.—Comparability of 
groups was evaluated by a 3X 2X 2 
factorial analysis of variance based on 
the number of correct choices in the 
last 10 trials of OL (at which point 
all Ss had received identical treatment). 
None of the resulting Fs reached con- 
ventional levels of statistical signifi- 
cance. There was, however, a trend 
for fewer correct OL choices among 
48-hr. groups than among immediate 
groups, F (1, 84) 2 35, 102 5» 
.05. Although strictly speaking no in- 
ference is justified by this finding, it 
may be noted that such a tendency 
would tend to bias the results against 
the finding of recovery between im- 
mediate and 48-hr tests. 

The first panel of Fig. 2 shows all 
Ss' OL performance combined. 

Interpolated learning.—The perform- 
ance of Free groups is shown in the 
second panel of Fig. 2. The data are 
pooled across retention intervals later 
imposed. Both incorrect responses to 
the OL alternative (striped box) and 
correct responses to the IL alternative 
are plotted. The complement of the 
sum of these two proportions, of course, 
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Fic, 2. Choice performance of Free groups throughout the experiment. 


(The first panel 


shows OL acquisition for all Ss combined. The second panel shows acquisition of IL 


and occurrence of OL errors separately for Ss receiving one or two IL tasks. 


In the 


third panel, RL performance is plotted separately for all groups.) 


gives the rate of response to the third 
alternative. 

What seems apparent from this fig- 
ure was corroborated by statistical 
analyses. During the latter half of 
IL, Ss with two IL tasks make more 
extinction responses to the OL stimulus 
than do Ss with a single IL task, t (30) 
— 237, p « .025. More obvious was 
the finding that fewer correct choices 
are made during the same period by 
Ss with two tasks than Ss with only 
one, + (30) 2 822, p « 01. These 
data for Free Ss in IL give exactly 
the histories of the two yoked groups 
during IL, since the free partner for 
every yoked animal contributed to the 
means in Fig. Z. Thus, Yoked-Extinc- 
tion groups received more extinction 
training with two IL tasks than with 
one and Yoked-Competition groups re- 
ceived more IL trials with one IL task 
than with two. 

Relearning—The RL results are 
given in detail for Free groups in the 
right-hand panel of Fig. 2 and for all 
groups in Fig. 3. Two measures are 


represented in the latter: the number 
of correct (i.e., responses to OL stim- 
ulus) choices in the first 10 trials 
(Part A) and over all 40 trials (Part 
B). Apparent in both measures are 
significant main effects of IL condition, 
F (2, 84) — 60.13, 108.61, respectively, 
for Measures A and B, p’s both < .01, 
and of recall interval, F (1, 84) — 
19.68, 34.02, respectively, for Measures 
A and B, P's « 01. There was also 
for both measures a.significant inter- 
action of IL Condition X Retention In- 
terval, F (2, 84) = 5.92, 10.17, p’s both 
<.01. Free Ss showed by far the 
poorest RL performance, with Yoked- 
Competition Ss intermediate and 
Yoked-Extinction Ss performing at 
approximately the level of their termi- 
nal OL trials. The significant IL 
Condition X Retention Interval inter- 
action reflects the fact that large and 
significant recovery was displayed by 
all groups except Yoked-Extinction. 
The simple main effects (Winer, 1962) 
of retention interval were significant 
for Free, F (1, 84) — 23.45, 45.16, and 
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Fic. 3. Correct RL choices for all experimental groups on the first 10 RL trials 
(Part A) and on all 40 RL trials (Part B). 


for Yoked-Competition groups, F (1, 
84) = 8.10, 9.07, (?'s all < .01), but 
nonsignificant for Yoked-Extinction 
groups, Fs < 1.00. 

Although there was no significant 
main effect of number of IL tasks, 
there is some evidence in the figure 
and in the accompanying statistical 
analyses that this variable influenced 
performance. The three-way (IL Con- 
dition X Retention Interval x Number 
of IL Tasks) interaction was statisti- 
cally significant in the case of Measure 
A, F (2, 84) = 5.89, p < .01, and the 
Retention Interval X Number of IL 
Tasks interaction was significant for 
Measure B, F (1, 84) = 5:29, p < .05. 
These results should be approached 
with unusual caution since the Fs for 
the corresponding effects in the other 
analyses were both close to 1. How- 
ever, it is reasonable to believe that 
recovery effects might be somewhat 
transient, and therefore further analy- 
ses were conducted on the data of 
Part A (first 10 RL trials). For this 
measure (and to a lesser extent with 
Measure B) it is seen in Fig. 3 that 
those Yoked-Extinction groups paired 
with Ss receiving two IL tasks showed 
recovery but that Yoked-Extinction Ss 


paired with Free Ss receiving one IL 
task showed a decrement. The simple 

interaction reflecting this tendency 
(Retention Interval X Number of IL 
Tasks for Yoked-Extinction groups) 

was statistically significant, F (1, 84) 

— 9,95, p < 01. Also statistically sig- 
nificant were the simple main effects 
(Winer, 1962) of retention interval 
for the two separate Yoked-Extinction 
groups, Fs (1, 84) both = 499, p< 

.05. Mean median-choice latencies on 
the first 10 RL trials for both one-task 
and two-task Yoked-Extinction groups 
showed some recovery between im- 
mediate and 48-hr. tests. The recovery 
was not statistically significant either 
for the Yoked-Extinction Ss matched 
with one-IL partners, ¢ (14) = 1.59, 
or for those matched with two-IL part- 
ners, £ (14) = 0.89. However, when 
the two immediate and 48-hr. groups 
were combined, there was marginally 
significant recovery in choice speed, t 
(30) = 1.74, p < .05 by one-tailed test. 
This difference, while uncorrelated with 
the choice data, is impressive in light 
of the possible differences in drive dis- 
cussed above (i.e., inappropriately large 
pre-RL feeding of 48-hr. Ss). 
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Discussion 


The present study gave unambiguous 
evidence for recovery from extinction 
when extinction was carried out by coun- 
terconditioning procedures. It may be 
speculated that the relative distribution of 
practice on counterconditioning in the 
Kellogg and Wolf (1939) study prevented 
their observing recovery. By implication 
these data indicate that absence of re- 
covery following counterconditioning is 
no longer a tenable objection to the inter- 
ference theory of extinction. By the same 
token, the interference theory of memory 
need not abandon the extinction model of 
unlearning on these particular grounds. 

(Keppel, 1968, persuasively argues that 
there are other grounds for such abandon- 
ment.) 

With regard to isolating those factors 
responsible for the decrement in OL re- 
sponding observed during IL, the present 
data suggest that the acquisition of new 
incompatible responses is more influential 
than nonreinforcement for OL errors. 
However, it must be observed that RL 
scores for Yoked-Competition groups may 
have been subject to an artificial depres- 
sion in the present experiment. It will be 
recalled that the Yoked-Competition Ss 
received reward in all three alleys during 
IL, each alley marked with the same 
“correct” visual pattern. It is therefore 
possible that in addition to learning a 
new visual association, these Ss relearned 
whatever position habits they had orig- 
inally carried into the experimental situ- 
ation. Thus the absolute positions of the 
three main groups in RL may not be 
taken as definitive estimates of what ex- 
tinction and new learning contributed 
separately to Free Ss’ decrements. 

A source of some concern with the 
present data is the finding that Ss given 
regular extinction showed no overall 
spontaneous recovery. Early in RL, how- 
ever, Ss matched with Free Ss given two 
IL tasks did show significant recovery. 
Since these Ss had both more and more 
recent extinction trials than one-task 
Yoked-Extinction Ss, it is not unreason- 
able that the former and not the latter 
showed recovery. In fact, recovery may 
have occurred before the immediate test 


of Ss with relatively fewer trials late in 
IL (ie, in the cases of one-task Ss). 
The significant forgetting shown by the 
latter is consistent with this interpreta- 
tion and with the possibility that they 
were subject to a ceiling effect in relation 
to the high performance achieved by im- 
mediate groups. 
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A NEGATIVE CONTRAST EFFECT OF REWARD DELAY 


IN DIFFERENTIAL CONDITIONING! 


RICHARD G. BEERY? 
University of Iowa 


Rats were run in a differential conditioning paradigm to assess contrast 
effects of reward delay. The groups were designated: 1-5, 1-10, 5-10, 
1-1, 5-5, and 10-10, where the first numeral in each case indicated the 
length of delay (sec.) in the shorter delay alley (S+), and the second, the 
length in the longer delay alley (SC). 47 Ss were run, with 8 Ss per 
group in all but Group 5-10, The results on both starting and running 
speed measures closely paralleled the typical findings from studies of re- 
ward magnitude contrast effects. Thus no evidence was found for a posi- 
tive contrast effect, while some clear examples of a negative contrast 
effect emerged. The results were discussed in terms of an inhibitory 


model. 


In instrumental reward condition- 
ing, the static physical conditions of 
reinforcement, as specified by such pa- 
rameters as magnitude of reward (W,) 
and delay of reward (T,) have been 
extensively investigated. In recent 
years, increasing attention has been 
devoted in addition to contextual fac- 
tors such as S's previous or concurrent 
experience with similar or dissimilar 
reinforcement conditions. 

Perhaps the most convincing dem- 
onstration of the effect of such factors 
has arisen from the investigation of 
contrast effects with manipulation of 
W,. Typically, the more recent of 
these studies have involved varying 
discrepancies between two alternative 
W,'s in selective learning or differen- 
tial conditioning paradigms, in order 
to assess simultaneous contrast effects. 


1 This paper is based on a PhD dissertation 
submitted to the University of Iowa. The 
author is greatly indebted to his major 
professor, Roger W. Black, for his generous 
assistance and advice in directing the project. 
The investigation received financial support 
from a National Aeronautics and Space 
Administration predoctoral training grant, 
and the present manuscript was prepared 
under support of National Institute of Mental 
Health Postdoctoral Research Grant No. 
MH-32,918-02. 

2 Now at the University of Wisconsin. 


The results of such studiés are gener- 
ally but not completely consistent. 

In the two-alternative selective 
learning paradigm, fairly clear evi- 
dence has been reported for a negative 
contrast or depression effect of We 
(Davenport, 1962 ; Spear & Hill, 1965 ; 
Spear & Spitzer, 1966). On the other 
hand, a positive contrast, or elation 
effect has typically failed to appear 
(Hill & Spear, 1963; Spear, 1964; 
Spear & Spitzer, 1966), with the ex- 
ception of one study (Spear & Pavlik, 
1966), in which such an effect was 
found with an extended intertrial 
interval. 

In the differential conditioning par- 
adigm, the results show similar con- 
sistency. Thus again, a negative 
contrast effect has typically been re- 
ported, while positive contrast has 
failed to appear (Bower, 1961; Mat- 
sumoto, 1965; Spear & Spitzer, 1966). 
In one seemingly discrepant study 
(Goldstein & Spence, 1963), no con- 
trast effects were reported. How- 
ever, the ordering of mean speeds for 
all three measures employed in this 
study again suggests a negative, but 
nota positive contrast effect, although 
no statistical tests of the contrast 
comparisons were reported. 
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In the case of delay of reward (T,), 
the evidence is very meager. In one 
study of selective learning (Fowler, 
1963), it was found that response 
strength to a given T, was greater 
when that delay constituted the 
shorter of two alternative T,'s than 
when it was the longer of the two. 
Thus some tentative support is given 
to the hypothesis that a contrast ef- 
fect may also appear in the case of Tg, 
although the above design was not 
analytic with respect to separation of 
positive from negative contrast. 

"Therefore, the purpose of the pres- 
ent study was to provide further evi- 
dence regarding the occurrence of T, 
contrast effects in differential con- 
ditioning, in a comprehensive design 
which allowed separate analysis of the 
positive and negative effects. From 
an inhibitory interpretation of dis- 
crimination learning (Black, 1968; 
Spence, 1936), it appeared reasonable 
to expect that T, should function in 
a manner similar to W, to produce 
negative, but not positive contrast 
effects. 


METHOD 


Subjects—The Ss were 48 experimentally 
naive male hooded rats from the colony main- 
tained by the Department of Psychology of 
the University of Iowa. One S was discarded 
midway in experimental training because of a 
respiratory ailment, leaving one group (5-10) 
with one S fewer than the five other groups. 
All Ss were between 85-100 days old at the 
beginning of the habituation period. 

Apparatus —The apparatus consisted of a 
pair of parallel straight alleys, one painted 
black and the other white, each 33 in. long, 
5 in. high, and 3 in. wide, A 6-in, gray painted 
start box, with the height and width of a 
single alley, was constructed so that it could 
be placed in front of either alley, separated 

from it by a horizontally opening aluminum 
door. Each alley was divided into a runway 
and a goal section by an unpainted aluminum 
guillotine door located 9 in. from the terminal 
end. The goal boxes were each 12 in. wide and 
the same height as the runways and start box. 

At the end of each goal box, directly in line 


with the corresponding runway, was mounted 
a metal food cup, recessed into a block of wood 
painted. the color of the alley. Covering each 
of these food cups was an aluminum plate 
which could be withdrawn through a slot in 
the rear of the goal box. The mechanism for 
withdrawal of the metal cover was controlled 
by a Hunter decade interval timer. 

The apparatus was equipped for the mea- 
surement of starting and running times by 
means of circuits of photocells, Hunter photo 
relays, and Hunter klockounters. Starting 
time was defined as the latency from the 
opening of the start door to the interruption of 
a photobeam located 6 in. past the start door, 
while running time was recorded from the 
latter point to a photobeam located 12 in. 
farther down the runway. Interruption of a 
photobeam located 3 in. from the end of the 
goal box activated the Hunter timer, which, 
after timing the appropriate interval, broke a 
circuit to the delay mechanism, withdrawing 
the cover from the food dish. 

Procedure,—The Ss were housed two to a 
cage and maintained on a schedule of 1 hr. 
of free access to an ample supply of Purina 
Lab Chow at the same time each day, with 
ad lib access to water. This deprivation 
schedule was initiated 38 days prior to the 
beginning of experimental training. During 
this period, each S was handled 2-3 min. per 
day, and in addition, on each of the last 2 
days of the habituation period, Ss were placed 
in individual carrying cages for approximately 
10 min. and allowed access there to a smail 
supply of the pellets which were used as 
reinforcement in experimental training. No 
familiarization training with the apparatus 
was given prior to experimental training. 

On each day of experimental training the 
following procedure was employed: Ss of the 
first squad were placed in individual carrying 
cages and wheeled into the experimental room. 
A trial was initiated by removing S from its 
cage and placing it into the start box of the 
apparatus. Approximately 1 sec. after place- 
ment in the start box, the start door was 
opened. The start door. was closed as soon as 
Shad fully entered the runway, and the goal- 
box door was closed when S had fully entered 
the goal box, thus preventing retracing. After 
the appropriate delay interval (commenced 
by the interruption of the goal-box photo- 
beam), the cover over the food cup slid back, 
allowing access to five 45-mg. Noyes pellets. 
Immediately after the pellets were consumed, 
S was removed and replaced in its carrying 
cage, following which starting and running 
times were recorded and the next .S placed in 
the start box. 
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The Ss were run in rotation in the same 
order each day. Thus the second trial began 
after all Ss had been administered the first, 
etc. The intertrial interval with this pro- 
cedure was approximately 10-15 min., after 
the first trials. Six trials per day, three to 
each stimulus, were administered to each S 
on all days except the first two, on each of 
which three trials were given. Following the 
administration of these daily trials, the squad 
was transported back to their home cages and 
given their hour of access to food, while the 
next squad was run. 

Design.—Three differential delay groups 
were employed in the experiment, designated : 
1-5, 5-10, and 1-10, where the first numeral 
indicates the length in seconds of the reward 
delay period in the “positive” (shorter delay) 
alley (S--), and the second designates the 
delay period in the "negative" (longer delay) 
alley (S—). In addition, three constant Tg 
groups, designated 1-1, 5-5, and 10-10, were 
employed. For these Ss, the pair of numerals 
represents the single, common delay received 
in each alley, one of which was arbitrarily 
labeled S+, and the other, S—, for each S. 
The experiment was run in four squads of 
12 Ss per squad. Among the Ss of each squad, 
experimental treatment groups and brightness 
of the S+ and S— were completely counter- 
balanced, Acquisition training continued until 
96 trials had been administered, 48 in each 
alley. 


RESULTS AND DISCUSSION 


In order to test for contrast effects, 
it is first necessary to demonstrate 
that response speeds to a given T, 
do, in fact, differ with differing lengths 
of T, associated with the alternative 
stimulus. If such an overall difference 
in speeds were demonstrated, a con- 
trast effect would then be shown to 
have occurred if the differences were 
in a particular direction. Thus a posi- 


tive contrast effect would be exhib- 
ited if response speeds to a given 
shorter delayed (S+) discriminandum 
were to vary as a direct function of 
the length of the T, associated with 
the alternative stimulus (S—). On 
the other hand, a negative contrast 
effect would be demonstrated if re- 
sponse speeds to a given T, associated 
with S— were to vary as a direct func- 
tion of the length of T, associated with 
S3- 

Thus in the present experiment, re- 
sponse speeds to a 1-sec. delay, if dif- 
ferent, would form a potential test for 
positive contrast effects, while re- 
sponse speeds to a 10-sec. T, would 
constitute a test of the negative effect. 
Response speeds to the 5-sec. delay, if 
different, would provide a test of both 
positive (Group 5-10 vs. Group 5-5) 
and negative (Group 1-5 vs. Group 
5-5) contrast effects, as well as a con- 
trast test which would not differenti- 
ate whether any obtained effect were 
due to positive or negative contrast 
(Group 5-10). 

The mean response speeds of each 
group over the last 12 training trials 
(6 to each stimulus) are presented for 
both response measures (starting and 
running speeds) and both stimuli (S+ 
and S—) in Table 1. An examination 
of the data in Table 1 reveals little 
difference on either the starting or 
running measure of response speeds to 
a 1-sec. delay. The F test of speeds of 
the three groups experiencing this T, 
in S+ did not reach significance 


TABLE 1 
TERMINAL RESPONSE SPEEDS (FT/SEC) TO EACH DISCRIMINANDUM 


Groups Designated by Length of Delay (sec.) in Each Discriminandum 


Response 
Measure 
1vs.1 5 vs. 5 10 vs. 10 ivs. 5 1 vs. 10 5 vs. 10 
Starting .88 .99 | 1.06 | 1.01 | 1.14 | 1.10 | 1.03 .85 .99 .31 .92 65 
Running | 2.46 | 2.45 | 3.06 | 2.68 | 2.84 | 2.60 | 2.51 2.22 | 2.08 | 1.37 | 2.32 | 1.88 
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(F « 1.00) for either measure, render- 
ing a subsequent test for positive con- 
trast effects superfluous. 

Similarly, the three groups experi- 
encing a 5-sec. T, in one of the dis- 
criminanda can be seen, in Table 1, to 
exhibit no marked differences. Again, 
F tests of the differences (on S+ for 
Groups 5-5 and 5-10, and S— for 
Group 1-5) were nonsignificant for 
both starting, F (2, 20) — 1.33, p > 
.05, and running speeds, F (2, 20) = 
242, p > .05. 

The data on response speeds to a 
10-sec. T,, on the other hand, exhibit 
clear and consistent differences, as 
can be seen in Fig. 1 for the running 
speed measure. The curves for start- 
ing speed are very similar to these in 
form, and exhibit even more marked 
differences between the groups. These 
differences, at the end of acquisition 
(see Table 1), were found to be statis- 
tically reliable (on S— for all three 
groups) for both starting and running 
speed measures, F (2, 20)— 22.39, 
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P «.001; and F (2, 20)- 12.06, 
p < .001, respectively. Subsequent t 
test comparisons of starting speeds on 
each pair of groups indicated that the 
differences between Groups 1-10 and 
10-10 and between Groups 5-10 and 
10-10 were each significant, / (14) — 
7.65, p «.01; and / (13)= 3.93, 
P < .05, respectively, while that be- 
tween Groups 1-10 and 5-10 was not, 
t (13) = 2.13, p > .05. Similar results 
were obtained in the case of running 
speeds, in which the differences be- 
tween Groups 1-10 and 10-10 and be- 
tween 5-10 and 10-10 were again sig- 
nificant, whereas that between Groups 
1-10 and 5-10 was not, t (14) = 4.96, 
p < .01; ¢ (13) = 2.53, p < .05; and t 
(13)= 2.11, p > .05, respectively. 
Thus in two of the three comparisons 
on this value of T,, and with both re- 
sponse measures, significant negative 
contrast effects were found, and in the 
third comparison, the difference was 
in the negative contrast direction, al- 
though not significant. 


O--=-0 GROUP 5-10 
O—-—20 GROUP 1-10 
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BLOCKS OF SIX TRIALS 


Fic. 1. Mean running speeds for Groups 10-10, 5-10, and 1-10 to the nega- 
tive stimulus as a function of blocks of six trials. 
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On the other hand, the first con- 
dition necessary for demonstration of 
contrast effects was not met in any of 
the positive contrast comparisons (and 
one comparison of negative contrast), 
since no significant overall difference 
developed in response speeds to either 
the 1-sec. or 5-sec. T,’s. Further, itcan 
be seen in Table 1 that although the 
group means in all the negative con- 
trast comparisons were, without excep- 
tion, in the direction of a negative con- 
trast effect, the means in the positive 
contrast comparisons were almost uni- 
versally in the opposite direction from 
a positive contrast effect. 

Thus the present results coincide 
with those typically found in the case 
of W, (Bower, 1961; Davenport, 
1962; Hill & Spear, 1965; Matsumoto, 
1965; Spear, 1964 ; Spear & Hill, 1965; 
Spear & Spitzer, 1966) in supporting 
a negative, but not a positive contrast 
effect. One possible theoretical ex- 
planation of the occurrence of this 
negative contrast effect has been re- 
cently proposed by Black (1968). 
This account, based on a modifica- 
tion and extension of Spence's (1936) 
model of discrimination learning, ap- 
pears to incorporate quite well the 
findings of the present study, in ad- 
dition to the typical findings with re- 
spect to W,. Thus it can be assumed 
that a given excitatory tendency (E) 
develops in each alley, as an inverse 
function of the T, administered in 
thatalley. On the other hand, an in- 
hibitory tendency (J) is assumed to 
develop in the longer T, alley (S—), 
to a magnitude directly related to the 
increment of that T, from the average 
T, across the two alleys. Thus, in each 
alley, an effective excitatory potential 
(E), directly related to response 
strength, is assumed to arise as a func- 
tion of the algebraic summation of E 
and J, such that E = E — I. 

Therefore, following this model (cf. 
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Black, 1968), the greater the disparity 
between the two alternative T, values, 
the greater would be the in the pres- 
ence of the negative stimulus, and 
hence, the greater the decrement in re- 
sponse strength to S—. Positive con- 
trast, on the other hand, would 
clearly not be predicted by such an 
interpretation. If anything, a tendency 
in the opposite direction from a posi- 
tive contrast effect would be expected, 
due to generalization of J to the posi- 
tive runway. Some such generaliza- 
tion would certainly be expected on 
the basis of the number of stimulus 
characteristics common to the two 
alleys. Thus the ordering of group 
means in the opposite direction from 
positive contrast, which occurred in 
the present study, has been similarly 
found in differential conditioning 
studies in which W, was manipulated 
(Bower, 1961; Goldstein & Spence, 
1963; Matsumoto, 1965), and in one 
study (Matsumoto, 1965), this rever- 
sal was even found to be a statis- 
tically reliable phenomenon. 

It should be noted that this inhibi- 
tory model bears a resemblance, in 
many respects, to a frustration inter- 
pretation which has been suggested to 
account for the W, negative contrast 
effect (cf. Bower, 1961; Goldstein & 
Spence, 1963; Matsumoto, 1965). 
However, such use of a frustration 
model must omit incorporation of the 
motivational (drive-inducing) aspect 
frequently ascribed to it (Amsel, 
1958). Clearly, with the inclusion of 
this latter factor, the prediction of a 
negative contrast effect is complicated 
considerably, particularly in cases in 
which extensive training is adminis- 
stered (cf. Black, 1968; Matsumoto, 
1965). However, even though applica- 
tion of the more complex frustration 
formulation to account for the nega- 
tive contrast effects, either with W, 
or T,, entails conceptual difficulties, a 
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simpler, strictly inhibitory variable 
such as that discussed above does 
seem to offer considerable explana- 
tory promise. 
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REPETITION EFFECT AND SHORT-TERM MEMORY ' 
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University of Toronto 


2 experiments are reported which attempted to determine the basis of 
the “repetition effect,” i.e. the observed shorter reaction time (RT) 
for repeated events than for nonrepeated events. The Ist experiment 
was designed to determine whether the effect was due primarily to 
peripheral response facilitation, or primarily to more central coding 
effects. By employing a condensing task in which the same response 
was made to 2 different stimuli, it was concluded that the effect was 
not due to peripheral response facilitation, but appeared to be more 
central in origin. The 2nd experiment tested the hypothesis that the 
repetition effect resulted from short-term activation of the S-R memory 
trace. Some support was found for this prediction. It was found that 
the repetition effect declind with increasing intertrial interval (ITI) 
over a range in which decline in short-term memory (STM) is 
typically reported, and that RT for both repeated and nonrepeated 
events increased with increased ITI, indicating that increased fading 
of the memory trace occurred in both conditions. 


In a choice reaction time task, the 
reaction time (RT) for a repeated 
event is faster than for a nonrepeated 
or new event—i.e., one which is dif- 
ferent from the immediately preceding 
event (Bertelson, 1961, 1963, 1965 ; 
Hyman, 1953). Bertelson has named 
this phenomenon the “repetition effect.” 
Recently it has been suggested (Korn- 
blum, 1967) that this phenomenon may 
underlie the observation that mean RT 
increases linearly with transmitted in- 
formation, since under most circum- 
stances with an increase in the number 
of choices there is a decrease in the 
number of repetitions. Because this 
phenomenon could therefore play a 
very important role in the relation- 
ship between RT and transmitted in- 
formation, these experiments attempted 
to provide some understanding of the 
basis of the repetition effect. 

Two experiments are reported here. 
The first experiment was done to deter- 
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mine whether the locus of the repetition 
effect was central or peripheral in 
origin. The effect would be considered 
a peripheral one if after making a re- 
sponse there was some peripheral motor 
facilitation for a short period of time, 
so that a second response which was 
identical to the first could be made more 
quickly. On the other hand, the origin 
would be considered central, if after 
making a particular stimulus-response 
pairing, it was the S-R bond or mem- 
ory trace which remained activated, 
such that when the stimulus was pre- 
sented again it was not necessary to 
make an entire search through memory 
for the correct response as long as the 
S-R bond was still active. This activa- 
tion could correspond to the reverberat- 
ing circuit postulated by Hebb (1961) 
as the basis of short-term memory 
(STM). 

To differentiate between these two 
possibilities a condensing task was em- 
ployed (Posner, 1964) which permitted 
a two-to-one mapping of stimuli to re- 
sponses. Four stimuli were presented, 
two requiring one response and two 
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requiring a second response. Since the 
same response is made to two different 
stimuli, it is possible to examine the 
repetition effect as a function of repeti- 
tion of a particular response (two dif- 
ferent stimuli, same response) or as a 
function of the repetition of a particular 
S-R coding (same stimulus, same re- 
sponse). If the effect is primarily 
peripheral in origin, repetition of the 
response should account for most of 
the effect. If it is a more central phe- 
nomenon, the repetition effect should 
be greatest when the same S-R code is 
repeated. 

This problem has been examined 
previously by Bertelson (1965), who 
concluded that the effect was primarily 
peripheral in origin. However, in his 
experiment some confounding was pos- 
sible. The Ss pressed a left key if the 
Stimuli 2 or 4 were presented and a 
right key if the Stimuli 5 or 7 were 
presented. Bertelson found that the 
repetition of the response was the main 
determinant, since, e.g., RT to a 2 fol- 
lowing a 4 was almost as fast as to a 
2 following a 2, while both were much 
faster than to a 2 following a 5 or 7. 
However, it is possible that Ss coded 
the information in the form: even digit, 
left; odd digit, right, so that either a 2 
or a 4 activated the identical S-R code. 
This experiment therefore reexam- 
ined the situation, attempting to elim- 
inate any such confounding. 

The second experiment which is re- 
ported here attempted to trace the time 
course of the repetition effect, to deter- 
mine whether it was similar to that 
found in STM studies. 


EXPERIMENT I 
Method 


Subjects—Twenty Ss took part in this 
experiment. They were all enrolled in the 
introductory psychology course at the Uni- 
versity of Toronto and participated in the 


experiment as part of their course require- 
ment. 

Apparatus.—Stimuli were presented by an 
on-line read-out device, which permitted pre- 
sentation of the numbers 0-9. Only the 
numbers 1 and 2 were used in this ex- 
periment. The color of the background could 
be manipulated as well, permitting either a 
red or green background. The stimulus sub- 
tended a visual angle of 9.5°. A small orange 
neon bulb, situated 23 in. above the visual 
display, served as a warning signal. Re- 
action time from onset of the stimulus dis- 
play to the pressing of a key was measured 
to the nearest .01 sec. by a Cramer clock, 
The RT keys consisted of four push buttons, 
only two of which were used in this experi- 
ment. The S and E were seated next to one 
another in a booth which was separated by 
a divider so that S’s vision of E was entirely 
occluded. A random noise generator with a 
range of 20 kc was used to provide white 
noise. This noise was presented to S through 
earphones and provided an effective mask of 
any extraneous sounds in the room. 

Procedure.—Each § attended one 1-hr. 
session. A two-to-one mapping of stimuli 
to responses was employed in the condensing 
task. Four stimuli were presented: a Red 1, 
a Green 1, a Red 2, or a Green 2 (the color 
refers to the background), while only two 
responses were used—pressing the left key 
with the left forefinger or the right key with 
the right forefinger. The Ss were instructed 
to press the left key if either a Red 1 or a 
Green 2 was presented, and the right key if 
either a Red 2 or a Green 1 was presented. 
Hence, it was necessary to pay attention to 
both stimulus attributes of color and number, 

The warning signal was presented for 1 
sec, followed after a 1-sec. interval by a 
l-sec. presentation of the stimulus. Prior to 
the experiment, each S was given 12 practice 
trials. In the experiment proper, each of the 
four stimuli was presented 50 times in ran- 
dom order, giving a total of 200 trials. There 
was an intertrial interval of about 4 sec. 
between offset of the stimulus and presenta- 
tion of the next warning signal. 


Results and Discussion 


Response errors were very rare; 
where they occurred the stimulus was 
repeated at the end of the series. Con- 
sequently, the RT measures which are 
presented are for correct responses 
only. 
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Reaction times to each signal (ex- 
cluding the very first) were analyzed 
into three conditions: (a) Identical— 
repetition of the same stimulus (e.g., 
Red 1 following Red 1), (b) Equiv- 
alent—repetition of the same response 
to two different stimuli (e.g, Red 1 
following Green 2), (c) Different— 
different stimulus and different re- 
sponse (e.g., Red 1 following Red 2). 

This classification corresponds to that 
employed by Bertelson (1965). The 
mean RTs for the three conditions were 
as follows: identical, 807 msec.; equiv- 
alent, 937 msec.; and different, 875 
msec. Analysis of variance indicated a 
significant difference among conditions, 
F (2, 38) = 1531, p < 001, and a 
Duncan's multiple comparisons test re- 
vealed that the difference between each 
pair of conditions was significant. 

It may be seen, comparing the iden- 
tical and the different conditions, that 
the usual repetition effect is found —RT 
is faster to a repeated item (807 msec.) 
than to a nonrepeated item (875 msec. ). 
The question of interest is whether the 
equivalent condition more closely re- 
sembles the identical or the different 
conditions. If it is like the identical 
condition, this would support a periph- 
eral origin hypothesis. On the other 
hand, if it more closely resembles the 
different condition, it would appear that 
the effect is of more central origin. As 
may be seen above, RT in the equiv- 
alent condition is greater than both the 
identical and the different conditions. 
This suggests that the repetition effect 
is not peripheral in origin and that in 
fact repetition of the same response to 
a different signal may cause some addi- 
tional inhibition. This may be due to 
a slight reluctance on the part of Ss to 
make the same response to a new sig- 
nal immediately after having made that 
response to a different signal. Thus, 
it appears that the origin of the repeti- 
tion effect is quite central. 
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Experiment II 


One hypothesis to account for the 
repetition effect may be based upon 
STM effects. Considerable evidence 
suggests that when a stimulus is pre- 
sented, S appears to go through a 
search process to select a response 
(Hyman, 1953; Sternberg, 1966). This 
search continues at least until the cor- 
rect S-R pairing has been made. To 
account for the repetition effect it might 
be assumed that after the pairing has 
been made, the particular S-R trace 
which was selected remains activated 
for a short period of time. Conse- 
quently, when the same stimulus is pre- 
sented again, it is easier to select the 
correct S-R trace, and it may not be 
necessary to go through the entire 
search again. If, in fact, the repetition 
effect is due to the activation of a short- 
term trace similar to that postulated to 
underlie short-term verbal memory, 
then as the interval between presenta- 
tions is increased there should be in- 
creased fading of the memory trace or 
perhaps increased interference by other 
traces, and hence there is increased 
likelihood that S will initiate a search 
process. to select the correct response. 
Consequently, with a longer intertrial 
interval (ITI) the difference between 
RT to repeated and nonrepeated events 
should decline. Since there is greater 
necessity of going through the search 
on each trial, one would also expect 
mean RT to increase as ITI is in- 
creased. While previous experiments 
have found a decreased effect with in- 
tervals up to 1 sec. between the release 
of the response key and the presentation 
of the next signal, (Bertelson, 1961; 
Bertelson & Renkin, 1966; Hale, 1967) 
this experiment examined the intertrial 
interval effect over a range in which 
STM for verbal items has been shown 
to decline, namely 3-18 sec. (Peterson 
& Peterson, 1959). 


438 


Method 


Subjects.—Twenty-five Ss, drawn from the 
same pool as in Exp. I, participated in this 
experiment, 

Apparatus—The same apparatus was em- 
ployed as in the above experiment, the only 
difference being that now all four push 
buttons were used. 

Procedure—A four-choice RT task was 
used in this study which employed a one-to- 
one mapping of stimuli to responses. The 
four stimuli employed were the same as in 
Exp. I: Red 1, Red 2, Green 1, and Green 2. 
The Ss were instructed to respond with the 
left hand if the stimulus was red, and with 
the right hand if the stimulus was green. 
The forefinger was used if the number was 
1, and the middle finger if the number was 2. 
Hence, color was mapped to hand, and num- 
ber was mapped to finger, 

The warning signal and the stimulus were 
each presented for 1 sec. There was a 1-sec. 
foreperiod between the offset of the signal 
and the onset of the stimulus display. Three 
ITIs ie, the interval between offset of the 
stimulus on one trial and onset of the warn- 
ing signal on the next trial, were employed : 
2, 6, and 10 sec. Consequently, the total 
elapsed interval between onset of two suc- 
cessive stimulus displays varied from 5 to 
13 sec. 

Order of presentation of conditions was 
randomized and counterbalanced within Ss 
by presenting the three conditions in one 
random order during the first half of the 
session and then presenting them in the re- 
verse order during the second half of the 
session. Thus, a given S might receive the 
conditions in the order 6, 2, 10; 10, 2, 6. 
There was about a l-min. break between 
conditions and a 5-min, break between halves. 
In each half of the experiment, each stimulus 
was presented 12 times in random order dur- 
ing each interval, giving a total of 48 trials 
per interval for each half of the experiment, 
and a total of 96 trials per interval condition. 
At the beginning of the session, which lasted 
approximately 1 hr, S was given several 
practice trials to familiarize him with the 
situation. Prior to each condition S was 
shown the ITI that would be employed. 


Resulis and Discussion 

The mean RT for repeated and non- 
repeated events at each ITI is shown 
in Fig. 1. In no case was a RT longer 
than the shortest (2 sec.) ITI em- 
ployed. 
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It may be seen that RT for repeated 
events was in all cases faster than for 
nonrepeated events. A repeated-mea- 
sures analysis indicates this difference 
to be significant, F (1, 24) = 25.3, p < 
001. This is in accordance with the 
repetition effect which is typically 
found in such situations. 

The question of interest is whether 
the difference between repeated and 
nonrepeated events declines with in- 
creasing ITI. Examination of Fig. 1 
indicates that the difference does in 
fact decrease over the range of ITIs 
employed in this study, and the analy- 
sis of variance indicates that the Inter- 
val X Repetition Effect is significant, 
F (2,48) = 5.3, p < 01. This finding 
supports the prediction of a STM 
theory of the repetition effect, for as the 
ITI is increased, there is increased fad- 
ing of the S-R trace, and consequently 
increased likelihood of S being forced 
to initiate a search process to find the 
correct response associated with a given 
stimulus for repeated events as well as 
for nonrepeated events. 

Some additional support for this 
theory is provided by the significant in- 
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Fic. 1—Mean RT for repeated and non- 


repeated events as a function of intertrial 
interval. 
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crease in RT for both repeated and 
nonrepeated events with increasing ITI. 
This suggests that at shorter ITIs there 
is still some activation of the memory 
trace even for nonrepeated events, so 
that it is not necessary to initiate the 
search process on each trial How- 
ever, with increased ITI the increased 
fading necessitates going through the 
entire search more and more frequently. 

To test the generality of the findings 
the experiment was repeated with a 
two-choice task, and similar results 
were found. 
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SOME EFFECTS OF RHYTHMIC DISTRACTION UPON 
RHYTHMIC SENSORI-MOTOR PERFORMANCE! 


JAMES J. KEENAN 
Dunlap and. Associates, Inc. 


The influence of rhythimc distraction upon accuracy and timing in a 
rhythmic tapping task was investigated by manipulating pacer mo- : 
dality, number of pacers, distractor modality, task precision, and the 
temporal relation between intermittent pacers and distractors. 8 male 
Ss were tested under all conditions with performance under no-distrac- 
tion conditions as the control. Response (aiming) accuracy deteriorated 
under out-of-phase rhythmic distraction particularly when the require- 
ment was for precise aiming. Aiming accuracy was also affected by the 
number of pacers, pacer modality, and the requirement for task pre- N 
cision. Aiming accuracy was not improved by in-phase rhythmic dis- ( 


traction. Response timing was affected only by the requirement for 
precision; reducing the size of the target area lengthened the response 
timing interval, that is, the time it took the S to tap from one side to the 


other side of the tapping surface. 


This study was concerned with the 
effects of rhythmic extracurrent or 
distracting stimuli upon rhythmic sen- 
sori-motor performance. The spe- 
cific interest of the study was in the 
effects of varying the temporal rela- 
tionship between the pacer and the 
distracting stimuli upon the accuracy 
and timing of repetitive, rhythmical 
perceptual-motor performance. Stimu- 
lus conditions, ie., pacer modality, 
number of pacers, distractor modality, 
and response difficulty were also 
varied. 

The evidence from a wide variety 
of research on heteromodal stimula- 
tion indicates equivocal effects on 
performance: sometimes, the addi- 
tion of intercurrent or “distracting” 
stimuli interferes with the response; 
at other times, extrastimuli appear 
to assist performance. Reaction time, 
eg., is lengthened when a second 


1 Adapted from a dissertation submitted to 
Columbia University in partial fulfillment of 
the requirements for the degree of Doctor of 
Philosophy. The writer gratefully acknowl- 
edges the help of A. S. Thompson, Chairman, 
and C. Mueller, J. Davitz, and W. Cummings 
of the Dissertation Committee. 


stimulus closely follows the primary 
stimulus and both stimuli are in the 
same modality (Helson & Steger, 
1962). On the other hand, reaction 
time has been found fastest when 
nearly simultaneous visual and audi- 
tory stimuli are presented inasychrony, 
offset in time by approximately the 
difference between the reaction time to 
each separately (Hershenson, 1962). 
Steady white noise, auditory flutter, 
and simultaneous auditory and visual 
stimulation have been found to 
improve performance (e.g., Watkins, 
1964). Intermittent sounds have been 
resistant to habituation in a re- 
petitive perceptual-motor task (Cas- 
sel & Dallenbach, 1918). Disruptive 
effects of dual-modality stimulation 
also vary with conflicts in the mean- 
ingfulness of the material being pre- 
sented, the difficulty of responding, 
and the sense modality (Bartlett & 
Barlett, 1959; Klemmer, 1958; Mow- 
bray, 1953). 


METHOD 


Subjects —The Ss were eight males between 
the ages of 21 and 40 yr. The average age 
was 32. All Ss had normal hearingand vision. 
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Procedure.—A modified tapping-board task 
was used in which Ss were required to tap out, 
with a metal-tipped stylus, a constant alterna- 
tion dictated by periodically flashing lights or 
sounding buzzers. The task allowed S to 
impose a rhythm upon his movements and, at 
the same time, to treat each stimulus inde- 
pendently. The Ss followed one pacer condi- 
tion at a time and were instructed to maintain 
their tapping closely synchronized with the 
pacer rythm despite the rhythmic distraction 
that was presented under some conditions. 
The accuracy (aiming) and timing (inter- 
response interval) of S’s performance were 
measured under control (no-distraction) and 
experimental (distraction) conditions. Each 
S was tested under all conditions. 

Task difficulty was varied by increasing or 
decreasing the precision required of the S’s 
aiming response. Four sizes of circular 
openings in a mask over the tapping-target 
area were used: f in. diameter, 3 in, diameter, 
3 in. diameter, and } in diameter. Stimulus 
conditions were varied as follows: (a) in 
modality of pacer stimulus—pacer stimuli 
were either visual, light(s) or auditory, 
buzzer(s); (b) in modality of distractor 
stimulus—distractor stimuli were either visual 
(light)or auditory (buzzer) ; and (c) inthenum- 
ber and the location of pacer stimuli—in either 
modality, pacers were either a single light or 
buzzer or two lights or buzzers. When the 
pacer source was a single light or buzzer, it 
was placed centrally and directly in front of 
S. Dual pacers were position-coded so as to 
direct the response of S to one side or the 
other. The temporal relationship between 
pacer and distractor stimuli was varied to 
present the distracting stimulus either (a) 


l Audi tory (Buzzer) Distraction 
2. Visual (NE 5l Lamp) Distraction 
3.Position Coded Visual Pocer 
4.Centrolly Placed Pacer 
5.Tapping Mask Openings 
6.Tapping Stylus 

T. Non- Conductive Separator, 

8. Tapping Surfaces/Mosk 
9.Position Coded Auditory Pacer 
lO.Centrally Placed Auditory Pacer 
l.Aler! Lamp 


Fi. 1. 
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in-phase—regularly simultaneous with the 
pacing stimulus; (b) regularly-delayed—regu- 
larly following the pacing stimulus by .152- 
-03 sec.; (c) out-of-phase—regularly following 
the pacing stimulus by .302-0.3 sec.; or (d) 
regularly preceding—regularly preceding the 
pacing stimulus by .15+.03 sec. The 12 
pacer/distractor conditions were presented 
according to a counterbalanced factorial 
design. Within each condition, size of target 
area and the temporal relation between pacer 
and distractor were varied in counterbalanced 
order. 

The Ss were tested individually and in two 
sessions, each of which was approximately 14 
hr. in duration. The sessions were 24 hr. 
apart. During the first session, S was in- 
structed in the task and study conditions. 
He was allowed to tap out the rhythm of the 
pacers as they were being shown to him. 
Before each trial, S was told which of the 
conditions he could expect. The S was in- 
structed to tap alternately from side to side 
and to follow the pacer rhythm as closely as 
possible. With position-coded pacers, i.e., 
two lights or two buzzers, S was instructed to 
tap on the same side as well as the same time 
as the pacer. An S's performance was re- 
corded for 30 pacers during a trial but the 
recording did not begin at the outset of the 
trial. The Ss were allowed to tap for 15 pacers 
before their performance was recorded. They 
were required to continue tapping until the 
end of the trial which contained 5 or 10 pacers 
in addition to the 30. When the distraction 
was presented, it was introduced after 5 
pacer stimuli and continued through the trial. 

Apparalus.—Four equipment units were 
used. A stimulus program unit provided for 


(ts 
Dio. 


ol. 


Os 8s O3. 


Stimulus-response unit and component parts. 
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the programming of pacers and distractors. 
A stimulus-response unit was used to present 
the pacers and distractors to S and also to 
receive his response (see Fig. 1). It consisted 
of a modified tapping board, on top of which 
was placed a metal mask in which were 
punched circular openings of four sizes. 
From time to time, the mask was rotated 
manually so that a designated pair of open- 
ings, each of the same size, exposed the target 
area underneath. The stimulus-response unit 
also contained the lights and buzzers required 
for the presentation of the various pacer/ 
distractor conditions. 

The Ss were instructed to aim for the target 
area and to tap out the rhythm with the pacer 
alternately from side to side. Electrical 
contact was made each time S touched the 
tapping surface with the stylus. Separate 
counting circuits were closed, depending upon 
whether S hit the target area exposed by the 
mask opening or whether he missed the target 
and hit the surrounding mask area. Contact 
with both areas at the same time was not 
possible. Separate recording circuits made pos- 
sible the recording of the side on which S 
tapped. 

Two other units were used in the study. 
One of these was a recording unit which con- 


TABLE 1 


CHARACTERISTICS OF PaciNG, DISTRACTING, 
AND ENVIRONMENTAL CONDITIONS 


Condition Characteristics 

Pacer Periodicity 0.593 + .03 sec. 
Distractor Periodicity 0.593 + .03 sec. 
Pacer/Distractor, ON 0.10 + .03 sec. 

Period 
Visual Pacer/Distractor | NE 53 (neon) 

—Type bulb 
Auditory Pacer/Dis- 

tractor—Type Edwards Buzzer 


Auditory Pacer/Dis- 
tractor, intensity level 

Auditory Pacer/Dis- 
tractor, predominant 


80 db.” 


octave band 1200-2400 cps? 
Ambient illumination, 

Ss’ room 50-55 ftc. 
Ambient sound level, 

Ss’ room 32 db. 
Ambient temperature, 

Ss’ room + 68 °F 


are: .0002 dynes/sq cm; measurement taken at a 
point 25 in. equidistant from the three sound sources; 
instrument used was General Radio sound level meter, 
T: 1551-B. 

"Measured with calibrated General Radio octave 
band analyzer Type 1550-A., 
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TABLE 2 


ANALYSIS OF THE VARIANCE OF RESPONSE 
Accuracy SCORES FOR THE ErcHT Ss’ 
UNDER THE FIVE INDEPENDENT 
VARIABLE CONDITIONS 


Source of Coral df MS F 
Pacer Modality (1) 1 | 6,408.2 | 12.62** 
No. of Pacers (2) 1 | 1,743.8 | 8.90* 
Distraction 

Modality (3) 1 96.8 | 1.28 
Temporal 

Relations (4) 4 47.1 | 6.19** 
Size of Opening (5) 3 | 4,590.6 | 34.69** 
Subjects (6) 7 | 2,359.1) — 

1X2 1 | 1,544.4 | 10,25* 

1x4 4 41.0} 7.19** 

1x5 3| 112.8) 5.10** 
Sources of Error 

Estimates: 

1x6 7| 5028| — 
2X6 7 195.9 = 
3X6 7 15.1) — 
4x6 28 T15| — 
5X6 21 132.3 = 
1xX2X6 7| 1506| — 
1X4X6 28 5.7) — 
1X5X6 21 22.1 = 
* p= «05. 
poSon 


sisted of a two-channel Magnacord tape 
recorder (Model 728-4) and a Curtiss-Wright 
inkless chart recorder. The second unit was 
the timing unit, consisting of two calibrated 
instruments, a Heathkit audio oscillator and 
a Beckman-Berkeley counter, Model 5010. 
Millisecond relay circuitry was used in this 
unit. 

Daily checkout and calibration of the pro- 
gramming, timing, and recording units was 
maintained, using a General Radio sound level 
meter, a General Radio octave band analyzer, 
and/or a Beckman-Berkeley counter-timer 
(Model 7160) as applicable. Table 1 sum- 
marizes the characteristics of the pacing, 
distracting, and ambient conditions of the 
study. 


RESULTS 


The variances attributable to the 
effects of the experimental pacer, dis- 
tractor, and response conditions, as 
well as individual differences, were 
analyzed separately for response ac- 
curacy and timing. Judged statis- 
tically against the variance contrib- 
uted by the next higher significant 


ats 
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RHYTHMIC DISTRACTION AND RHYTHMIC PERFORMANCE 


interaction involving the variable of 
interest and Ss, accuracy (aiming) 
scores were significantly (.01 level) 
affected by the modality of the pacer, 
number of pacers, the temporal rela- 
tion between pacer and distractor, and 
the size of the target area. Table 2 
presents the analysis of the variance 
of response accuracy scores for the 
eight Ss under the five independent 
variable conditions. Response timing, 
i.e., the temporal length of the inter- 
tap interval, was affected only by the 
size of the target area. 

Performance and pacer character- 
istics.— The Ss hit the target area for 
an average of 23.25 out of a possible 
30 hits. Auditory pacers (M — 25.48) 
yielded greater accuracy than visual 
pacers (M —21.01). Accuracy under 
position-coded pacers (M — 24.41) was 
greater than accuracy under one 


26 


23 


Accuracy Scores (Max=30) 
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2I 


20 
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pacer (M — 22.08), although the differ- 
ence depended upon the pacer mo- 
dality. The mean accuracy score (M — 
23.28) under two visual pacers was 
higher than under one visual pacer 
(M = 18.74). Accuracy scores (M 
7:25.55) were higher for two auditory 
pacers than for either the two visual 
pacers or the one visual pacer. Ac- 
curacy under two auditory pacers, 
however, was not different from that 
under one auditory pacer (M — 
25.42). 

Performance and distraction char- 
acteristics.—The modality of the dis- 
traction did not have differential ef- 
fects upon performance. In general, 
however, distraction produced lower 
accuracy (aiming) scores than were re- 
corded under no-distraction. The ef- 
fect depended upon the temporal re- 
lationship between the pacer and dis- 


^uditory Pacer Scores 


PS Scores 
Out -0f- 
Phase 


Regularly 
Preceding 


Distraction And Phase Relations 


Fie, 2. 


Mean response accuracy scores under no-distraction and under 


pacer/distractor phase relations. 
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tractor and the modality of the pacing 
stimulus. In the visual pacer mode, out- 
of-phase pacer/ distractor relations 
(M = 19.99) significantly reduced 
aiming accuracy over the no-distrac- 
tion (M = 22.09), in-phase (M = 
21.33), and regularly-delayed (M = 
21.23) conditions. Regularly-preced- 
ing (M = 20.41) distraction also pro- 
duced lower accuracy scores than 
under no-distraction. In the audi- 
tory pacer mode, accuracy was not 
different under distraction than it was 
under no-distraction. However, all 
accuracy scores were higher under 
auditory than under visual pacer con- 
ditions regardless of the temporal re- 
lation between the pacer and distrac- 
tor. Figure 2 illustrates the mean 
aiming accuracy under the various dis- 


Accuracy Scores (Mox = 30) 


A 
5/8 in 


B8 
V2 in 
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traction and no-distraction conditions 
for visual and auditory pacing. 

Performance and response charac- 
leristics.—The size of the target area 
significantly affected both the aiming 
accuracy and interresponse timing of 
S's response. Accuracy scores de- 
creased generally with target size. The 
largest size (A) yielded higher ac- 
curacy than did the two smallest sizes 
(C & D). Scores under Sizes B and C 
were also higher than under Size D. 
Figure 3 illustrates these findings. Re- 
sponse difficulty and the previously 
mentioned difficulties of the two pacer 
modality conditions interacted. Per- 
formance on all equivalent sizes of tar- 
get area was less accurate under visual 
pacing than under auditory pacing 
conditions. 


A mean Scores 


ES 


Size Of Tprget Area 


Fic. 3. Mean response accuracy scores under the four sizes of target area 
and for aurally and visually paced conditions. 
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The response timing interval was 
found to be longer for performance 
under the smallest size of target area 
(M = .597 sec.) than under the largest 
size of target area (M — .574 sec.). 


DISCUSSION 


Previous studies, concerned largely 
with sensitivity and simple reactions, 
justified the postulation that the tem- 
poral relationship between multiple in- 
coming signals is important to the sen- 
sory as well as the motor response. From 
the evidence of this study, this appears 
to be so even in the case of rhythmical 
performance in which S is highly prac- 
ticed and is certain regarding task con- 
ditions. It was expected that temporally 
in-phase distracting stimuli would assist 
S in his duplication of the pacer timing 
and in the aiming accuracy of his tap- 
ping. No support, however, was adduced 
for this hypothesis. On the other hand, 
out-of-phase distraction had a deleterious 
effect upon performance, as expected, 
although it affected only S's aiming 
accuracy. The degree of response im- 
provement or impairment under distrac- 
tion was expected to be related directly 
to the temporal proximity of the pacer 
and distractor stimuli. No evidence was 
found for improvement of performance 
under distracting conditions. And, while 
out-of-phase distraction was impairing, 
distraction whose onset was spaced inter- 
mediate between simultaneity and out- 
of-phase, ie. regularly delayed, regu- 
larly preceding, was not proportionately 
less impairing. 

It was also expected that, when dis- 
traction resulted in either improvement 
or impairment (compared to no-distrac- 
tion performance), the effect would be 
greater for the more difficult task. This 
was in fact the case in the significant 
interaction between pacer modality and 
temporal relations. Visual pacers yielded 
less aiming accuracy than did auditory 
pacers. Under distraction, performance 
involving visual pacing was less accurate 
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than was the corresponding performance 
under auditory pacers. 

The results of this study are also seen 
as generally conforming to frequently re- 
ported expectations regarding human 
performance under noise or other extra- 
stimuli: (a) intercurrent stimuli in- 
fluence the difficult tasks more than the 
simpler activities (e.g., Archer, 1954); 
(b) the greater attention that must be 
given to the irrelevant stimuli, the 
greater the decrement in performance 
(e.g., Hodge, 1959); (c) conflicts and 
attendant performance decrements re- 
sult from the presentation, as irrelevant 
stimuli, of signals that formerly were 
relevant to S’s task (Hodge, 1959); and 
(d) the effect of extrastimuli is greatest 
when S must carefully categorize the 
relevant and the irrelevant stimuli, i.e. 
when it is relatively infeasible to dis- 
regard the extrastimulus (Erlick, 1962; 
Morin, Forrin, & Archer, 1961). 
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RANDOM REINFORCEMENT IN CONCEPT 
IDENTIFICATION! 


TOM TRABASSO AND HERMAN STAUDENMAYER 
University of California, Los Angeles 


The effect of random reinforcement (RR) on cues in concept identifica- 
tion was investigated in 2 experiments. Each experiment consisted of 
3 phases: initial learning, RR, and transfer. In Experiment I, with 
letter stimuli, Ss who received no initial training showed negative trans- 
fer whether or not there were identical cues in the RR and transfer 
phases, With initial training, however, negative transfer was problem- 
specific and occurred only when cues in the last 2 phases were identical. 
In Experiment II geometric figures were used as stimuli and the latter 
finding of Experiment I was replicated throughout. Ss also showed nega- 
tive transfer if common dimensions but not identical values appeared in 
the last 2 phases. The data indicate that Ss reject hypotheses based upon 
irrelevant cues but that this rejection is temporary and occurs only if the 
problem or dimensions remain unchanged. 


In an initial study of the random 
reinforcement (RR) effect, Levine 
(1962) first had Ss classify stimulus 
patterns over a series of trials where 
responses were correct or wrong at 
random. Then Ss were shifted to a 
problem where some cues were con- 
sistently paired with reinforcement. 
Levine found that regardless of the 
number of prior RR trials (varying 
4-60 in different experiments), the 
amount of negative transfer was con- 
stant. This finding was replicated by 
Holstein and Permack (1965). 

The constant retardation in transfer 
is inconsistent with the process as- 
sumptions of Bourne and Restle 
(1959) who suppose that the degree 
of "adaptation" or “neutralization” 
of irrelevant cues increases with the 
number of inconsistent reinforcements. 
Under these assumptions, negative 
transfer should have been correlated 
with the amount of prior RR. How- 
ever, the RR effect also contradicts 
one-stage, all-or-none interpretations 
of concept identification, which as- 


1This study was supported by research 
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sume that S resamples cues (or 
hypotheses) with replacement after 
each error (Bower & Trabasso, 1964; 
Restle, 1962). In these models, no 
negative transfer is expected and 
resampling without replacement (or 
with postponement) is a suggested 
alternative. 

A discrete learning interpretation 
was supported by Holstein and Pre- 
mack’s analysis of the presolution 
performance. Prior to S's last error, 
the probability of a correct response 
was found to be stationary or constant 
and near the a priori value of 50. 
Holstein and Premack (1965) con- 
jectured that the decrement in trans- 
fer may be a result of two possible 
strategies: S either rejects temporarily 
disconfirmed hypotheses or adds ir- 
relevant hypotheses. The Restle and 
Bower-Trabasso models assume a 
fixed pool of hypotheses fand either 
strategy decreases the probability that 
S samples a relevant hypothesis in 
transfer. Elsewhere, it has been shown 
that both alternative strategies are 
used: Erickson (1967) and Trabasso 
and Bower (1966) found that Ss 
postpone resampling of specific cues 
for a fixed number of trials following 
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inconsistent reinforcement, and Levine 
(1962) describes various verbal reports 
indicating that Ss rejected particular 
elementary hypotheses and generated 
more complicated, irrelevant ones 
(e.g., the guessing of sequences of 
corrects and wrongs). 

In the present study, the authors 
investigated the above interpretations 
of the RR effect. In particular, the 
concern was with what may be 
termed within and between problem 
effects of RR. To illustrate, consider 
the following three conditions: in 
Cond. 1 Ss receive no prior RR; in 
Cond. 2 Ss receive RR on cues that 
are different from those contained in 
transfer; in Cond. 3 Ss receive RR on 
cues that are identical to those in 
transfer. If RR leads to the addition 
of irrelevant hypotheses, and this 
transfers, then retardation would be 
expected in Cond. 2. If, however, RR 
leads to both the addition of irrelevant 
hypotheses and the rejection of hy- 
potheses based upon common cues, 
then more negative transfer should 
occur for Cond. 3 than Cond. 2. If 
RR is problem-specific and leads to 
rejection of common cues, then nega- 
tive transfer should be observed for 
Cond. 3 but not for Cond. 2. 


EXPERIMENT I 
Method 


Stimuli.—The stimuli were pairs of letters 
drawn in India ink from templates on white 
3 X 5-in. file cards. Each letter could be 
inscribed within an imaginary ł X j-in. 
rectangle. 

Two sets of stimuli were used. Each set 
contained eight cards and a different set was 
used for each problem. In Set A, the letter 
pairs were: FJ, FS, MJ, and MS; in Set B, 
they were: HR, HB, LR, and LB. Within a 
set, the letter positions were counterbalanced 
so that there were eight different stimulus 
patterns. 

Procedure.— The same instructions were 
read to each S who was told that his job was 
to learn to classify cards into two categories 


according to a rule known to E. The S was 
to respond “Yes” if he thought that the card 
fit the rule and “No” otherwise. Cards were 
shown one at a time via a cardholder. The 
SS self-paced his responses ; after each response, 
E informed S as to whether or not his response 
was correct. The card was removed about 4 
sec. after S received information feedback and 
a new card was immediately presented for 
classification, Each .S received a different 
order of presentation by shuffling the cards 
prior to the experiment and after every eight 
trials as necessary. For solvable problems, 
the learning criterion was 15 successive correct 
responses. If the criterion was not begun by 
Trial 60, the training ceased on S's first error 
on or after Trial 60. 

In Exp. Ia, there were two phases. In the 
initial phase, Ss were given 16 trials of RR 
according to a predetermined 50: 50 random 
sequence of corrects and wrongs, The only 
restrictions were: that there be no more than 
three corrects or three wrongs in succession, 
half of the trial outcomes be corrects, and 
Trial 16 end with a wrong according to the 
rule to be used on the final transfer problem. 
In the second, transfer phase, a pair of letters 
was made relevant and there was no interrup- 
tion between phases. In all final problems, 
the Stimulus Set A was used and the Letter 
F was always classified “Yes” while the Letter 
M was always categorized “No.” Defining 
a letter pair as a dimension, there were one 
relevant and one irrelevant dimension in the 
final problem. When Set B was used as a 
problem, the Letter H was always classified 
“Yes” and the Letter L, “No.” The position 
of the letters was irrelevant in all phases. 

In Exp. Ib, there were three phases. In 
the initial phase, Ss were trained on problems 
with one letter pair relevant and the other 
irrelevant. After Ss reached learning criterion, 
they were shifted without a break to 16 trials 
of RR and then, without interruption, to a 
third phase where a letter pair was once more 
made relevant in a final, transfer problem. 

Thus, in Exp. Ia, Ss experienced RR before 
learning a concept identification problem; in 
Exp. Ib, they experienced RR after learning. 

Experimental design and groups.—Table 1 
gives a summary of the experimental design 
and groups for Exp. 1. 

In Table 1, it can be seen that Group 1 
served as a control and received no RR prior 
to learning a problem. Group 2 received RR 
on a different set of stimuli from those in 
transfer. Group 3 received RR on cues 
identical to those in transfer. Group 4 was 
a control which learned an initial problem 
with cues different from those in transfer and 
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TABLE 1 


EXPERIMENTAL DESIGN: Exp. I 
TRAINING PHASES AND 
SriIMULUS SETS 


Phase 
Groups 
Initial RR Transfer 

Exp. la 

1 -— — A 

2 = B A 

3 — A A 
Exp. Ib 

4 B = A 

5 A A A 

6 B B A 

7 B A A 

received no RR between problems. Group 5 


learned, received RR, and relearned a prob- 
lem with the same set of cues throughout. 
Groups 6 and 7 are analogous to Groups 2 
and 3 in that they received RR on cues that 
were, respectively, different or identical to 
those contained in transfer. 

Subjects —The Ss were 137 volunteers re- 
cruited from introductory psychology classes 
at the University of California, Los Angeles. 
Participation in experiments was a course 
requirement. The Ss were assigned irregu- 
larly to groups with the restriction that the 
number of males in each group be equal. 
Experiment Ib was begun after 4 Ss had been 
run in each group of Exp. Ia; thereafter, both 
experiments were run concurrently. In initial 
training, 4 Ss in Group 4 and 3 Ss each in 
Groups 5, 6, and 7 failed to reach the learning 
criterion and were not transferred. These Ss 
were replaced by others who did learn the 
initial problem for purposes of equating the 
number of Ss on transfer. In Group 1, 4 Ss 
also failed to learn. In comparisons of initial 
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learning, only the data from the first 16 .Ss of 
Group 4-7 are used. There were 20 Ss each 
in Groups 1-3, 20 in Group 4, and 19 each in 
Groups 5-7. 


Results 


Analyses of the mean number of 
errors and the mean trial of last error 
yielded similar results and only those 
on the trial of last error are reported. 

The initial training performance of 
Groups 1, 4, 5, 6, and 7 were compared 
for purposes of establishing base line 
for evaluating RR effects on transfer 
without prior learning and as a check 
on S randomization. The respective 
mean trials of last error were 22.65, 
23.65, 23.75, 21.12, and 26.68. A 
simple analysis of variance yielded an 
F-ratio of less than one. Hereafter, 
the initial phase data from these 
groups will be referred to as the 
“pooled control." 

Table 2 gives the mean trial of last 
error and standard deviation for the 
pooled control and transfer conditions 
of Exp. I. In Table 2, one may com- 
pare the conditions by noting the 
similarity of cues contained in either 
initial training or RR phases with 
those in the transfer problem. 

Comparing Groups 2 and 3 with 
the pooled controls, a between prob- 
lem effect of RR is demonstrated. 
Both Groups 2 and 3 took significantly 
longer to learn than controls, F (1, 


TABLE 2 
TRANSFER REsULTs: Exp. I 


Comparison of Transfer Cues 


Group N eee ee sD 
Initial Training RR 

Pool = — 23.53 21.29 
Wadi zT different 37.25 2174 

3 20 — identical 34.05 19.83 

4 16 different = 9.81 6.98 

5 16 identical identical 12.24 11.47 

6 16 different different 9.26 12.98 

7 16 different identical 20.58 12.37 
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106) = 7.01 and 4.51, p < .01, but 
they did not differ appreciably from 
each other (F <1). These findings 
indicate that the RR effect on letter 
cues is not problem-specific but that 
Ss either reject general elementary 
solutions and/or add irrelevant 
hypotheses. 

Despite the above results, the 
performance of Groups 5-7 indicated 
problem-specific RR effects. These 
groups each learned an initial concept 
identification problem prior to RR 
and transfer. Groups 4-7 were com- 
pared in a simple analysis of variance 
and yielded a significant overall 
F-ratio, F (3, 60) = 5.12, p < .0t. 
Recall that Groups 6 and 7 were 
analogues of Groups 2 and 3. Groups 
6 and 7 were compared with the con- 
trol for transfer in Exp. Ib, Group 4. 
No reliable difference was found be- 
tween Groups 4 and 6 but Group 7 
differed statistically from both of 
these groups, F (1, 60) — 12.49 and 
8.45, p < .01. These results indi- 
cate that when Ss have learned that a 
simple solution exists, they reject 
specific hypotheses that are irrelevant 
in the same problem. With such 
knowledge, however, no between- 
problem RR effect is observed that 
might be attributable to S's transfer 
of added irrelevant hypotheses. The 
problem-specific effect of RR is also 
observed in Group 5, which was 
trained on the same cues throughout 
all phases. If RR had no effect, then 
this group should have made no 
errors in transfer. Prior learning 
apparently reduced the RR effect, 
since Group 5 learned the transfer 
problem faster than Group 7, F (1, 
30) = 4.56, p < .01. Finally, com- 
paring Group 7 with Group 3, general 
knowledge of the form of the solution 
also reduced the within-problem RR 
effect, since Group 7 learned faster, 
F (i 34) 2926 p<.01. Ap- 
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parently, Ss are less likely to add 
complicated irrelevant hypotheses if 
they know that the solution is ele- 
mentary in nature. 


EXPERIMENT II 


Experiment II was similar to Exp. I 
in design but employed geometric 
rather than letter stimuli. With 
letters, the stimulus dimensions are 
not as well defined as with geometric 
figures and the generality of the RR 
effect may depend upon this factor. 
Furthermore, previous studies on RR 
have used geometric patterns. Finally 
Exp. II extended the investigation 
to allow a generalized RR effect for 
dimensions by holding the dimensions 
constant throughout both phases but 
changing the particular values be- 
tween RR and transfer. 


Method. 


Stimuli.—The stimuli were geometric fig- 
ures drawn in crayon pencil from templates on 
white 3 X 5-in. file cards. Each figure could 
be inscribed within an imaginary 1} X 2-in. 
rectangle. 

"Three stimulus sets were used and each set 
was employed in a different problem. In Set 
A, the dimensions (and values) were: color 
(blue or yellow) and form (an X or a circle). 
In Set A', different colors and forms were 
used: red or green and a square or a semicircle. 
InSet B, the location of a dot (above or below 
a square figure) and the number of figures and 
dots (one or two) served as dimensions. For 
a given problem, decks of 32 cards were 
constructed. The decks contained eight sub- 
sets of the four possible combinations of the 
dimensions. 

Procedure.—The instructions, procedures, 
etc. of Exp. II were identical to those of Exp. 
I with the following exceptions: (a) the 
responses were pressing of buttons labelled A 
and B; (b) the correct category was signalled 
by a feedback lamp located above each but- 
ton; (c) the learning criterion was 10 succes- 
sive correct responses; and (d) both the stimu- 
lus response assignments and sets of stimuli 
were counterbalanced within each condition. 

Experimental design and groups.—The ex- 
perimental design and groups of Exp. II are 
summarized in Table 3. 


RANDOM REINFORCEMENT IN CONCEPT IDENTIFICATION 451 


TABLE 3 


EXPERIMENTAL DesiGN: Expr. ll 
TRAINING PHASES AND 
SriMULUS SETS 


Phase 
Group 
Initial RR Transfer 
1 mu B A 
2 = A A 
3 = A’ A 
4 B = A 
5 A A A 
6 B B A 
7 B A A 
8 A B A 


It can be seen that the first two groups in 
Toble 3 were analogues of Groups 2 and 3 of 
Exp. la, Group 1 received RR on cues differ- 
eni from those in transfer while Group 2 had 
Group 3 tested 


for a general, dimensional RR effect. 


this condition, color and form stimuli were 
used in both the RR and transfer phases but 
the values of the dimensions differed in the 


In Table 4, one may compare con- 
ditions on the basis of the denoted 
comparison between cues contained 
either in the initial or RR phases with 
those in transfer. Unlike Exp. Ia, the 
RR effect here was problem-specific. 
Group 1 showed no significant re- 
tardation on transfer following RR on 
cues which were different from those 
of the final problem (F <1). The 
RR effect was observed when cues 
were identical or similar (from the 
same dimensions) for the two phases: 
Group 2 showed considerable re- 
tardation, F (1, 94) — 67.83, p « .01, 
while Group 3 also differed statisti- 
cally from the controls, F (1, 94) 
= 6.22, p < .01. The latter result 
suggests that Ss tend to reject hypoth- 
eses based upon both particular 
stimulus values and dimensions that 
have been intermittently reinforced 
and are common to problems. The 


the same cues throughout. 


two phases. The remaining five groups of 
Exp. II were similar to those used in Exp. Ib. 
The initial training of Groups 4-8 provided 
control data for the effect of RR in Groups 
1-3. Group 4 was a control for transfer and 
learned successive problems on different cues 
with no intervening RR. Groups 5-8 con- 
stituted a 2 X 2 factorial design with the 
factors comparing cues used either in the 
initial or the RR phases with those in transfer 
(either the same or different, respectively). 

Subjects.—The Ss were 128 introductory 
psychology students who were each irregularly 
assigned to one of eight experimental condi- 
tions, There were 16 Ss in each condition. 
All Ss learned their initial problems and were 
transferred. 


Results 


The initial training of Groups 1, 4, 
5, 6, and 7 did not differ statistically 
(F « 1) and their data were combined 
as a pooled control. Their respec- 
tive mean trials of last error were 
2.25, 2.12, 2,00, 2.75, and 2.31. Table 
4 gives the transfer results for all 
conditions in Exp. II in terms of the 
mean trial of last error and standard 
deviation. 


between problem RR effect for Group 
2 of Exp. Ia may have been a result 
of naive Ss treating letters as belong- 
ing to a common dimension or stimu- 


lus set. 


Groups 5-8 were compared in the 
2 X 2 factorial analysis of variance. 
Learning was significantly faster when 
the initial training cues were identi- 
cal to those in transfer, as Groups 5 
and 8 required fewer trials to learn 
than Groups 6 and 7, F (1, 60) — 5.45, 


TABLE 4 
TRANSFER REsuLTs: Exp. II 
Comparison of 
Transfer Cues Mean 
Group iu SD 
Initial Error 
Training) RR 
Pooled — = 1.23 2.35 
Control 
1 — different | 2.63 3.26 
2 — identical | 15.38 | 13.23 
3 — similar | 6.00 5,61 
4 different — 2.44 2.36 
5 identical | identical | 5.50 5.62 
6 different | different | 1.31 1.53 
7 different | identical | 11.87 | 10.94 
8 identical | different | — 0.43 0.81 
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p «.01. As in Exp. Ib, when RR 

was on cues identical to those in 

transfer (Groups 5 and 7), significant 

retardation in transfer was observed, 
' F (1, 60) = 25.30, p < .01. 


Discussion 


Taken together, the present experi- 
mental results apparently set certain 
limitations upon the RR effect in con- 
cept identification. When Ss know the 
form of the solution or where stimulus 
dimensions are clearly defined, the effect 
of RR would appear to be either problem- 
or dimension-specific. That is, RR of 
cues leads to subsequent difficulty in 
transfer only in those cases where the 
cues are identical or from within the 
same dimension and no other alteration 
in the problem occurs. 

The findings of the present study shed 
some light on Hammer's (1955) failure 
to find negative transfer when irrelevant 
cues in an initial, solvable problem were 
made relevant. Hammer, using letter 
stimuli, introduced changes in the prob- 
lems by adding novel letters. Thus, he 
did not obtain a between-problem RR 
effect consistent with the negative results 
in both Exp. Ib and II. Furthermore, 
Restle's (1959) conjecture that S ignores 
(or rejects solutions based upon) irrele- 
vent cues relative to other relevant cues 
within a fixed context appears supported 
both by Hammer's (1955) and the 
present findings. 

Although the present data indicate 
that the RR effect is mostly problem- 
specific, the present design does not 
allow a clear decision about which 
strategy Ss most likely follow: rejecting 
the use of hypotheses based on particular 
cues or generating complicated, irrele- 
vant ones. It would seem that adding 
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irrelevant hypotheses is reduced when 
Ss know the form of the logical rule. 
Both strategies complement each other 
and constitute an attentional phe- 
nomena, ie. one "ignores" solutions 
based upon a particular set of stimuli if 
one is attending to and using other cues 
or if one is actively avoiding the use of 
particular cues (cf. Trabasso & Bower, 
1968). 
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TWO-CHOICE DISCRIMINATION LEARNING AS A 
FUNCTION OF STIMULUS AND EVENT 
PROBABILITIES! 


JEROME L. MYERS AND DONNA CRUSE 


University of Massachusetts 


180 Ss were run in a 2-choice discrimination task with mı =.85 for all Ss. 
The groups differed with respect to 7: (.85, .50, .15) and £ (.75, .50, .25). 
P(A1|T1) was lowest at 74—.50, the effect being most pronounced at 


B-.25. 


Marked overshooting occurred on both Type 1 and Type 2 


trials in several groups. The effects of response-event outcomes on sub- 


sequent responding varied over values of 72. 


These results were dis- 


cussed with respect to Atkinson's observing response model. 


In the usual binary prediction situ- 
ation, S is required to predict which of 
two events will occur on each trial. An 
extension of this is the discrimination 
situation in which there are two types 
of trials, differing with respect to the 
stimulus initiating the trials and with 
respect to the event probabilities on 
the trials. The two stimuli, Tı and T», 
occur with probabilities 8 and 1 — 8; 
the events to be predicted, E; and E; 
occur with probabilities 7; and 1 — 7i 
on 7; trials and with probabilities me 
and 1 — zs on Ts trials. One purpose 
of the present experiment was to eval- 
uate the joint effects of 8 and 2 upon 
response probabilities with 7; held 
constant over groups. 

This study was also designed to fur- 
ther test Atkinson's (1958) observing 
response model. According to this 
model, S is either in an observing state 
or a nonobserving state on each trial. 
In the observing state, the event pre- 
diction depends solely upon whether 
the discriminative stimulus peculiar to 
that trial type is conditioned to an 
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Institutes of Health Grant MH-03803-06 and 
National Science Foundation Grant GS-386. 
The authors wish to thank John Schnorr for 
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A; or an A, response. In the non- 
observing state, the event prediction 
depends upon the conditioning state of 
both the discriminative stimulus and 
of background cues common to both 
trial types. When the probability of 
observing is high Atkinson's model ap- 
proaches the pattern (Estes, 1959) 
model, predicting little variation in the 
response probability on one trial type 
as a function of event probabilities on 
the other trial type. When the prob- 
ability of observing is very low, 
Atkinson's predictions approach those 
of the Burke-Estes (1957) model 
which holds that the response prob- 
ability on one trial type is a linear 
function of the event probability on 
the other trial type. Generally, the 
Atkinson model predicts that the 
response probability on one trial type 
will be a convex function of the event 
probability on the other trial type. 
This prediction is borne out by the 
finding that P (41|7T3 (the prob- 
ability of predicting E; on a T; trial) 
decreases as 72 decreases to .5 and 
then increases with further decreases 
in z, (Atkinson, Bogartz, & Turner, 
1959; Popper & Atkinson, 1958). 

In the two studies just cited, 8 was 
held constant at .5. By indepen- 
dently manipulating 8 and rə, the pre- 
sent study provides a further test of 
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the Atkinson observing response 
model. Following Atkinson's (1958, 
p.315) procedure and using the same re- 
strictions on parameter values and in- 
itial response probabilities, the authors 
generated numerical predictionsforthe 
71, T2, and 8 values used in this study. 
A wide range of estimates of w, a pa- 
rameter reflecting the discriminability 
of T, and T; was employed. The 
following predictions held at all values 
of w, although response probabilities 
vary more with variations in v» and 
B when w (i.e., discriminability) is low. 
At 8 = .75, the asymptotic value of 
P (A41| T3) should be almost invariant 
with changes in m; at 8 = .5, P 
(41|T;) should decrease as r, de- 
creases to .5, then increase with fur- 
ther decreases in m2; at B = .25, 
P(A;|T1) should monotonically de- 
cline with decreases in ms For any 
value of 7», the asymptotic values of 
P(A1|T3) and P(A1|T:3) should vary 
directly with 8. Furthermore, this 
effect of 8 should become more pro- 
nounced as the difference between 7i 
and m» increases. 

The present paper also includes an 
analysis of the dependency of re- 
sponses upon the response-event com- 
bination of the preceding trial. Such 
conditional statistics are sensitive in- 
dexes of the strategies used by Ss and 
permit a more valid assessment of 
applicable models than would P 
(A1| T3) and P(Ai|T2) alone. 


METHOD 


Subjects —The Ss were 180 University of 
Massachusetts undergraduate students, 90 
males and 90 females, who were fulfilling a 
requirement for the introductory psychology 
course. There were nine groups with 20 Ss 
randomly assigned to each. 

Apparalus.—Each S sat in one of four 
separated booths in a 9 ft. X 9% ft. room. 
Each booth contained a 12 in. X 13-in. ma- 
sonite panel which formed a 20° angle with 
the table. Two green jewel lights, 1 in. in 
diameter, were mounted 4 in. apart and 2 in. 


down from the top of the panel. For half of 
the Ss, the event lights were mounted 2j in. 
to the left of the center of the panel, while on 
the right there was a plywood box covering a 
similar set of event lights which had been 
used in a previous experiment. The other 
half of the Ss faced panels identical in every 
respect except that the event lights were on 
the right and the covered lights were on the 
left. Mounted directly below each event 
light was a reset toggle switch with which S 
predicted which event would occur Two 
white signal lights, T; and T», were mounted 
in back of the event display on a panel perpen- 
dicular to the table and central to the event 
display. Thesignals were mounted vertically 
on the panel. 

A Tally tape reader was used to present 
the predetermined sequences of reinforcing 
events, and an Easterline-Angus recorder was 
used to record Ss' responses. The E and all 
programming and recording equipment were 
located in an adjacent room separated from 
Ss by a one-way vision screen. 

Procedure.—Each S was randomly assigned 
to one of the nine combinations of the inde- 
pendent variables, ms and 8. The r values 
were .85, .50, and .15 and the 8 values were 
.75, .50, and .25. The value of mı was .85 for 
all nine groups? Two sequences of 500 trials 
each were randomly generated for each of the 
nine experimental conditions. Within each 
sequence, the position of E; (the event appear- 
ing on 85% of the Tı trials for all Ss) was 
counterbalanced. 

At the beginning of the experiment, instruc- 
tions were read to Ss explaining that they 
were to predict which of the two lights would 
occur and that they were to try to predict 
correctly as often as possible. The Ss were 
not told specifically that there was a different 
reinforcement probability associated with each 
signal. 

The signal light initiating the trial came on 
for 2 sec. during which time .S predicted 
which one of the two event lights would 
appear. The event light came on as the 
signal went off and remained on for 2 sec. 
There was a 2-sec. intertrial interval. 


RESULTS 


Response probabilities on Type 1 
(T1) and Type 2 (T3) trials —Type 1 
(13) trials are those on which the r 


*The reported values of mı and m» are 
approximate; for example, 85% of 250 trials 
is not an integer. The actual proportions of 
E;'s were within 1% of the reported values. 
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value is .85; for 85/85 (mı = .85, 
7» — .85) groups, the designation is 
arbitrary. The upper half of Fig. 1 
presents P (4| T3), the probability of 
predicting E; on T; trials; the lower 
half of the figure presents the proba- 
bility of predicting E; on Ts trials. The 
three panels within each half differ 
with respect to 8, the probability of 
T; trials. The curves within each 
panel differ with respect to m», the 
probability of Ei on T% trials. The 


p=.75 


PROBABILITIES 


RESPONSE 


Vee pea eo: | 


Fic. 1. 


E; event is always that event occur- 
ring most frequently on T; trials. 
Table 1 summarizes the analysis of 
variance for the Tı and Ts data. With 
the exception of the 8 effect on Te 
data, all sources of variance are sig- 
nificant. The significant 8 effect on Tiy 
data reflects the decrease in P(A1| 71) 
values with decreases in 8. This may 
be due to the fact that there are fewer 
T trials as 8 decreases; however, this 
is doubtful because the 85/15 curve 


p=.50 p=.25 
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BLOCKS OF TRIALS 
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Probabilities of A; responses on Type 1 and Type 2 trials plotted 


over blocks of trials as a function of 8 and 72. 
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TABLE 1 
ANALYSIS OF VARIANCE FOR P(Ai|7;) AND P(A1|T3) 
P(A1T1) P(A: T3) 
Source df 
MS F MS F 
Bet; 2 -878 17.92*** .103 .96 
"ad 2 1.500 30.61*** 37.065 34,64*** 
BXc 4 403 822** 324 3.03* 
Ss/B X v 171 .049 -107 
Trials (T) 4 -831 103.87*** .066 6.60*** 
8xT 8 028 3.50*** .046 4.60** 
TXT 8 .024 3.00** .299 2.99* 
BXTXT 16 .022 2.159% 020 2.00* 
Ss X T/B X v 684 -008 010 
* p «.025. 
** p.01 
++ p.001 


at 8 = .25 is as high as the curve at 
B =.75. The 8 effect seems pri- 
marily due to the lowering of the 85/ 
50 curve with increased 8. This is 
also consistent with the significant 
8 X v» interaction for 7 trials. The 
finding that the 85/50 curves lie be- 
low the 85/85 and 85/15 curves ac- 
counts for the 7» effect on T trials. 
These main and joint effects of 8 and 
7» generally increase over trials re- 
sulting in the significant interactions 
with trials. 

Since each curve within a panel in 
the bottom half of Fig. 1 represents a 
different percentage of E's, the sig- 
nificant v» effect is to be expected. 
The 8 X m2 interaction reflects the 
increased m effect at 8 = .25, pre- 
sumably due to more exposure to the 
7» levels; as 6 decreases there are more 
T, trials. These effects tend to in- 
crease over trials. 

One further aspect of Fig. 1 deserves 
mention. The terminal values of 
P(A1| T3) overshoot rı when 8 = .75 
or .25 and r: = .85 or .15; similarly, 
P(A1|T3) overshoots v» when m: = 
.85and 8 = .75 or .25 and undershoots 
mo in the 85/15, 8 = .25 condition. 

Conditional statistics —Considera- 
tion of the tendency to make a certain 
prediction on Trial n + 1 asa function 


of the prediction-event combination 
on Trial n (A,,,E,,,) yields informa- 
tion about the stochastic dynamics of 
choice behavior. Such conditional sta- 
tistics may be expressed in the form: 


P (Anal Tini ionAnmEpn) 


where: A = prediction, E = event, T = 
trial type, m,p = 1,2 (1 refers to the 
more frequent event on 7 trials), and 
jk = 1,2. The values of this statistic 
for the three groups in which 8 — .5 
and for all combinations of j, k, m, and 
P are presented for the terminal trial 
block in Table 2. The trends to be 
noted generally hold for the other 8 
levels; however, the data at those 
levels are frequently less reliable be- 
cause many of the conditional statis- 
tics are based on very low numbers of 
observations. Therefore only the data 
for the three groups in which 8 — .5 
have been presented. 

The first consideration is the 85/85 
group. In the first three trial blocks, 
the probability was generally higher 
that a correct response would be re- 
peated than that an erroneous re- 
sponse would be repeated. This is the 
result invariably predicted by stimu- 
lus sampling theory and generally ob- 
served in simple binary prediction ex- 
periments involving only a single trial 
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TABLE 2 


VALUES OF P(Ai nyi Tj, siTk nA m n Ep 
FOR TRIALS 401-500 For EACH 
LEVEL OF ra WITH 8 = .5 


T: 
Tinsi | Tem | Amn | Epis 
85 50 1S 

1 1 1 1 | .9169 | .9113 | .8375 
1 1 1 2 |.9512 | .7544 | .8400 
1 1 2 1 |.4426 | .8295 | .7164 
1 1 2 2 |.5556 | .6154 | .5000 
2 1 1 1 | 8930 | .8098 |.4444 
2 1 1 2 | .9535 | .7483 | .6486 
2 1 2 1 | .5970 | .8462 | .8873 
2 1 2 2 | 5833 | .6582 | .9290 
1 2 1 1 | .8738 | .6196 | .0749 
1 2 1 2 |.9333 | .5493 | .1685 
1 2 2 1 |.4853 |.5890 |.3939 
1 2 2 2 |.6667 |.5556 |.5455 
2 2 1 1 |.9096 | .7635 | .5000 
2 2 jJ 2 |.9744 | .5899 | .3857 
2 2 2 1 |.4912 | .5870 | .1250 
2 2 2 2 |.6364 |.3554 | .1211 


type. However, by the fifth block the 
situation is reversed. Repetitions of 
erroneous responses are more probable 
than are repetitions of correct re- 
sponses; this is more noticeable in the 
case of A» responses, but it is true of 
both and it holds regardless of which 
pair of trial types is under considera- 
tion. In the notation of Table 2, 


P (Anal TimtiTemEmA n) 
«P (AwnalTisaTiosEnAa) 
and 


P (Asnjal DCjniiT in Es nA 25) 
«P (Armi) TjniiTisE15A 272). 


The order of the conditional statis- 
tics for the 85/50 group is consistent 
with the early trial block findings for 
the 85/85 group; as most reinforce- 
ment theorists would predict, correct 
responses are more likely to be re- 
peated than are erroneous ones. In 
the 85/50 group, the differentiation 
among the effects of the preceding re- 
sponse-event combinations is more 


pronounced when the two trial types 
are the same than when they are dif- 
ferent, a result which presumably re- 
flects the difference between direct and 
generalized reinforcement effects. 

The 85/15 data provide us with 
still another type of relationship 
among conditional statistics. When 
the two trial types are the same, cor- 
rect responses are again more likely to 
be repeated (the one slight exception 
to this, 


P (Awnjil Ting 1 ynE nA tn) 
XP (Avia Vinyl inE 2nd yn), 


occurs only in the fifth block). How- 
ever, when the trial types differ, the 
order is exactly the reverse of that ob- 
tained when the trial types are the 
same. Thus the probability of an 41 
response on a 7; trial is highest fol- 
lowing a correct A» response and low- 
est following a correct A; response on 
a T trial. A similar statement holds 
for the probability of an A» response 
on a T, trial following a T, trial. 
This order is consistent over all five 
trial blocks. Furthermore, unlike the 
case in the 85/50 groups, this differ- 
ential effect of the preceding response 
event combination is at least as great 
as the reverse one obtained when the 
two trial types are the same. 


Discussion 


It is clear from the present data set 
that response probability on one trial 
type is a function of the event proba- 
bility on the other trial type and of the 
relative frequencies of the two trial types. 
At B of .5, the present authors have 
closely replicated the findings of previous 
investigators (Atkinson, Bogartz, & 
Turner, 1959; Popper & Atkinson, 1958) ; 
P(Ai|Ti) is lower at w2=.5 than at 
75—.85 or .15. This result is important 
because it flatly contradicts the predic- 
tion of a linear relationship between 
P(Ai|Ti) and v: generated by several 
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models (Burke & Estes, 1957; Lee, 
1966), and is the primary source of sup- 
port of the Atkinson (1958) observing 
response model. Furthermore, terminal 
trial block results are within a few per- 
centage points of the values predicted by 
Atkinson with w (his generalization 
parameter) at .1, the value he found most 
satisfactory. Presumably, the fit would 
have been even better if the authors had 
more precisely estimated w. 

Although the observing response model 
more closely approximates the data than 
any other model of which the present 
authors are aware, several problems do 
exist. The model cannot predict the early 
and consistent overshooting of .85 and 
undershooting of .15 that occurs in the 
85/15 and 85/85 groups with 8—.75 or 
.25, The observing response model also 
fails to handle the P(A;|T:) data for the 
8=.25 conditions. After 500 trials, the 
85/50 and 85/15 curves should be close 
together with the 85/50 curve slightly 
higher. Instead, the 85/15 curve lies quite 
close to the 85/85 curve and the 85/50 
curve is depressed considerably below the 
other two and below the values predicted 
by the model. It appears from the data 
that the probability of an observing re- 
sponse is very low for the 85/50 condition 
and very high for the 85/15 condition. 
Atkinson's model would predict this order 
(i.e., more observing in the 85/15 group) 
but not this magnitude of difference. The 
situation might be improved by modify- 
ing Atkinson's assumption that the con- 
ditioning rate is the same for observing 
responses and for predictive responses. 

The conditional statistics provide dif- 
ficulties not only for the observing re- 
sponse model, but for any other stimulus 
sampling model as well. One problem lies 
in the finding that Ss in the 85/85 group 
repeat erroneous responses more fre- 
quently than correct responses. Because 
this trend appears only late in learning, 
it is tempting to dismiss it as a result of 
chance variability. However, the same 
pattern has been clearly shown by 
Schnorr and Myers (1967) in a discrimi- 
nation situation using a very different 

methodology with mı=m:=.60. They 
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have also pointed out that this pattern 
does not generally occur in simple binary 
prediction studies; the presence of dis- 
criminative stimuli seems to be a factor. 
The explanation of this negative recency 
effect may be related to the fact that the 
authors have provided S with a structure 
thatis lacking in simple binary prediction 
studies; by presenting two types of cues, 
the authors have divided the event se- 
quence into blocks of variable lengths, 
these blocks being the runs of homogene- 
ous trial types. Possibly, S learns to ex- 
pect a certain proportion of E;'s and Ers 
within each of these ''blocks" and con- 
sequently predicts the opposite of the 
event that he has just seen. At best, this 
is a partial account of the data for it 
ignores the fact that this negative re- 
cency is more marked after incorrect 
responses than after correct responses, 
and it does not explain why the effect in 
question appears to be limited to condi- 
tions in which the z's are the same on 
both trial types. However, the hypo- 
thesis that the sequence of trial types 
and event serves as a stimulus for S’s re- 
sponses may prove fruitful. 

An even clearer, but quite different 
pattern, exists among the conditional 
statistics for the 85/15 groups. When two 
consecutive trial types are different, the 
probability of a response is highest when 
the alternative response has just been 
correct, lowest when the response itself 
has just been correct. This result is sur- 
prising only if one accepts the identifica- 
tion of responses and events employed in 
the various models put forth for the 
discrimination situation. These models 
view A, as the prediction of a particular 
position; thus E, might be the left hand 
light and A, a prediction of the left hand 
light. However, suppose S thinks not in 
terms of left and right, but rather in 
terms of moreand less frequent. This sug- 
gests that 4, as prediction of the more 
frequent event be redefined, e.g., in the 
85/15 group, prediction of the left hand 
light on T; trials and prediction of the 
right hand light on T» trials. In this case, 
if Tin and Ty, ,, differ, the statistic 
tabled as Ai,u|A4ssEs, is now 
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Ai, n41|A1,nE1,n and the statistic tabled 
as Aj, ntlAinZin is now Aj, ml 
A2,nE2,n. The order of the conditionals is 
now exactly as predicted by any stimulus 
sampling theory model. An important 
consequence of this redefinition of re- 
sponses and events is that the major 
evidence for the observing response 
model is negated. With some mechanism 
for Ss to learn to respond to event fre- 
quencies as opposed to event positions, 
even the Burke-Estes (1957) model 
could handle the excellent discrimina- 
tions in the 85/15 group. 

In summary, any theorist who at- 
tempts to fully account for two-choice 
discrimination learning will have to take 
note of pronounced overshooting, strong 
generalization from T^ trials when 72 is 
.5 and excellent discrimination when ms 
is .15, tendencies to repeat erroneous re- 
sponses when m, equals vs, and the 
apparent finding that event frequency is 
more important than event position in 
determining responding. Neither Atkin- 
son's (1958) observing response model 
nor any other model of which the present 
authors are aware will handle all (or even 
most) of the results outlined above. The 
present authors’ own bias is that the most 
useful type of model will be one which 
identifies sequences of trial types and 
events as the relevant stimuli to which 
responses are conditioned. A recent paper 
by Gambino and Myers (1967) indicates, 
in the context of the simple binary pre- 


diction experiment, the type of axioms 
the present authors have in mind. 
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GSR CONDITIONING WITH LONG 
INTERSTIMULUS INTERVALS' 


MITCHEL C. MORROW AND THOMAS E. KEOUGH, III 
Georgia State College 


4 groups of Ss (each n 15) received 20 classical conditioning trials with 
an interstimulus interval of either 4, 8, 12, or 16 sec., and 2 groups of 
Ss (each n = 15) received 20 pseudoconditioning trials with CS dura- 
tion of either 4.1 or 16.1 sec, Magnitude and percentage GSR during 4 re- 
sponse intervals and amplitude during 1 response interval following CS 
onset were analyzed. Significant conditioning occurred in each response 
interval though the measure indicating it varied with the interval. 
Significant interstimulus interval effects were present in each response 
interval, and each measure showed the effect at least once across 
intervals. The course of conditioning from the 1st interval to the 4th 
was one of considerable response attenuation in magnitude and per- 
centage. Each measure declined significantly during conditioning in the 
1st interval, and though a decline in response strength within an interval 
was generally absent in the later intervals, there was little resembling 
an acquisition function with any of the measures. 


Although it has been frequently re- amount of GSR conditioning in each | 


ported that the optimum interstimulus 
interval (ISI) for simple GSR con- 
ditioning is in the neighborhood of 
.5 sec, (Kimble, 1961; Moeller, 1954; 
Prokasy, Fawcett, & Hall, 1962; 
White & Schlosberg, 1952), there is 
little question that significant amounts 
of conditioning occur at much longer 
ISIs, eg» 16 sec. (Rodnick, 1937). 
In addition to the amount, it also 
appears that the course of condition- 
ing is strongly influenced by the ISI. 
Both Grings (1960) and Kimmel 
(1966) have noted that following an 
increase in GSR magnitude early in 
conditioning a decline frequently oc- 
curs, and the latter hypothesized that 
the point at which the decline begins 
may be inversely related to the ISI. 

The purposes of the present ex- 
periment were to (a) investigate the 
effects of relatively long ISls on the 


1 This investigation was supported in part 
by Public Health Service Grant MH 12065-01 
made to the senior author. 

2 The data for this experiment were col- 
lected while the senior author was an assistant 
professor at the University of South Florida. 


of four separate time intervals follow- 
ing CS onset, and (b) determine 
whether the time of occurrence of the 
maximum amount of conditioning in 
the first interval—the response to 
which Kimmel's hypothesis applies— 
is inversely related to the ISI. 


METHOD 


Subjects—The Ss were 90 University of 
South Florida volunteer undergraduates, 54 
men and 36 women. They were randomly 
assigned to the experimental conditions with 
the exception of sex balancing, and were 
informed prior to participating that they 
would receive electric shock. Routine in- 
structions requesting 5 to remain motionless 
and pay attention to the stimuli were read 
to him. 

Apparatus.— The S was seated in an electri- 
cally shielded, sound-deadened cubicle (4 
X3 X 7 ft.) which was dimly illuminated by 
two shielded 6-w. bulbs. Willson sound 
barrier ear muffs further reduced extraneous 
auditory stimulation. The temperature inside 
the cubicle was approximately 75° F. 

The CS was a circular, white light with a 
diameter of 1 in. It was placed at eye level 
approximately 30 in. directly in front of S's 
face. The light was produced by a Grason~ 
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Stadler Co, 
(Type E4580). 

The UCS was an electric shock delivered 
through electrodes attached to the middle and 
index fingertips of S's right hand. The shock 
was produced by a Schultz Instruments Co. 
constant current shock generator, and its 
intensity during conditioning was 3 ma. 

The durations of the CS and UCS were 
controlled electronically by a Tektronix, Inc. 
waveform generator, Type 162, and pulse 
generator, Type 161. The duration of the CS 
was, depending on the group, either 4.1, 8.1, 
12.1, or 16.1 sec., and the duration of the UCS 
was .1 sec. The two stimuli terminated 
together in the paired conditioning groups. 

The GSR was amplified by a YSI Model 
22A Fels Dermohmeter after being picked up. 
from the palm and the back of S's left hand 
by the standard Fels electrodes covered with 
Fels electrode paste. It was recorded on an 
Esterline-Angus milliammeter operating at a. 
paper speed of 6 in/min. All responses were 
transformed to change in log conductance. 

Design and procedure.—The main part of 
the experiment consisted of a 4 X 20 factorial 
design in which four groups of 15 Ss each 
received 20 conditioning trials with an ISI of 
either 4, 8, 12, or 16 sec. Two additional 
groups of 15 Ss each received 20 unpaired 
conditioning trials with a CS duration of 
either 4.1 sec, or 16.1 sec. 

The S's left hand and middle and index 
fingertips of his right hand were cleaned with 
alcohol, the GSR electrodes were attached, 
and then he was seated in the experimental 
cubicle. The shock electrodes were attached, 
the instructions were read, and the ear muffs 
were placed comfortably on him, The pro- 
jector which presented the CS was pointed 
out, and then £ left the cubicle and closed 
the door, 

The S was first given two presentations of 
the CS alone. These were followed by three 
presentations of the UCS alone in intensity 
increments up to the intensity used during 
conditioning. Then there were four ad- 
ditional presentations of the CS alone. 
During the conditioning period, which oc- 
curred next, four groups of Ss received 20 
paired CS-UCS presentations with an ISI of 
either 4, 8, 12, or 16 sec. (Groups E4, E8, E12, 
and E16, respectively), and two groups of Ss 
received 20 unpaired CS-UCS presentations 
with either a 4.1- or a 16.1-sec. CS (Groups 
C4 and C16, respectively). 

The interval separating trials was varied 
unsystematically among either 40, 45, or 50 
sec. with an average of 45 sec. Both stimuli 
were presented within a 40-sec. interval for 


multiple stimulus projector 


Groups C4 and C16. These 40-sec, intervals 
were separated from each other in the same 
way that paired conditioning trials were 
separated. Each stimulus occurred first on 
half of the trials. The termination of the CS 
was never closer than 20 sec. to the onset of 
the UCS when the CS was first, and when the 
CS was second its onset was never closer than 
15 sec. to the offset of the UCS. 

Dependent variables,—Four separate GSRs 
were measured, depending on the time inter- 
val in which they occurred. The first interval 
response was measured as the maximum 
decrease in resistance which occurred between 
1.5 and 5.5 sec. after CS onset, the second 
interval response between 5.5 and 9.5 sec., 
the third interval response between 9.5 and 
13.5 sec, and the fourth interval response 
between 13.5 and 17.5 sec. after CS onset. 
Since the UCS was never omitted in the 
presence of the CS, only the first interval 
response was measured in Group E4, i.e., the 
UCR occurred at about 5,5-6 sec. after CS 
onset. Similarly, the first and second interval 
responses were observed in Group E8, and 
the first, second, and third in Group E12. 
Four responses were measured in Group E16, 
For comparative purposes the first interval 
GSR was observed in Group C4, and all four 
responses in Group C16, The second, third, 
and fourth interval GSRs, defined as decreases 
in resistance within a given time interval, were 
not necessarily the second, third or fourth 
deflections chronologically. 

Analyses were performed on magnitude, 
amplitude, and percentage GSR in the first 
response interval, but only magnitude and 
percentage measures were evaluated in the 
remaining intervals. Mean magnitude refers 
to the total GSR of all Ss divided by the 
number of Ss, ie, zero responders are in- 
cluded in the computation. Mean amplitude 
refers to the total GSR divided only by those 
Ss who responded. Percentage was com- 
puted by dividing the number of 5s into the 
number of responses, and it was transformed 
into the arc sin A percentage. 

"The magnitude and percentage measures 
were each analyzed over the 20 conditioning 
trials, but the amplitude measure was ana- 
lyzed across blocks of four trials so that n 
would not be too drastically reduced. Even 
with this procedure S loss was great enough 
with the amplitude measure in the last three 
intervals to render statistical conclusions 
somewhat hazardous. Not all Ss contributed 
to the amplitude measure in the first interval 
as contributed to the magnitude measure, 
Since, as Prokasy and Ebel (1966) point out, 
inferences about the relationships between 
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these measures require data from identical Ss 
over identical trials, one should proceed with 
caution in comparing the amplitude measures 
with the others in this study. 


RESULTS 


First Interval Responses 


Magnitude.—Table 1 shows the 
mean GSR magnitude of both the E 
and C groups during conditioning in 
blocks of two trials for the first 
response interval. In the first inter- 
val, Group E4 produced the greatest 
mean GSR magnitude overall and 
Group C16 the least, while their 
counterparts, Groups C4 and E16, 
responded at an intermediate value 
and were more or less equal to each 
other after the fourth pair of con- 
ditioning trials. A 2 X2 factorial 
analysis of Groups E4, E16, C4, and 
C16 showed that greater magnitude 
was associated with the shorter ISI, 
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F (1, 56) = 8.08, and with pairing, 
F (1, 56) = 9.11, p < .01, for both 
tests. Duncan’s multiple range test 
indicated that each group, except 
Groups E16 and C4, differed from 
each other group, p < .01. 

There was an early increase in mean 
GSR magnitude in the E groups and 
then a steady decline as conditioning 
proceeded. The magnitude of the 
GSR in Groups E8, E12, and E16 
appears to be more or less equal, but 
less than the E4 group throughout 
conditioning. A 4X20 (ISI X Trials) 
factorial analysis of the E groups 
indicated that only the trials effect 
was significant, F (19, 1064) = 7.20, 
p «.001. However, the ISI vari- 
able approached an acceptable sig- 
nificance level, F (3, 56) = 2.23, 
p < 10. Trend analyses of the same 
data showed that there was a signi- 
ficant linear component in the condi- 


TABLE 1 
Mean GSR MAGNITUDE (LoGAC) AND PERCENTAGE (ARC SIN VPERCENTAGE) ACROSS 


10 BLocks or Two CONDITIONING TRIALS, AND 


Megan GSR AMPLITUDE (LocAC) 


Across Five BLocks or Four CONDITIONING TRIALS IN 
THE First RESPONSE INTERVAL 


Trial Blocks 

Group 
(ISI) 

1 2 3 4 5 6 E: 8 9 10 
Magnitude 
E4 0209 | :0406 | .0360 |.0372 | .0297 | .0258 | .0336 | .0235 | .0197 | .0189 
E8 0326 | 10304 | 10233 | .0179 | 0232 | 0127 | 0209 | .0126 | .0115 | 0134 
E12 10192 | 0287 | 0176 | .0179 | 0103 | .0115 | .0122 | .0097 | ‘0063 |.0047 
E16 ‘0116 | 10266 | 10339 | 0248 | 0148 | 10148 | 0121 | 0082 | .0109 | ‘0072 
C4 0207 | 10281 | 10190 | .0188 | .0157 | .0104 | {0116 | .0147 | 0082 | O111 
C16 ‘0078 | 10096 | 0123 | 0052 | .0048 | .0073 | :0056 | .0071 | .0042 | {0034 
Amplitude 
p 0364 | .0304 | .0323 | .0340 | .0292 (n=14 
E8 0508 | 0306 | 10322 | ‘0257 | .0204 n=10 
E12 10242 | ‘0214 | .0166 | ‘0150 | {0097 "12 
E16 20241 | .0286 | 0181 | .0190 | .0112 i13 
ren 0460 | ‘0360 | 0238 | .0270 | .0199 (n=9) 
C16 0138 | .0162 | 10292 | .0094 | :0147 (n=8) 
Percentage | 60.22 | 69.22 | 71.80 | 75.00 | 63.7 

k ; : .00 | 63.75 | 63.75 | 63.44 | 63.75 | 61.16 | 63.44 
E8 $0.69 | 69:22 | 61.16 | 52.76 | 40.94 | 46:80 | 4700 | 4091 | 4694 | 4117 
E12 76.72 | 71.80 | 61.16 | 61.16 | 5710 | 56.82 | 61.16 | 52.76 | 52.89 | 45:00 
E16 61.68 | 63:44 | 71.80 | 71.80 | 59.10 | 69:22 | 48.83 | 54.83 | 56.82 | 54.83 
el 45.00 | 54.76 | 53.28 | 50.86 | 54.86 | 43.09 | 54.76 | 45.00 | 35.17 | 39.23 
Ci6 54.86 | 51.03 | 50.77 | 43.09 | 41.17 | 43.09 | 4503 | 41.10 | 35.17 | 28.32 
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tioning curves, F (1, 1064) = 69.40, 
p < .001, and that the quadratic com- 
ponent barely missed the conventional 
significance level, F (1, 1064) — 3.80, 
where the required F for the .05 
significance level was 3.84. Further, 
the trend analyses revealed that there 
were significant ISI X Trials linear, 
F (3, 1064) = 6.00, p < .001, and 
quadratic, F (3, 1064) =3.40, p<.05, 
interactions. The interactions indi- 
cate that taken alone neither the 
linear nor the quadratic components 
were identical for all ISI groups. 
Each group contributed more or less 
equally to the interactions. 
Amplitude.—Table 1 also shows the 
mean GSR amplitude and number of 
Ss responding during the first re- 
sponse interval for the E and C 
groups over blocks of four trials. The 
mean amplitude measures were based 
upon those Ss who responded at least 
once during each block, and each 
amplitude value shown represents the 
total GSR amplitude during the block 
divided by the number of responses. 
Comparison of E4, E16, C4, and C16 
produced a significant ISI effect, F 
(1, 40) = 6.86, p «.05, but sur- 
prisingly no pairing effect. A 4 X 5 
(ISI X Blocks) factorial analysis of 
the E groups showed that both the 
ISI, F (3, 45) = 3.07, p « .05, and 
Blocks, F (4, 180) = 6.00, p < .005, 
variables were significantly related to 
mean GSR amplitude. 
Percentage.—ln the first response 
interval Group E4 produced the 
highest percentage GSR and, among 
the E groups, Group E8 the least 
(Table 1). Similar to the magnitude 
and amplitude measures, there was 
a continuous decline in percentage 
GSR over the 10 blocks of condition- 
ing trials particularly in the longer 
ISI groups. Analysis of Groups E4, 
E16, C4, and C16 revealed that the 
pairing operation was significantly 


related to percentage GSR, F (1, 56) 
= 24.53, p < .001, but the ISI and 
interaction variables were not. How- 
ever, analysis of the E groups pro- 
duced significant ISI, F (3, 57) 
17.74, p < .001, and Trials, F (19, 
57) = 4.76, p < .001, effects. 

First interval response reduction.— 
To show that there was a significant 
overall decline in each measure from 
early in conditioning to late, differ- 
ence scores were computed between 
the means of Trials 3, 4, 5, and 
Trials 18, 19, 20 for the magnitude 
measure, Block 1 and Block 4 for 
amplitude, and Trials 2-6 and Trials 
16-20 for percentage for each S. The 
mean of the difference score for each 
measure was significantly different 
from zero: magnitude (X = .0183), 
t(3) = 3.45, p < .05, amplitude (X 
= .0163), (3) = 3.14, p < .05, per- 
centage (X = 15.88), 1(3) = 5.12, 
P <.01. Each significant effect in- 
dicated reduced responding late in 
conditioning. 


Second. Interval Responses 


Magnitude.—Inspection of the mean 
magnitude values during the second 
interval, shown in Table 2, reveals 
that there was little difference among 
response magnitudes for Groups E12, 
E16, and C16, but that Group E8 
produced larger average responses 
than any of the former. A compari- 
son of Groups E8 and C16 showed 
that overall there was significantly 
greater mean GSR magnitude in 
Group E8, /(28) = 2.84, p < .005, 
but a similar comparison of Groups 
E16 and C16 was not significant. A 
3 X 20 factorial analysis of the E 
groups produced significant ISI, F (2, 
42) = 5.00, p < .05, and Trials, F 
(19, 798) = 1.96, p < .01, effects. 

Percentage.—The percentage GSR 
during the second interval (Table 2) 
was lower, in general, than those in 
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TABLE 2 


MAGNITUDE (LoGAC) AND PERCENTAGE 
(Arc Sin VPERCENTAGE) GSR MEANS 
AND SDs or SECOND, THIRD, AND 
FOURTH INTERVAL RESPONSES 


Measure 
Interval 
and Magnitude Percentage 
Group 
M SD M SD 
Second 
E8 1517 | .1463 | 39.91 | 23.09 
E12 | .06 .0566 | 34.04 | 15.99 
E16 | .0564 | .0480 | 34.85 | 16.54 
C16 | .0382 | .0447 | 18.96 8.97 
Third 
E12 | .0559 | .0640 | 34.79 | 18.15 
E16 | .0328 | .0458 | 28.88 | 14.35 
C16 | .0220 | .0282 | 16.09 | 11.56 
Fourth 
E16 | .0132" | .0089* | 33.48 | 19.17 
C16 | .0034* | .0075* | 21.49 5.43 


^ Last four trials. 


the first interval. Not only was this 
true in the E groups in contrast to 
their performance in the first interval, 
but the effect was even more striking 
in Group C16. Since a greater per- 
centage GSR overall occurred in 
Group E16 than in Group C16, /(28) 
— 3.28, p « .001, no evaluation be- 
tween Groups E8 and C16 was con- 
sidered necessary due to the even 
greater disparity in performance be- 
tween E8 and C16 than between E16 
and C16. Analysis of the E groups 
produced both ISI, F (2, 38) = 3.96, 
p < .05, and Trials, F (19, 38) — 3.00, 
p < .01, effects. 


Third Interval Responses 


Magnitude.—Although the mean 
GSR magnitude of Group E12 overall 
was larger than it was in either Group 
E16 or C16 during the third interval, 
it was significantly different from 
Group C16 only, #(28) =1.96, p<.05. 
It is of interest to note, however, that 
the trend established in the first two 
response intervals was continued in 


this interval, i.e., the group for whom 
this interval was closest to reinforce- 
ment produced the largest mean GSR 
magnitude. 

Percentage.—The tendency for a 
lower percentage GSR in the later 
intervals was likewise continued dur- 
ing the third interval as shown by 
Table 2. Group C16, similar to its 
performance in the second interval, 
responded at a lower level than Groups 
E12 and E16 as indicated by a com- 
parison of Group E16’s and Group 
C16's overall performance during this 
interval, t(28) = 2.69, p < .02. An 
ISI X Trials analysis of the E groups 
showed that the ISI, F (1, 19) = 9.14, 
p «.01, variable was significantly 
related to percentage GSR during this 
interval. 


Fourth Interval Responses 


Magnitude.—Mean GSR magnitude 
was similar in the fourth interval in 
Groups E16 and C16 over the first 
16 trials, but became differentiated 
after that point. The late divergence 
of the scores resulted from an increase 
in mean magnitude by Group E16 
and a decrease by Group C16 on the 
last four trials (Table 2). A t test 
of the difference on the final two 
blocks of trials showed that there was 
significantly greater responding by the 
E16 group than by the C16 group, 
t(28) = 7.00, p < .001. 

Percentage.—In the fourth response 
interval Group E16 appeared to 
approach an asymptotic level of re- 
sponding on the second block of trials, 
while Group C16, after reaching its 
maximum percentage GSR on the 
sixth block, steadily declined there- 
after. Group E16 produced a signi- 
ficantly greater percentage GSR over- 
all than Group C16, ¢(28) = 5.74, 
p < .001. 

Effects of pre-UCS CS duration.— 
Mean magnitude, amplitude, and per- 
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Mean magnitude, amplitude, and percentage GSR in the interval overlapping 


UCS application as a function of the number of seconds the CS preceded the UCS. 


centage GSR in the interval during 
which the UCS was applied as a func- 
tion of CS duration preceding UCS 
are shown in Fig. 1. That is, Group 
E4's first, Group E8's second, Group 
E12's third, and Group E16's fourth 
interval performances appear. These 
performance measures are alike in 
that they came from the intervals, 
respectively, for each group in which 
the UCS was presented. They differ 
then in CS duration prior to UCS 
presentation. The figure shows that 
each measure decreased in a negatively 
accelerated manner and each appeared 
to be approaching asymptotic levels 
at about 12 sec. The initial drop 
appears to be greatest with the per- 
centage measure than with the others. 
Significant effects were found with 
each measure: magnitude, F (3, 56) 
= 16.43, p < .001, amplitude, F (3, 
53) = 26.00, p < .001, and percent- 
age, F (3, 56) = 11.57, p < .001. 


Amplitude was computed from Ns of 
15, 14, 14, and 13 for Groups E4, E8, 
E12, and E16, respectively. Separate ¢ 
tests between Groups E8 and E12 for 
each measure produced significant 
magnitude, /(28) = 2.29, p < .02, and 
amplitude, /(26) = 2.94, p < .02, 
effects. 


Discussion 


The data of this experiment show 
without question that GSR conditioning 
occurs with ISIs considerably longer 
than some previous ISI studies would 
suggest. Had Group C4 been employed 
as the only sensitization control for each 
paired group, it would have been im- 
possible to conclude that conditioning 
occurred at any ISI other than 4 sec. 
when the magnitude measure in the first 
interval was employed as the dependent 
measure. But that measure in that inter- 
val was precisely the one employed in 
most GSR conditioning studies until very 
recently. Furthermore, there was a con- 
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siderable decline in first interval magni- 
tude late in conditioning, particularly in 
the longer ISI groups, to a value below 
that on the initial conditioning trials. If 
conditioning under these circumstances 
were measured in extinction—as some 
previous ISI studies have done (e.g., 
Prokasy, Fawcett, & Hall, 1962; White 
& Schlosberg, 1952)—it would be in- 
evitable that a negative conclusion re- 
garding the presence of conditioning 
would follow. 

Conditioning was present in each of 
the four response intervals though the 
measure indicating it varied with the 
interval. Pairing led to significantly 
greater magnitude than the control pro- 
cedure in the first and fourth intervals, 
and to greater percentage in all four in- 
tervals. Statistical evaluations of ampli- 
tude were precluded in the second, third, 
and fourth intervals because of N re- 
duction particularly in the control 
group. The failure of the pairing variable 
to lead to greater first interval amplitude 
contrasts with the Prokasy and Ebel 
(1967) experiment in which the intensity 
of a tone-CS was also manipulated. 

Significant ISI effects were secured 
likewise in each response interval, and 
each measure showed the effect atleast 
once across intervals. The ISI variable 
was significantly related to magnitude in 
the second and fourth intervals, and 
magnitude approached an acceptable 
significance level in the first interval. 
There was a significant ISI effect on 
amplitude in the first interval, and on 
percentage in each interval. The effect 
showed itself without exception with 
regard to measure or interval in terms of 
greater responding in the ISI group 
closest to reinforcement. For example, 
in the second interval the ISI group 
closest to reinforcement, Group ES, 
showed greater magnitude and percent- 
age than any of the other groups. 

The course of conditioning from the 
first interval to the fourth was one of 
considerable response attenuation in 
magnitude and percentage. There was 
the suggestion in the percentage measure 
that an ISI group one interval removed 
from reinforcement, e.g., Group E8 in 
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the first interval, Group E12 in the 
second, “‘saved”’ its response for the next, 
reinforced, interval. Moreover, the course 
of conditioning within the first interval 
appeared to differ from each of the final 
three intervals in that there was a sig- 
nificant decline in response strength over 
conditioning with each measure. There 
was seldom a sign of a progressive change 
in conditioning in the later intervals, and 
nothing resembling a negatively accel- 
erated so-called ‘‘acquistion function” 
occurred. 

The strongest support for the hypothe- 
sis that the time of occurrence of the 
maximum amount of conditioning in the 
first interval is inversely related to the 
ISI would have come from a significant 
ISI X Trials interaction in magnitude. 
This failing, the hypothesis, neverthe- 
less, received partial support from the 
significant ISI X Trials linear and quad- 
ratic interactions, The linear and quad- 
ratic components differed across trials 
with each ISI group as the hypothesis in 
general predicts. 

Relative to the first interval decline in 
response strength, a number of explana- 
tory suggestions have appeared which 
could perhaps individually or in combi- 
nation account for the data. 

First, it could be due to Pavlovian 
(1927) inhibition of delay, a type of in- 
ternal inhibition which presumably 
underlies delayed and attenuated CRs. 
This explanation is favored by Kimmel 
(1966). 

Second, the decline could be explained 
by reference to perceptual set factors 
which are assumed by Grings (1960) to 
develop over conditioning. The “set” is 
a preparation for the noxious UCS and 
is presumed to result in smaller CRs and 
indeed UCRs. 

Third, it would be possible to account 
for the decline by assuming that on the 
basis of the development of smaller 
UCRs as conditioning proceeds the 
shock was perceptually less intense late 
in conditioning. And since the shock was 
less intense, conditioning declined. This 
hypothesis rests, however, on the ap- 
parently faulty assumption that GSR 
size is positively related to perceived 
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UCS intensity, which has been found not 
to be the case (Kimmel & Schultz, 1964). 

Fourth, the decline would be ac- 
counted for by some (eg. Stewart, 
Stern, Winokur, & Fredman, 1961) as due 
to the adaptation of the resensitized 
initial response to the CS. 

Fifth, and finally, it is worthy of note 
that Hull (1930) hypothesized that some 
responses—those he called “pure stimu- 
lus acts"—cwould, because of their func- 
tion, become greatly attenuated during 
conditioning. Osgood (1953) has utilized 
this notion in elaborating mediational 
theory, and suggested that autonomic 
responses may frequently serve as pure 
stimulus acts. The latter accounts for 
the reduction in their size as due to the 
great discriminative capacity of humans. 
For this, and perhaps other reasons, large 
autonomic responses are unnecessary. 
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DIVERGENCES AMONG RABBIT RESPONSE SYSTEMS 


DURING THREE-TONE CLASSICAL DIS- 
CRIMINATION CONDITIONING! 


ARTHUR L. YEHLE 
University of Miami 


Differences in acquisition and discrimination among nictitating mem- 
brane (NM), heart rate (HR), and respiration rate ( RR) response sys- 
tems were assessed in a classical conditioning experiment. 4 groups of 
8 rabbits were given acquisition and extinction training in a design that 
reinforced either 1 or 2 of 3 tones in either a unidirectional or a bidirec- 
tional manner. Fastest rates of CR acquisition and discrimination for- 
mation as well as greatest discrimination capability were revealed in the 
HR system. The rapid acquisition of HR discriminations relative to 
slower acquisition and discrimination formations of NM responses in- 
dicated that NM acquisition curves reflect something in addition to S 
learning that a US follows a particular CS. 


The purpose of the present investi- 
gation was to assess differences in ac- 
quisition and discrimination among 
nictitating membrane (NM), heart 
rate (HR), and respiration rate (RR) 
response systems in a classical dis- 
crimination conditioning experiment. 

In conditioning HR, RR, blood 
pressure, and cerebral responses in the 
rabbit, Yehle, Dauth, and Schneider- 
man (1967) observed that reliable con- 
ditioning occurred in each of these 
systems within 10 trials. In contrast, 
Schneiderman, Fuentes, and Gorme- 
zano (1962) have observed that many 
trials are required to obtain reliable 
conditioning of the rabbit's eyelid or 
NM. Similarly, Gantt (1960), Ja- 
worska, Kawolska, and Soltysick 
(1962), and de Toledo and Black 
(1966) have documented differences 
in response acquisition among re- 


1 This report is based upon a doctoral dis. 
seration submitted to the University of Miam! 
in August 1966. The work was conducted 
while the author was a National Science 
Foundation Cooperative Graduate fellow, and 
was supported by research Grant GB 5307 to 
Neil Schneiderman, in whose laboratory the 
author worked as a National Institute of 
Mental Health postdoctoral fellow. The 
author is now at Memphis State University. 


sponse systems during conditioning. 
Few studies, however, have examined 
these differences in detail and none 
have assessed the differences in dis- 
crimination among response systems 
conditioned simultaneously. 

The use of a discrimination para- 
digm permitted qualitative compari- 
sons of response acquisition among re- 
sponse systems to be made without re- 
sort to additional control groups; and 
the use of three tones of different fre- 
quency reinforced in a variety of com- 
binations (see Table 1) permitted the 
assessment of discrimination capabil- 
ity within each particular response 
system. 


METHOD 


Subjects. —The Ss, 32 experimentally naive 
male albino rabbits, 10 to 14 wk. old, were 
housed one to a living cage with free access to 
food and water. 

Apparatus.—A detailed description of the 
apparatus and preparation of Ss for the 
recording of the NM response may be found 
in Gormezano (1966) and Schneiderman and 
Gormezano (1964). Briefly, Ss were restrained 
in a box by an adjustable back plate, an 
adjustable headstock which comprised the 
front of the box, and a foam-rubber coated ear 
clamp. The outer eyelids of .S's right eye 
were held open by small clips, and a loop of 
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TABLE 1 


TONE FREQUENCIES AND REINFORCEMENT 
CONTINGENCIES IN EACH GROUP 
DURING ACQUISITION 


Low Medium High 
Frequency | Frequency | Frequency 

Group Ai + - — 
Group As ~ = ELE 
Group Bi ¥ + = 
Group Bs — + + 
Group C - + - 
Group D + - T 


Note.— + reinforcement, — =nonreinforcement. 


4-0 Ethilon monofilament nylon suture thread 
was passed through the epithelium layer of 
the nictitating membrane and attached to a 
minotorque potentiometer, Connections from 


the potentiometer to a Grass model III-D' 


EEG machine operated at a paper speed of 
50 mm/sec permitted amplification and re- 
cording of membrane extensions. Two stain- 


2 


LOW OR HIGH 


+10: 
+8: 


less steel safety pins, inserted in the skin of 
S's right shoulder and left haunch, and also 
connected to the Grass machine, provided 
a recording of heart beats. A small rubber 
mask containing a thermocouple was fastened 
over S's snout to permit RR recordings. 

Three tones differing in frequency were used 
as CSs, They were equated for an intensity of 
78-db. SPL, and delivered from a 6-in. speaker 
mounted 5 in. in front of S. The duration 
of each CS was 700 msec. and the frequencies 
were 350, 1,650, and 2,950 cps. Two Ss were 
run simultaneously in ventilated, nonacti- 
vated refrigerator shells. The US was a 5-ma. 
electric shock of 300-msec. duration, delivered 
via wound clips installed just above and below 
the lateral canthus of S's right eye. The 
offset of the CS was coincident with the 
onset of the US. 

Procedure.—Thirty-two Ss were assigned 
to four groups of eight each. All groups 
received one day of adaptation, eight days of 
acquisition, and three days of extinction. Each 
day all Ss received 72 trials consisting of 24 
presentations of the three different tone CSs 
randomly mixed with no more than two of 


LOW OR HIGH 
MEDIUM 
HIGH OR LOW 


HEART RATE 
PER CENT CHANGE FROM BASELINE 


23 4 5 6 
ACQUISITION 


| 2 5 
EXTINCTION 


2 


4 5 


3 6 1222075 
ACQUISITION EXTINCTION 


DAYS OF TRAINING 


Fic. 1. Mean percentage of change of heart rate from base line to each tone averaged 
over test trials for each day of acquisition and extinction. (The four group treatments are de- 


scribed in Table 1.) 
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one frequency tone CS occurring consecu- 
tively. In adaptation and extinction none of 
the tone CSs were paired with the shock US. 
In acquisition the shock US was paired with 
either one or two of the tones as shown in 
Table 1. Four test trials for each CS were 
interspersed and never reinforced with the 
shock US. 

Measurement. of  responses.—Nictitating 
membrane extensions producing pen deflec- 
tions of at least 2 mm. from the base line 
in the interval from CS onset to US onset were 
scored as CRs on every trial. Since the gain 
factor was two, a 2-mm. pen deflection rep- 
resented a 1-mm. extension of the NM. 

Changes in HR were assessed only on test 
trials by measuring the interresponse time 
(IRT) of 10 successive heart beats immedi- 
ately prior to the CS onset and comparing 
this with the measurement of 10 heartbeats 
following CS onset. 

Respiration rate was assessed in a manner 
similar to HR, but in this case by a measure- 
ment of four inspiration or expiration cycles 
prior to the CS onset. Respiration rate was 
then compared with a like measure of four 


NICTITATING MEMBRANE 
PER CENT OF CONDITIONED RESPONSES 


2 Se Sas 
ACQUISITION 


EXTINCTION 
DAYS OF TRAINING 


cycles after the CS onset. The HR and RR 
measurement procedures were based on the 
methods used by Yehle, Dauth, and Schnei- 
derman (1967). 

The HR and RR statistical analyses were 
conducted upon percentage of changes from 
base line (the IRT interval immediately prior 
to CS onset) cumulated over test trials, 
whereas NM statistical analyses were con- 
ducted upon the total number of CRs cumu- 
lated over both training and test trials. 


RESULTS 


Figure 1 shows the group mean HR 
responses based on the four test trials 
for each tone during each day of ac- 
quisition and extinction. It can be 
seen that no overlap between response 
measures to reinforced and nonrein- 
forced tones (Table 1) exists in any of 
the four groups from Day 1 onwards. 

An SsXTones within-groups anal- 
ysis of variance followed by Sheffé 


oR 
*--— MEDIUM 
=-= HIGH OR y 
Low 


3 4 5 6 |! 2 5 
ACQUISITION EXTINCTION 


Fic. 2. Percentage of nictitating membrane responses to each tone for 
all trials during each day of acquisition and extinction. 
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RESPIRATION -RATE 
PER CENT CHANGE FROM BASELINE 


2 3 4 6 6 7 
ACQUISITION 


6 12 -s 
EXTINCTION 
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| 2 3 
EXTINCTION 


2.974.8/€ tT 
ACQUISITION 
DAYS OF TRAINING 


Fic, 3. Mean percentage of change of respiration rate from base line to each tone averaged 
over test trials for each day of acquisition and extinction, 


pairwise comparisons indicated that 
all groups made significant discrimi- 
nations: F (2, 14)=38.4(+ — —), 
17.1(—+ —), 33.8(+ +—), 65.2 
(+ — +), p<.01. In addition an Ss X 
Reinforcement contingency analysis of 
variance was conducted upon the data 
of Test Trial 1 of Day 1, F (1, 63)= 
18.9, p<.01; and a similar analysis of 
cumulative responses in acquisition 
and extinction confirmed significant 
discriminations from the first test 
trial through the last day of extinction 
F (1, 63) =95.4, p<.01. 

In Fig. 2, the percentage of NM re- 
sponses in each group are represented. 
The acquisition and extinction curves 
suggest that differential responding 
occurred to the CSs in all but the 
+ — + group. These findings were 
confirmed by analyses of variance in 


conjunction with the Sheffé tests : 
F(2,14) 217.2(4- —),15.3(— +7), 
53.0(++—), p<.01; 
F (2, 14)=2.57(+ — +), p>.10. 


Percentage of changes of RR from 
base line are depicted in Fig. 3. Some 
differential responding can be noted in 
Fig. 3a and 3c, but two groups (3b and 
3d) indicate almost no evidence of dis- 
crimination. Analyses of variance and 
Sheffé tests used to assess HR and NM 
responding when applied to the RR 
data indicated that only the (+ — —) 
group (Fig. 3a) and the (— + —) 
group (Fig. 3c) made significant 
discriminations: F (2, 14)=7.37 
CF =’), 9.42( + A p< lor: 
F<1 for the other two groups. 

For each system, responses to rein- 
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Fic, 4. Combined responses to the rein- 
forced and to the nonreinforced tones for each 
of the three response systems. 


$8 6 ty d LC FR 
EXTINCTION 


forced tones in all groups and respon- 
ses to nonreinforced tones in all groups 
were combined separately (see Fig. 4). 
It is evident that the HR system dis- 
played a clear and consistent discrim- 
ination from Day 1 onwards (Fig. 4a). 
In contrast, a plot of combined re- 
sponses in the NM system indicates 
growth of discrimination throughout 
acquisition and maintenance of dis- 
crimination during extinction. Fi- 
nally, it can be seen in the RR system 
that the responses to reinforced and 
nonreinforced tones failed to reveal 
clear-cut differential responding in 
either acquisition or extinction. 


DISCUSSION 


The major findings of the present in- 
vestigation were that differences among 
response systems concurrently condi- 
tioned occurred in: (a) discrimination 
formation, (b) CR acquistion, and (c) 
maintenance of response level. 

In the HR response system all groups 
showed early and reliable evidence of 
discrimination, whereas in the NM sys- 
tem only three groups revealed reliable 
discriminations; and in the RR system 
only the + — — and the — + — groups 
Showed significant differential respond- 
ing. The differences among response sys- 
tems were most dramatically illustrated 
in the + — + group in which a reliable 
discrimination was made early in the HR 
system, but was never made in the other 
response systems. 

Early and consistent discrimination in 
the HR system indicated Ss had in effect 
associated the CS + with reinforcement 
and the CS — with nonreinforcement by 
the first test trial in acquisition. Gradual 
discrimination manifested in the NM 
system must consequently reflect some- 
thing in addition to a S merely learning 
that a US follows the CS + but not the 
CS -. 

Response acquisition and maintenance 
of response level also showed marked dif- 
ferences among systems. While HR re- 
sponding appeared at full strength on 
Days 1 and 2 followed by a steady de- 
crease throughout extinction, NM re- 
sponding increased gradually from al- 
most zero on Day 1 to a high value on 
Day 8 followed by a sharp drop during 
extinction. Respiration-rate responding 
rose to a maximum on Day 2 or 3 and 
then exhibited a slow decline through 
extinction. 

Supposing a study similar to this one, 
but measuring HR only, had been under- 
taken, the results after Day 2 could be 
ascribed to some phenomenon such as 
adaptation, reactive inhibition, or inhi- 
bition of reinforcement. On the other 
hand, if only NM responses were mea- 
sured, normal acquisition or mnemoniza- 
tion would be postulated. In contrast, 
measuring only RR in the ++ — and 
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the + — + groups, a conclusion would 
have been drawn that a three tone 
classically conditioned discrimination 
was beyond the capability of S. Since 
all of the above results were obtained 
simultaneously within the same Ss, this 
study emphasizes the necessity for a 
thorough consideration of differences in 
response system characteristics in future 
theoretical formulations. 

The differences among response sys- 
tems in acquisition function and dis- 
crimination capability observed in the 
present study suggest that peripheral as 
well as central learning processes play an 
important role in classical conditioning. 
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LEARNING WITH OVERT CUE IDENTIFICATION! 
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Cue similarity and reinforcement schedule were covaried in 2 experi- 
ments (total N =408) utilizing a 2 choice discriminative event prediction 
task with and without overt cue identification. P(41|S1) was a linear 
function of r: — P(E;|S2) when the cues (pure tones) were highly confusa- 
ble and a U-function when S; and S: were highly discriminable. P(A1|S1) 
was independent of v; and did not differ from rı — P(E;|S;) at the inter- 
mediate level of cue similarity, suggesting that, in order to predict prob- 
ability matching, models of discrimination learning require some degree 
of confusability between S; and Ss. Ss tended to shift their event predic- 
tion response whenever they shifted their identification response from 
that of the previous trial and to shift their cue identification following an 
incorrect event prediction. The conditional probabilities found when 
7171—7:could be predicted by redefining the task as a stimulus learning 


rather than a response learning problem. 


A previous study in two-choice 
auditory discrimination learning found 
that the proportion of correct re- 
sponses, P(A;|S;) and P(A:| S»), in- 
creased as the intensity differential AZ 
between the two stimuli, Sı and Sə, 
increased (Moore & Halpern, 1965). 
Popper and Atkinson (1958) have 
shown that P (A;|S:) is also dependent 
on m: = P(E;|S:) in two-choice prob- 
Ability learning using nonsense syl- 
lables as cues. These two results 
indicate that the probability of an 
appropriate response to S; under a 
noncontingent reinforcement schedule 
is dependent upon both the cue 
similarity of S; to S; and the rein- 
forcement schedule of S. However, 
no experimental results have de- 
scribed how these two sources of 
generalization interact. The present 
studies attempted to assess the rela- 
tive contribution of generalization due 
to cue similarity and generalization 


1 This investigation was carried out during 
the first author's tenure as a National Aero- 
nautics and Space Administration predoctoral 
fellow under Grant NSG(T)-137. 

? Now with the Department of Psychology, 
University of Denver. 


due to m» at different levels of psycho- 
physical similarity and different sched- 
ules of v» in a probability learning 
discrimination task. 

Models of discrimination learning, 
e.g., Bush and Mosteller (1951) and 
Burke and Estes (1957), that can 
account for the generalization effects 
due to cue similarity would predict 
that generalization due to m should 
be positively related to the cue 
similarity of S; and S; That is, 
when S; is identical to Ss, P(A1|S1) 
should equal P(Ai;|S;) at asymptote 
and depend only on P(E;), whereas 
with sufficient cue distinctiveness 
between Sı and S; and therefore 
no stimulus overlap or confusability, 
P(Ai|Si) should be independent of 
7» and depend only on rı. Studies 
in two-stimulus probability learning 
usually assume that the cues are 
psychophysically discriminable or that 
the distinctiveness of the cues is 
positively related to discrimina- 
tion performance, the separation of 
P(A1|S1) and P(A;|S;). However, in 
the present study, an identification re- 
sponse preceding the event prediction 
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response was required of half of the 
Ss as an independent observation of 
the amount of confusability of the 
two stimuli at different levels of cue 
similarity. 

In recent experiments (e.g., Hal- 
pern & Moore, 1967), conditional 
statistics of the form P(Ai,n|Si,nSj,n—1 
Ak,n-1Em,n-1)) 4, J, &, m = 1, 2, are 
notalways rank-ordered as they should 
be if the reinforcement assumptions 
of these models are correct and 
sufficient. Omitting the trials sub- 
scripts, P(Ai|A;Es) often exceeds 
P (A1| A3E3), P (A1| A1E3), or even 
P(Ai|A;E:, whereas most learning 


models require that JP(Ai|A;E:) 
> P(AM|AE; > P(Ai|AsE2), i 4 j. 
These ‘“‘inversions,’’ which rarely 


occur in simple probability learn- 
ing, seemed to be most prevalent 
whenever different cues are pre- 
sented on successive trials. It also 
seemed that the likelihood of an 
inversion was directly related to the 
distinctiveness between the cues S; 
and S} These observations suggested 
that inversions might somehow be 
related to covert cue identification 
activity by Ss. As one possible 
example, if S decides that a given 
trial is of a different type from the 
previous one, the outcome of the 
latter may be completely ignored as 
having no bearing on the choice at 
hand; hence, no reinforcement effects 
would appear in the data. Unfortun- 
ately, this account is too simple. 
Observed inversions are often too 
pronounced to be discounted as 
random fluctuation resulting from 
stochastic independence between suc- 
cessive trials (Halpern & Moore, 
1967). Therefore, a purpose of these 
studies was to determine the sequen- 
tial effects of cue identification by 
requiring S to identify the cue he 
thought was being presented before 
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making his event prediction om each 
trial. i 
METHOD 


General 


Subjecis.—The Ss were 408 University of 
Massachusetts undergraduates assigned un- 
systematically to the various experimental 
treatments, 

Apparatus.—Up to three Ss were run at a 
time, each seated at a table top enclosure 
containing a conditioning board (Estes- 
Straughan) consisting of a white center warn- 
ing light and two red event lights posi- 
tioned above each of two spring-loaded lever 
switches. Tones were generated by an audio 
oscillator (Hewlett-Packard Model 200) and 
were presented over matched headphones 
with a continuous white masking noise. 
Experimental events were controlled by pro- 
gramming equipment consisting of a paper 
tape reader, interval timers (Hunter), and 
relays. This equipment was housed in a 
cubicle adjacent to Ss’ room. Events and 
responses were recorded on an event recorder 
(Esterline-Angus). 

Procedure.—The onset of a tone started a 
trial. The tone lasted 2.83 sec. during which 
Ss in the identification task were required to 
make a loud or soft identification response by 
pressing the respective button. At the offset 
of the tone the warning light was illuminated 
for 1.5 sec. and Ss made their prediction 
responses. The event light was illuminated 
for .67 sec. at the offset of the warning light. 
Hence each trial lasted 5 sec. and the intertrial 
interval was also 5 sec. 

The Ss required to identify the tones were 
given the following instructions: 


You will be receiving two tones differing 
slightly in loudness over the headphones. 
Your first task will be to indicate which of 
the tones you are listening to. You will do 
this by pressing one of the buttons. You 
will push the top button for the louder 
(softer) tone and the lower button for the 
softer (louder) tone. You are expected to 
guess if you are not sure which tone is on. 
You will have 23 sec. to make your choice. 
After 2} sec. the tone will go off and the 
white light on top of your panel will go on. 
Your second task will be to predict which of 
the two red lights at the bottom of your 
panel will go on. As soon as possible after 
the white light goes on, you are to press one 
of the two switches. After you have pressed 
a switch, one.of the two red lights will go on. 
If the red light above the switch you pressed 
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goes on, you were correct. If the other one 
goes on, you were incorrect on that trial. 
Remember, you will be told whether or not 
you have been correct on the second task 
only. You will be given no information 
regarding your response on the first task, 
that is indicating which tone is on. 


The Ss not required to identify the tones 
were instructed as follows: 


You will receive two tones differing 
slightly in loudness. When the tone goes 
off, the white light on top of your panel will 
go on. Your task is to predict which of the 
two red lights at the bottom of your panel 
will go on. As soon as possible after the 
white light goes on you are to press the 
switch under the red light you think will 
go on. Be sure to press the switch only once 
and before a red light goes on. If the red 
light above the switch you have pressed 
goes on, you were correct. If the other 
one goes on, you were incorrect on that trial. 


Experiment I 


The Ss required to identify cues did so by 
pushing one of two button switches on a 
smaller remoyable panel located in front of 
the conditioning board. Four levels of cue 
similarity were obtained by varying the 
intensity differential (AZ) between two 800 
Hz. tones in steps of 1 db., from 0 through 3, 
starting at 70 db. (SPL). Hence, the inten- 
sity pairings in db, were 70-70, 70-71, 70-72, 
and 70-73. In the latter three groups the 
more intense tone was S; for half of the Ss and 
Si for the other half. 

Two noncontingent random reinforcement 
schedules were crossed with AI: (a) mı 
= P(Ei|$1) 21and zs — P(E:|1) 2.5 (Group 
1-.5), and (b) rı =.9 and z4—.6 (Group .9—.6). 
Two independent random sequences of 200 
trials were selected for each schedule. The 
only restrictions on these program tapes were 
that no more than four Sı or S: trials occurred 
in succession and that each cue was presented 
100 times. Position of 41 response (right or 
left) and loud identification response (top or 
bottom) were purposely confounded with pro- 
gram tape to make the design more efficient. 
"Thus, counting the contrast between the two 
types of tasks, i.e., with and without iden- 
tification responses (henceforth designated 
Groups IR and NIR, respectively), there 
were 6 Ss in each cell of a 4X 2X2 X2 
between-groups factorial design for a total of 
192 Ss. For analysis of variance of P(A1)s, 
the data were further partitioned by two 


D. W. MASSARO, J. HALPERN, AND J. W. MOORE 


within-Ss factors: cue (Si vs. Sz) and trial 
block (1-100 vs. 101-200). 


Experiment II 


Added to the center of the conditioning 
board was a 2 X 4.5-in. panel recessed 1.5 in. 
with two spring-loaded buttons mounted 
vertically and labeled loud and soft. This 
panel was covered for Ss not required to make 
indentification responses. 

Three levels of cue similarity (AZ) were 
crossed with three stimulus contingent rein- 
forcement schedules for both types of tasks. 
The two types of task, with identification 
response (IR) and without identification 
(NIR), provided the third principle factor. 
In all groups the louder tone was S; for half of 
the Ss and S: for the other half. Thus, there 
were 6 Ss in each cell of a 3X 3X2 X2 
between-groups factorial design for a total of 
216 Ss. The intensity pairings of the two 800 
Hz. tones were 73-74.5, 73-76, and 73-79 db. 
giving a 1.5-, 3-, and 6-db. differential, re- 
spectively, for three decreasing levels of cue 
similarity. The three reinforcement sched- 
ules (r) were mı = P(E,|S:) = .8 for all 
groups, 72 = P(Ei|S:) = .8, .5, and .2, re- 
spectively, for Group .8-.8, Group .8-.5, and 
Group .8-.2. 

A sequence of 300 trials was determined for 
each schedule of + with the restrictions that 
not more than 4 S; or S; trials occur in succes- 
sion and that each cue be presented 25 times 
in each 50 trial block. The events were 
randomized such that the appropriate per- 
centage of Eis were presented in each 50 trial 
block. Both position of 4; response (right 
or left) and loud identification response (top 
or bottom) were counterbalanced between .Ss. 
The analysis of variance of P(A;) included 
the four between variables of Task, AJ, 7, and 
Tone (loud or soft as Si) and the two within 
variables of Cue (S; vs. S2) and Trial Blocks. 
"The analysis of identification response, P(I), 
included only data from Group IR. 


RESULTS 
Experiment I 


Marginal statistics.—Figure 1 shows 
P(A;) plotted at asymptote (defined 
as the last block of 100 trials). The 
figure indicates that P(A;|S) ex- 
ceeded P(Ai|S;), F (1, 160) 275.84, 
p «.001, and this difference was 
inversely related to cue similarity, 
F(3,160) = 23.79, p < .001. Figure 
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Asymptotic proportion of A; responses to Sı and Sz as a function of Task, 


AI, and reinforcement schedule. 


1 also illustrates an increased separa- 
tion between P(A;|S1) and P(A;|S2) 
as AI increased, with greater separa- 
tion in Group 1-.5 than Group.9-.6, 
F (3, 160) = 3.77, p « .025. 

The identification performance of 
Group IR in terms of proportion of 
correct identification of the two tones 
was .51, .59, .69, and .78 at AJ = 0, 
1, 2, and 3 db., respectively. The 
mean proportion of correct identifica- 
tions under the two reinforcement 
schedules for AJ > 0 were .68 and .69 
for Group 1-5 and Group .9-.6, 
respectively. 

Sequential statistics—Asymptotic 
and preasymptotic first order con- 
ditional probabilities of an Ai re- 
sponse were compiled separately for 
each reinforcement schedule and 
task. Data were pooled over levels of 
AI and other factors in order to in- 
crease reliability, and the transition 
point from preasymptotic to asymp- 
totic data was estimated from inspec- 


tion of learning curves (not shown) to 
have been Trial 80. 

An analysis of the sequential data 
from Group IR in which Ss indicated 
that the cues presented on successive 
trials were the same (Li, = Ij,n-1), 
whether they actually were or not, 
revealed that inversions were rare 
and based on very few observations. 
By contrast, when Ss identified cues 
as different (I;,,1j,»-1), 15 out of 16 
sets of four conditionals were inverted, 
either asymptotically or preasymptoti- 
cally. These inversions were striking 
enough to confirm our earlier impres- 
sion that inversions are most likely 
to occur in those situations in which Ss 
can identify at least two types of 
trials and specifically on those trials 
which are identified as being of a 
different type from the preceding one. 

But what process or mechanism 
underlies the occurrence of inversions? 
The data suggest one answer: Out of 
a total of 64 inversions, all but 8 
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involve P (A;| A;E;) or P(A;|A2E2). It 
would therefore appear that Ss tend 
to shift their event prediction response 
(from A» to A; in these cases) when- 
ever they shift their identification 
response from that of the preceding 
trial. This shift bias can evidently 
override the immediate effects of 
reinforcement. The usual name given 
to this phenomenon is response 
generalization or induction. The fact 
that this bias was as strong, if not 
stronger, in preasymptotic as in the 
asymptotic data suggests that it might 
either be brought into the situation 
by S or else instilled by way of in- 
structions. It remains to be deter- 
mined whether the shift bias can be 
reduced or eliminated with different 
instructions, feedback for overt identi- 
fications, or more extended training. 

Like any other response, cue identi- 
fication should be influenced to some 
extent by reinforcement contingencies 
in the situation. Therefore, an analysis 
was made on the observed first order 
conditional probabilities of correct 
identification for that portion of the 
data of Group IR where I; = Ij, i. 
In 13 out of 16 cases, the probability 
Of correct identification was on the 
average higher following correct (A1E; 
and A;E;) than incorrect (AEs and 
A&E:) event predictions. Thus, posi- 
tive reinforcement, being correct, 
tended to increase the probability of 
retaining the identification response of 
the preceding trial; and negative 
reinforcement, being wrong, tended to 
increase the probability of a shift of 
identification response. An analysis 
was made also on that portion of the 
data from Group IR in which I;,, 
~ Ij,n-1. Once again, negative rein- 
forcement tended to increase the 
probability of a shift in identification 
response, and thus 13 out of 16 cases 
showed a higher average probability of 
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a correct identification following in- 
correct event predictions. ` $ 
Atkinson's theoretical writings have 
anticipated many of the kinds of 
findings reported here (Atkinson, 
1958; 1960). One direction for further 
development of an adequate model 
for discrimination learning within 
the framework of stimulus sampling 
theory could take the multiprocess 
observing response model as its start- 
ing point (cf. Atkinson & Estes, 1963). 
Such a model would have greatest 
success in predicting inverted condi- 
tionals if it could incorporate a shift 
bias or response generalization process 
acting between identification or ob- 
serving responses on the one hand 
and event prediction on the other. 


Experiment II 


Marginal statistics —Figure 2 in- 
dicates the significant main effects and 
interactions plotted at asymptote 
(defined as the last block of 100 trials). 
Discrimination performance P(A,|S,) 
— P(A|S;, F (1, 180) = 230.52, 
p < 001, increased.as the difference 
between mı and ms increased, F (2, 
180) = 73.79, p < .001, Discrimina- 
tion performance also increased as the 
intensity differential AZ between S; 
and S, increased, F (2, 180) = 14.28, 
P <.001; and this increase in dis- 
crimination was positively related to 
the difference between mı and 72, 
F (4,180) = 5.78, p < .001. 

Figure 2 shows that at AI —1.5 db., 
P(Ai|S)) depended on r: such that 
P(Ai|S;) decreased with decreases in 
7» A trend analysis (orthogonal 
polynomials) of P(A1|S$1) at asymp- 
tote as a function of m; yielded a 
significant linear component at AI 
—1.5 db., F (1, 180) 2124.5, p<.001. 
In contrast, no components of the 
trend were significant at A/—3 db. 
It thus appears that P(Ai|S1) was 
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Fic. 2. Asymptotic proportion of Ai responses 
to Sı and S; as a function of AJ and r2. 


independent of r and, as seen in Fig. 
2, did not deviate from probability 
matching. However, the significant 
quadratic component seen in Fig. 2 
at AI = 6 db., F (1, 180) = 12.32, 
p «.001, indicates that an even 
larger increase in AJ reinstated the 
dependence of P (A;|S:) on 72. 

The analysis of variance of identi- 
fication responses in Group IR re- 
vealed that correctness of identifica- 
tion increased with increases in in- 
tensity -differential AZ, F (2, 90) 
= 50.10, p «.001, and that the 
number of correct identifications in- 
creased with practice, F (2, 180) 
= 15.39, p < .001. The mean per- 
centage of correct identifications for 
AI = 1.5, 3, and 6 db. were 63%, 
79%, and 93%, respectively. A signi- 
ficant m X Trail Blocks effect, F (4, 
180) = 3.39, p < .01, revealed that 
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Ss improved in correct identification 
from the first trial block to the third 
in Group .8-.2 (77 to 80%) and 
Group .8-.5 (74 to 80%), but that no 
improvement was found in Group 
8-.8 (77 to 77%). These results 
suggest that the discriminability of 
the schedules in Group .8-.2 and 
Group .8-.5 enhanced the distinctive- 
ness of S, and S» such that these Ss 
were able to improve identification 
performance over training. 

Sequential statistics.—The results of 
Exp. I showed that the reinforcement 
effects found when the same cue is 
presented on successive trials are 
either washed out or inverted when 
different cues are presented on suc- 
cessive trials. Therefore, a sequential 
statistic was desired which would 
permit comparison of the reinforce- 
ment effects on same and different 
stimulus trials. To accomplish this, 
the dependent measure chosen for the 
analysis of variance of first order 
conditional probabilities was 


P(A1,n|Si,n91,n—1E1,n—1) 
ai, P(Ai,n|Si,n) 


where i = 1, 2. This statistic provided 
an index of the first order conditional 
effects of SiE: on P(A1|Si) and on 
P (A;|S3) independent of the marginal 
probabilities. If S;E; had no reinforce- 
ment effect on P(A;), the expected 
value of this index would be zero 
since the conditional probability 
would be equal to the marginal 
probability. A positive reinforcement 
effect would be reflected by a positive 
value and a negative reinforcement 
effect by a negative value. This 
statistic was computed for each S at 
each 100 trial block. Table 1 presents 
the mean values of this index for each 
AI group pooled over trial blocks. 
Table 1 shows that the reinforce- 
ment effect increased for P(Ai|S;) 
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TABLE 1 


MEAN Cae eae 
— P(A1,n|Si,n)_] PooLeD OVER 
LEVELS OF v AND TRIAL 
Biocks AS A FUNCTION 
or AI AND CUE 


AI in db. 


Cue 
15 3 6 


Sı .04029 .04326 05917 
Se 02124 .00788 .00312 


and decreased for P(A;|S;)) as the 
intensity differential A7 between S; 
and Se increased, F (2, 180) = 6.0, 
p «.01. This result shows that the 
reinforcement effect of Sını and 
Ei,n-1 on P(Ain) was dependent 
upon the similarity of S; and S; 
trials. A larger positive reinforce- 
ment effect on P(Ai|S) than on 
P(A;|S;) was found in all r groups, 
F (1, 180) = 55.22, p < .001. The 
significant interaction of - X Cue 
X Trial Blocks, F (4, 360) — 4.83, 
p «.001, indicated that the rein- 
forcement effect on P(A;|S;) de- 
creased over training in all m groups, 
whereas the reinforcement effect on 
P (A1|S:) increased in Group .8-.8 and 
Group .8-.5 but decreased to a nega- 


TABLE 2 


OBSERVED First ORDER CONDITIONAL 
PROBABILITIES OF THE ForM P(Ai,.| 
Sinlt,nlin-1Aj,n-1Ek,n-1) FOR GROUP 
.8-.2 POOLED OVER LEVELS OF 
AI AND TRraL BLocks 


Ti, n-i Ajn-i | Ek, n-i 

1 " 1 -8687 | (1417. 
1 2 .6832 (9 
2 1 .6285 393) 
2 2 .5348 (316) 

2 1 1 4938 ni 
1 2 .6923 (221 
2 1 .8302 (377) 
2 2 .9484 | (1162) 


Note.—Entries in parentheses are the number of 
cases contributing to the denominators of each condi- 
tional probability. 
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tive reinforcement effect in Group .8-.2 
This result suggests that inversions 
would probably be found in Group 
.8-.2 when Iin Æ ljn- It is clear 
from Table 2 that the rank-ordering 
of the first order conditional probabili- 
ties of P(A1|Sili)) was inverted when 
the previous trial was identified as 
different (I; However, when the 
previous trial was identified as the 
same, the rank-ordering of the condi- 
tionals was exactly opposite that 
found when the previous trial was 
identified as different. 


Discussion 


The findings that (a) P(A,|Si) did 
not become independent of m, with in- 
creases in correct cue identification, 
(b) overshooting appeared in Group 
.8-.8 at 1.5 and 6 db. and in Group .8-.2 
at 6 db., and (c) first order conditional 
probabilities were inverted in Group 
.8—.2 when Ij, Æ Ij, 1 appear to pro- 
vide strong evidence against extant re- 
inforcement models of discrimination 
learning. 

The fact that probability matching did 
occur in all groups at 3-db. cue separa- 
tion, where Ss in Group IR identified the 
tones correctly 79% of the time, suggests 
that in order to predict probability 
matching, models of discrimination learn- 
ing require some degree of confusability 
between the two stimuli. It may be 
that previous probability matching re- 
sults (e.g., Estes, Burke, Atkinson, & 
Frankmann, 1957) found in two-stimulus 
probability learning were actually the 
result of generalization due to cue simi- 
larity. The fact that Group .8-.2 ex- 
ceeded matching at the 6 db. separation 
suggests that maximizing may occur 
under certain reinforcement schedules 
when Ss are correctly identifying the two 
cues most of the time. By contrast, in- 
creasing the intensity differential from 
3 to 6 db., thereby increasing correct cue 
identification, lowered  P(A4i|S;) in 
Group .8-.5. This difference might be 
explained by the fact that although in- 
creasing cue identification in Group .8-.5 
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increases the validity of S, as a cue for 
an appropriate response, it decreases the 
validity of Sə» Hence it appears that a 
discriminable stimulus with no cue value 
(i.e., one reinforced on a 50-50 basis) has 
a depression effect on P(A,|S:) com- 
pared to a stimulus with cue value (S; in 
Group .8-.2). 

The fact that the rank-ordering of the 
first order conditional probabilities of 
Group .8-.2, when a trial was identified 
as the same as the previous trial, was 
exactly the opposite of the rank-ordering, 
when the present trial was identified as 
different, suggests another reinforcement 
mechanism operating on trials identified 
as differing from the previous trial. The 
results of Exp. I have shown that in- 
versions of the type  P(Ai|AsE;) 
> P(Ai||AiE)) and P(A,|A:E2) 
> P(A|| A,E;) may be due to the fact 
that Ss tend to shift their prediction re- 
sponses when they shift their identifica- 
tion response from that of the previous 
trial because of response generalization. 
But this added process cannot account 
for the details of the rank-ordering of the 
conditionals found in Group .8-.2 in 
Exp. II (cf. Table 2). That is, even 
with an added response generalization 
process, reinforcement theories as pres- 
ently defined would predict P(A;|A;E;) 
> P(Ai|AiEz) and P(Ai| AsE:) 
> P(A:|A2E:). 

In probability learning research E; is 
usually identified arbitrarily as the left 
hand light and E» as the right hand light 
or vice versa. However, one can take the 
view of Spence (1960) that discrimina- 
tion learning is a form of nonspatial 
selective learning in which S learns to 
behave in relation to some particular set 
of discriminanda (stimuli). That is, S 
is rewarded for behaving appropriately 
to a distinctive cue rather than for 
making a specific motor response. 
Therefore, in the two-stimulus situation 
E; can be redefined as the most frequent 
event following that stimulus? An A; 
response now refers to the most appro- 
priate response in the sense of having the 
highest likelihood of being correct on that 


3 This redefinition of E; was suggested by 
Jerome L. Myers. 
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particular stimulus trial. The reinforce- 
ment models now predict that if the ap- 
propriate response is reinforced on the 
previous trial, S will tend to make the ap- 
propriate prediction on the present trial 
even if he identifies the present trial as 
different. The results clearly show that 
if S identified the present trial as dif- 

ferent from the previous one, the proba- 
bility of making an appropriate response 
by predicting the most frequent event 
was highest when the appropriate re- 

sponse was reinforced and lowest when 

the inappropriate, or least frequent, 

event prediction was reinforced on the 

previous trial. When S identified the 

present trial as the same, either definition 

of E; predicts the results. For example, 

if the right hand light is the least fre- 

quent event and S is reinforced for 
making a right hand prediction, he is- 
also reinforced for making an inappro- 

priate response; and, therefore, he will 

be more likely to make the same response 

on the following trial if the two trials are 

identified as the same. But if the trial 

is identified as different, S will still be 

more likely to make the inappropriate 

response, which means a response that is 

physically opposite from the response on 

the previous trial. Therefore, conceiv- 

ing S as being reinforced, not for a par- 

ticular (right or left) event prediction, 

but for an appropriate (most frequently 

correct) or inappropriate response gives 

conditional probabilities predicted by 

reinforcement models. 
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CONDITIONING SITUATION 
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In this study a refutation is offered to Razran’s well-known prediction 
that the amount of conditioning is a function of the intensity of the 
CS. A diffculty in testing Razran’s statement is how to distinguish 
in experimental operations the learning component and the performance 
component of a GSR. In this experiment these components were 
recorded separately in a delayed GSR-conditioning procedure, with a 
CS of 10 sec. The results showed that the conditioned anticipatory 
responses (latency 3-11 sec.) did not vary with CS intensity; how- 
ever, sensitization responses, occurring with shorter latencies, showed 
the expected intensity effect. In the data treatment, GSRs were 
expressed as Alog C only. If however GSRs, expressed as Alog C 
were corrected for conductance changes on adaptation trials, dis- 
similar results were to be expected. It was shown, that such methods 
of correction are inappropriate in comparisons of effects of different 


stimulus-intensity conditions. 


These results are in agreement with 


those of Grant and Schneider (1949). 


The objective of this study is to re- 
fute Razran's (1957) well known pre- 
diction that the amount of conditioning 
is a function of the intensity of the CS. 
Kimmel (1959) has presented positive 
evidence with regard to Razran's state- 
ment; however, his experimental pro- 
cedure deserves critical comment (see 
discussion below). Several authors 
(e.g., Grant & Schneider, 1949; Kim- 
ble, 1964) have recognized that part of 
the problem in testing Razran's state- 
ment is the uncertainty as to whether 
the influence of CS intensity is on the 
learning component or on.the perform- 
ance component of the CR. Grant and 
Schneider (1949) demonstrated in a 
factorial design, that performance var- 
ied systematically with CS intensity, 
but that learning did not. A further 
clarification of this issue for GSR con- 
ditioning can be attained if the two 
components of the conditioned GSR can 
be recorded separately. This is pos- 
sible by means of a delayed condition- 
ing procedure. 


By using a relatively long CS (6.5 
sec.) Stewart, Stern, Winokur, and 
Fredman (1961) were able to dis- 
tinguish three types of responses, viz: 
(a) orienting responses (ORs) evoked 
by the CS with a mean latency of 2.5 
sec. from onset of CS, (b) uncondi- 
tioned responses (UCRs) evoked by the 
UCS with a mean latency of 9.6 sec. 
from onset of CS, and (c) responses 
with a mean latency of 6.3 sec. from 
onset of CS. Responses of type c are 
considered "conditioned anticipatory 
responses” (CARs), and responses of 
type a, sensitization responses (SRs), 
aré responses to the CS during acquisi- 
tion after this OR has been extin- 
guished during a preceding adaptation 
period. (The OR often occurs again 
in conditioning trials.) 

Lockhart and Grings (1964) and 
Kimmel (1964) criticized the results 
of Stewart et al. (1961) because of the 
absence of a proper control for pseudo- 
conditioning. These criticisms how- 
ever have been answered by Leonard 
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and Winokur (1963) and by McDon- 
ald and Johnson (1965) who applied 
an adequate control for pseudocondi- 
tioning and still obtained the same re- 
sults. 

In the present experiment it is shown 
that CARs do not vary with CS in- 
tensity and that SRs do vary with CS 
intensity. The design of the experi- 
ment is such that it can also explain 
Kimmel's (1964) positive results. 


METHOD 


Subjects.—The Ss were 40 freshmen (22 
males and 18 females) who were divided at 
random into two groups, each group receiving 
different CS intensities. 

Procedure—The S was seated in a com- 
fortable chair in a soundproof room, sepa- 
rated from E and registration and stimulus 
equipment. He was told that he had nothing 
to do but to sit absolutely quiet and that he 
could expect a tone or a shock from time to 
time, After the electrodes had been at- 


tached the experiment started with a short 
period of rest during which time the skin 
conductance level was stabilized. This was 
followed by 10 CS presentations without a 
period of shock adaptation. For half of the 
Ss (20) the CS was a tone of 500 cps and 40 
db. For the other half the CS was a tone 
of 500 cps and 80 db. Each tone lasted 10 
sec. The intertrial interval was either 60 or 
90 sec. determined at random. Without in- 
terruption this was followed by the period of 
acquisition, consisting of 16 trials. Each trial 
was composed of the CS for 10 sec. and the 
UCS (a shock) for .5 sec. The UCS fol- 
lowed immediately after termination of the 
CS. The intertrial interval was again 60 
or 90 sec., determined at random. 
Deflections of 1 mm. or greater, which 
occurred from 1.3 to 2.9 sec. after onset of 
the CS were categorized as SRs, whereas 
deflections of 1 mm. or greater, occurring 
3.0 to 11 sec. after onset of the CS were 
categorized as CARs. A UCR was defined 
as every deflection of 1 mm. or greater 
which occurred between 11 and 14 sec. after 
onset of the CS. Data treatment was car- 
ried out in two different ways: (a) all GSRs 


G.S.R — amplitude 


Trials in blocks of two 


Fic. 1. Results according to the first method of calculation. 


(Figure la. represents 


the sensitization responses and Fig. 1b. the conditioned anticipatory responses.) 


` 


CS INTENSITY AND GSR CONDITIONING 


were expressed as Alog C arid (b) the mean 
Alog C of the first three responses for every 
S during the period of adaptation was de- 
fined as unity; all the other responses were 
expressed in this unit. 

The second treatment is very similar to 
Kimmel's (1959) method. Kimmel used as 
CR measure: 


3 -s 
RENA 
vac, me eS 


AC. = conductance change following a 
CS during acquisition. AC, = conductance 
change following either the first, the second, 
or the third CS presentation during adapta- 
tion. 

Part of the rationale for the second treat- 
ment is to reduce individual differences in 
GSR responsivity. It should be further 
noted, that Kimmel’s CS had a 4-sec. dura- 
tion and that he took as the CR the largest 
deflection with a period of 5 sec. from onset 
of CS. The authors’ second method of cal- 
culation has the advantage of avoiding nega- 
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tive values; it has also been used in earlier 
experiments (van Olst & Orlebeke, 1965; 
van Olst et al., 1966). 

Apparatus—The tones were produced by 
a Farnell frequency generator and presented 
binaurally through earphones. Electric 
shocks were produced by a constant-current- 
shock generator and delivered to S through 
the fore- and middlefinger of the right 
hand by means of silver-nickel electrodes. 
Duration of stimuli and time intervals were 
controlled by an electronic timer. Two con- 
tinuous records of skin resistance were ob- 
tained by means of a Wheatstone bridge 
(Wb.). The output of the bridge was fed 
to two parallel dc differential input amplifiers 
of a Hellige polygraph. For the recording 
of the EDR one amplifier had been connected 
with the Wb. via a capacitor (time constant 
42 sec.). The purpose of the second ampli- 
fier was to make it possible to read very 
great deflections, and to obserye long run 
changes in skin resistance. Two zinc elec- 
trodes were used. The positive electrode 
(3.8 cm.*) was attached to the palm of the 
left hand by way of a piece of foam plastic 


G.S.R.—amplitude 


1 c3 016.75. 6 7. $ 
Trials in blocks of two 


Fic. 2. Results according to the second method of calculation. 
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with the same area, soaked in a 2.4% ZnSO. 
solution. The negative electrode (43 cm.) 
was attached to the inner side of the wrist in 
the same way. 


RESULTS 


In Fig. 1 data are represented ac- 
cording to the first method of calcula- 
tion. Figure la gives the SRs and Fig. 
1b the CARs. The difference between 
the two experimental conditions for the 
SRs is significant, (Mann-Whitney U 
test) .U = 13, p = .025. 

For the CARs no difference between 
the experimental conditions can be 
found, U = 31, p=.48. Apparently 
CS intensity influences the SRs mag- 
nitude, but not that of the CAR. 

In Fig. 2 the same results are shown, 
however, following the second method 
of treatment which is similar to Kim- 
mel’s. It is noted that now the SRs 
for the 80-db. CS condition are smaller 
than for the 40-db. condition. In this 
case however significant differences 
exist between the experimental condi- 
tions for both SRs and CARs, U = 6, 
p < .002; U —7, p < 003. 


DISCUSSION 


Following the simplest method of data 
treatment, these experimental findings 
contradict Razran’s statement to the effect 
that the amount of conditioning is a func- 
tion of CS intensity. With this method 
the authors also find a significant differ- 
ence between SRs during acquisition in 
the expected direction. 

The results presented in Fig. 2, both 
for SRs and CARs, are artifacts of a 
method of calculation which is inappro- 
priate for this design. With Treatment 
Method 2, conductance changes in the 
40-db. acquisition series are corrected to 
a much smaller extent by AC, (i.e. ACa 
in the adaptation trials with stimulus in- 
tensity of 40 db.) than the conductance 
changes in the 80-db. acquisition series. 
The latter are corrected by AC, of adapta- 
tion trials with stimulus intensity of 80 db. 
These responses during the adaptation 


trials are similar to orientation responses 
and have an increasing monotonic relation 
to stimulus intensity. Such a method of 
calculation therefore tends to diminish the 
response to the stronger stimulus, thus 
leading to results similar to those in 
Fig. 2. 

In these studies with a delayed GSR 
conditioning procedure, the experimental 
operations are such that the components 
can be categorized separately by way of 
the latency criterion. In this situation it 
becomes clear that the CAR proper does 
not show the intensity effect and that it 
is unwarranted to correct the learned re- 
sponse for an undesired SR component. 

In other procedures, where this dis- 
tinction cannot be made, corrections by 
way of formulas such as Kimmel's for- 
mula and our second treatment are per- 
mitted only if the independent S-condi- 
tions are not related to the strength of 
the adaptation response. In line with 
this reasoning, Kimmel’s (1959) con- 
firmation of Razran's statement should be 
dismissed. 

Also Kimmel used a CS of 4 sec. and 
measured as CR the greatest deflection 
within a period of 5 sec. from onset of 
the CS. It seems probable in the light 
of the later findings of Stewart et al. 
(1961) and of several other authors, in- 
cluding the present authors, that part of 
his measured deflections concerned SRs 
instead of CARs. 

The data as presented in Fig. 1 are in 
agreement with the results of Grant and 
Schneider (1949). 
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LEARNING CONCEPTUAL RULES. 


II: THE ROLE OF POSITIVE AND NEGATIVE INSTANCES! 


LYLE E. BOURNE, JR. AND DONALD E. GUY 
University of Colorado 


Ss solved attribute identification (rule given, stimulus attributes un- 
known) or rule learning (attributes given, rule unknown) problems on 
the basis of information provided by positive instances only, negative 
instances only, or a mixture of positive and negative instances. For 
different Ss, the conceptual rule was conjunctive, disjunctive, or con- 
ditional. In attribute identification, Ss performed best when the infor- 
mational instances were selected from the smaller class, positive or nega- 
tive, depending on the rule and worst when they were selected from the 
larger class. In rule learning, Ss performed best on all rules when the 
greatest variety of instances (mixture of positive and negative) was 


presented. 


Available evidence indicates that 
positive instances are more effective 
than negative instances as usable 
sources of solution-relevant informa- 
tion in conceptual problems (Smoke, 
1933) even when the two are equated 
logically for informational content 
(Hovland & Weiss, 1953). While S 
can be trained to solve problems effi- 
ciently, using only negative instances 
(Friebergs & Tulving, 1961), this 
generalization seems valid for all but 
very sophisticated problem solvers 
(Bourne, 1966). 

Most of the data relevant to this 
proposition have been collected in con- 
cept identification tasks for which the 
solution was a conjunction of attri- 
butes. Ina conjunction, the category 
of positive instances, i.e., stimuli with 
both or all relevant attributes, is 
smaller and less variable than the cat- 
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egory of negative instances, i.e., stim- 
uli which lack one or more relevant 
attributes. The relative homogene- 
ity of the positive category increases, 
moreover, with the number of values 
within the relevant dimensions. Cat- 
egory homogeneity seems likely to be 
correlated with the usability of in- 
stances, if only because S has fewer 
unique instances to remember and to 
associate (with the response category) 
or from which to make inferences. 
Thus, in solving a conjunctive con- 
cept, S might be expected to find posi- 
tive instances easier to use than nega- 
tive instances, and relatively more so 
as intradimensional variability (Bat- 
tig & Bourne, 1961) increases. Simi- 
lar conclusions, based on the common 
occurrence of a positive focusing 
strategy, have been reached by 
Bruner, Goodnow, and Austin (1956) 
and by Hunt (1962). 

For rules other than conjunction 
(e.g., disjunction, "and/or," and con- 
ditional, “if, then") the category of 
negative instances can be more homo- 
geneous than the category of positive 
instances. In these cases, contrary 
to the usual finding, negative instances 
might lead to more rapid learning or 
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problem solving. To check on this 
possibility and to test category homo- 
geneity as a factor contributing to the 
general effect, concepts based on a 
variety of rules were studied in the 
present experiment. The Ss were re- 
quired to learn these concepts, given 
only positive, only negative, or a mix- 
ture of positive and negative in- 
stances. It seemed reasonable at the 
same time to investigate another un- 
explored variable—type of conceptual 
problem. In earlier work on positive 
and negative instances, the task for .S 
uniformly has been to identify the un- 
known attributes of a concept. 
While there appear to be no theoreti- 
cal or empirical bases for expecting 
differences, the present experiment in- 
cluded both the more common attri- 
bute identification task and a rule 
learning task (Bourne, 1967; Bourne 
& Guy, 1968; Haygood & Bourne 
1965), in which S must discover the 
relationship (rule) between two given 
relevant attributes. 


METHOD 


Subjects and design ——The Ss were 216 
undergraduate college students who volun- 
teered for participation. They were assigned 
randomly but in equal number to 36 experi- 
mental conditions prescribed by a 2 (Task 
type: rule-learning—RL, or attribute identifi- 
cation—AI) X 3 (Rule: conjunctive, disjunc- 
tive, or conditional) X 3 (Training instances: 
positive, negative, or a mixture of both) X 2 
(Problem: A—red-square, or B—one-large) 
factorial design. Each S served individually, 
in a single session lasting about 1 hr. 

Task and apparatus.—The task required Ss 
to classify a series of stimulus patterns 
(geometrical designs) into two categories, the 
positive and negative instances of some un- 
known concept. All Ss were told that the 
patterns would vary along four dimensions, 
each with three levels or attributes: size 
(large-medium-small), color (red-yellow-blue), 
form (square-triangle-hexagon), and number 
(one-two-three figures). Each S was also 
informed that, for any given problem, two 
of the dimensions would be relevant and two 
irrelevant; thus all concepts were bidimen- 
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sional, specified by two relevant attributes 
and a rule. 

In a preliminary demonstration, all .Ss 
received instructions regarding a truth-table 
system for coding stimulus patterns (Bourne, 
1967; Bourne & Guy, 1968). A cardboard 
box, divided into four compartments (two 
rows of two compartments each), was placed 
on a table before S. The compartments 
represented four classes of stimuli prescribed 
by the presence or absence of two stimulus 
attributes. These classes were comprised of 
stimulus patterns with both (TT), one but 
not the other (TF and FT) and neither (FF) 
of the two attributes. Given the names of 
two attributes (e.g., redness and squareness), 
S was required to sort 4 X 6-in. cards portray- 
ing geometrical designs into the four com- 
partments, Three problems (different pairs 
of relevant attributes) of this type were 
administered and S achieved a criterion of 10 
consecutively correct responses on each. 
When this series was completed, S was shown 
how the four categories of the truth-table 
might be collapsed into two, i.e., positive 
(examples) or negative (nonexamples) in- 
stances of a concept. Truth-table pretrain- 
ing seemed to facilitate both rule learning and 
attribute identification (Bourne, 1967; Hay- 
good & Kiehlbauch, 1965). It was used 
here to insure relatively efficient problem 
solving. If anything, given the results of 
Freibergs and Tulving (1961), its effect should 
be to reduce performance differences between 
groups using positive and negative instances. 

During the next phase of the experiment, 
S received instructions either about AI or 
about RL tasks, depending upon his condition. 
In Cond. Al, one of three rules (conjunctive, 
disjunctive, or conditional) was explained to 
Sand he was told that the task was to discover 
the two relevant attributes combined by that 
rule. In Cond. RL, the two relevant attri- 
butes were given and S was told that he had 
to discover a rule (one of the three mentioned 
above) combining those attributes. For both 
conditions, the problem began with the pre- 
sentation of eight stimulus patterns—all posi- 
tive, all negative, or a mixture of both, 
depending upon the instance-condition to 
which S was assigned. Following these eight 
patterns (called a training series), four test 
patterns were presented (randomly ordered 
and unknown to S, one from each of the four 
truth-table classes) and S had to make a 
category response for each. No feedback 
was given regarding the correctness of these 
responses. Training (8) and test-trial (4) 
series alternated until 12 successive test 
stimuli were identified correctly by S. It was 
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stressed throughout, regardless of the com- 
position of the training series, that both kinds 
of patterns (positive and negative) would be 
presented during the test series. Each train- 
ing series contained more than enough infor- 
mation logically to solve the problem, Pat- 
terns within each series were selected from 
all stimulus classes, limited only by .S's 
instance-condition. 

A special card (in Cond. AI) illustrating 
how the four classes were assigned to two 
categories for the particular rule to be used 
(Rule card) or (in Cond. RL) listing the pair 
of relevant attributes to be used (Attribute 
card) was exposed for S's inspection through- 
out the experiment. Table 1 shows the 
assignment of stimulus classes to response 
categories for each of the rules used here and 
summarizes the information given to Ss in 
Cond. RL. 

For both the training and test series, 
stimulus patterns were prepared as 2 X 2-in. 
slides. They were presented serially, pro- 
jected to the rear of a translucent viewing 
screen. The presentation of stimulus pat- 
terns during the eight training series was 
controlled by E. The S observed the eight 
training stimulus patterns without any type 
of overt performance; E designated each 
stimulus, positive or negative, as appropriate. 
The S was required to respond to each of the 
four test stimuli (after a self-paced inspection 
interval) by pressing one of two buttons 
mounted in a control panel below the screen. 
The two buttons were labeled “Yes” (posi- 
tive) and “No” (negative) to indicate whether 
a pattern should be classified as an example 
or nonexample of the concept, respectively. 
Immediately after each response, the pattern 
was removed (screen blank) and a new pattern 
appeared after 2 sec. The presentation of 
the four trial test series and the recording 
functions were controlled electronically with 


an apparatus described elsewhere (Bourne & 
Haygood, 1959). 


RESULTS 


Overall measures—The mean num- 
bers of test trials and errors on test 
trials for each of the 18 main condi- 
tions of the experiment are reported 
in Table 2. The results of analyses of 
variance on the two measures were 
identical and demonstrated statis- 
tically reliable effects (p < .01) for 
all main variables. The F-ratios for 
trials only are reported here. First, 
AI tasks were more difficult than RL 
tasks, F (1, 180) = 21.15, a result 
which is consistent with Haygood and 
Bourne (1965). Second, rules dif- 
fered in difficulty, F (2, 180) — 15.12, 
with conjunction easiest and condi- 
tional most difficult; the order is the 
same as that reported earlier (Hay- 
good & Bourne, 1965; Hunt, Marin, 
& Stone, 1966). Third, performance 
was affected by the types of instances 
presented during the training series, 
F (2, 180) — 12.68, mixed positive 
and negative instances being associ- 
ated with fewest and all negative with 
most trials to solution. Fourth, a re- 
liable problem difference obtained, F 
(1, 180) — 6.98, showing that the at- 
tributes one and large were more diffi- 
cult than the attributes red and square. 
Interactions between (a) type of task 


TABLE 1 


ASSIGNMENTS OF STIMULUS CLASSES TO RESPONSE CATEGORIES (POSITIVE 
AND NEGATIVE) FoR EACH OF THE RULES 


"Sein ee Se Conjunctive | Disjunctive | Conditional 
RS RS TT + + 
RTr, RH RS TF E " iB 
YS, BS RS FT is Hi in 
YTr, YH, RS FF E us + 
BTr, BH 


Note: The following abbreviations are used: T, true (present); F, false (absent); R, red; Y, yellow; B, blue; 
S, square; Tr, triangle; H, hexagon; +, positive instance; —, negative instance. 
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TABLE 2 


MEAN NUMBER oF TRIALS (T) AND Errors (E) to SOLUTION IN ATTRIBUTE 
IDENTIFICATION (AI) AND RULE LEARNING (RL), Test Trrats ONLY 


Rule 
Conjunction Disjunction Conditional 
AI RL AI RL AI RL 
7, E uy E aU E Im E T E T E 
Positive | 3.92| 1.00| 6.58 | 1.75 | 12.92 | 4.75 | 4.67 -58 | 28.83 | 10.58 | 20.50 | 8.42 
Negative | 46.25 | 15.17 | 3.00 | 1.00 | 19.75 | 7.75 | 15.33 5.83 |16.92| 6.83 | 23.33] 9.83 
Mixture | 6.08| 1.33| .92| .25 |16.50| 4.25 | 2.42 -83 |17.92| 6.25 | 17.08 | 5.33 


and rule, F (2, 180) = 7.06; (b) rule 
and type of training instances, F 
(4, 180) = 7.25; and (c) type of task, 
rule, and type of training instances, 
F (4, 180) = 10.24, were all signifi- 
cant, p < .01. 

The pattern of these results is com- 
plex. The data indicate that, in gen- 
eral, the efficiency and correctness 
with which Ss use information con- 
tained in positive and negative in- 
stances depends on task requirements 
(AI vs. RL) and the relevant rule. In 
AI tasks, performance with positive 
instances only is in one case (conjunc- 
tive) best and in another (conditional) 
worst. The Ss have considerably 
more difficulty with negative instances 
when the concept is conjunctive than 
when it is disjunctive or conditional. 
A mixture of positive and negative in- 
stances, in all cases, falls between the 
all positive and all negative con- 
ditions. The effects of training-in- 
stance condition on performance in RL 
problems are somewhat smaller than 
in the corresponding AI problems. 
The existing differences indicate that 
negative instances are associated with 
better performance on conjunctive 
problems and positive instances with 
better performance on disjunctives 
and conditionals. A mixture of posi- 
tive and negative instances yields the 


fewest mean trials to solution for all 
rules. 

Stimulus classes—Major differ- 
ences in performance were associated 
with the four stimulus classes. These 
differences were functions primarily 
of the rule (which prescribes the 
unique assignment of classes to re- 
sponse categories) and the type of 
training instances. While there were 
gross overall numerical differences, the 
pattern was essentially the same in AI 
and RL tasks. Table 3 shows for 
each instance condition the mean 


TABLE 3 


MEAN NUMBER OF INSTANCES OF EACH 
SriMULUs CLass Prior TO Last 
Error on THAT CLass, 

Test TRIALS ONLY 


Rule 
Stimulus | Conjunction | Disjunction | Conditional 
Class 101 is nction On i01 
AI | RL | at | RL | ar | rv 
Positive 
TT 25 -00 | 1.17 «00 | 2.42 | 2.42 
TF -33 | 1.83 | 1.92 | 1.17 | 6.67 | 1.92 
FT 5 .83 | 2.83 «50 | 4.42 | 5.08 
FF 42 «08 | 2.50 | 1.17 | 5.92 | 5.00 
Negative 
TT 10.08 42 | 3.33 | 1.67 | 3.25 | 2.33 
TF 7.83 442 | 3.25 | 2.92 | 2.25 | 5.50 
FT 7.42 -50 | 4.83 | 3.42 | 3.25 | 3.33 
FF 7.25 +08 | 2.67 | 1.92 | 2:17 | 4:67 
Mixture 
IET. 1.67 -08 | 1.58 -08 | 3.58 | 2.42 
TF 67 -00 | 3.58 -08 | 4.50 | 3.08 
FT 400 .00 | 3.08 58 | 2.92 | 2.50 
FF A7 .25 | 3.08 47 | 2.17 | 3.75 


492 LYLE E. BOURNE, JR. 


number of examples of each class pre- 
sented prior to S's last error on that 
class. 

In general and as might be ex- 
pected, performance is best on those 
instances which are used in the train- 
ing series. But these results also 
clarify some of the peculiar sources 
of problem difficulty. For example, 
when Ss are required to identify the 
two attributes of a conjunctive con- 
cept from negative instances only, 
errors are made persistently on all 
four stimulus classes, and especially 
on TT instances (positive instances), 
ie., those containing both relevant 
attributes. In this type of problem, 
negative instances are difficult to 
make inferences from (or to base hy- 
potheses on) probably because they 
are relatively numerous and hetero- 
geneous. In the stimulus population 
used here, distinctly different nega- 
tive instances outnumber positive in- 
stances 8:1. A similar but not so ob- 
vious pattern emerges from AI prob- 
lems based on other rules: negatives 
are fewer in number for disjunctions 
and conditionals (the ratios being 4:5 
and 2:7, respectively) and perfor- 
mance with all negative instances is 
considerably better in both cases than 
itisforconjunctions. Moreover when 
S is presented with only positive in- 
stances of these rules, performance is 
worse overall and is especially poor on 
test trial negative instances. 

Recall that for conditional prob- 
lems only the TF instances are nega- 
tive; the (usually) symmetric counter- 
part of that class, FT, and FF in- 
stances are positive along with Class 
TT. It has been shown that learn- 
ing the proper response assignments of 
TF and FF instances are the major 
difficulties associated with this rule 
(Bourne & Guy, 1968). Ordinarily, 
Ss learn that FT instances are positive 
relatively early in an RL problem, 
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and then accumulate errors by placing 
TF instances in the same category. 
The results in this experiment help to 
elucidate that effect. When Ss are 
trained on all negative instances (TF), 
errors tend to be made on members 
of the FT class; when Ss are trained 
on positive instances (including FT), 
errors are relatively greater for TF. 
This outcome suggests a strong tend- 
ency for Ss to treat TF and FT in- 
stances as a single stimulus class 
(stimuli with one but not both rele- 
vant attributes) and to place them 
in the same response category. That 
category (positive or negative) is 
determined in the present case by the 
(prior) instances presented during 
training series. Class FF is difficult 
under any condition in this and in 
earlier experiments largely (as Ss al- 
most uniformly report) because of the 
unnaturalness (incompatibility with 
preexperimental experience) of as- 
signing FF and TT instances to the 
same response category. 


DISCUSSION 


Itisa generally accepted principle that 
positive instances somehow provide more 
information or are easier to use than 
negative instances in solving conceptual 
problems, at least for naive Ss. While 
that generalization is useful as a rule of 
thumb, clearly it must be qualified to 
take account of the type of rule and the 
givens and unknowns in the problem. 

The results of this experiment suggest a 
more detailed description of these effects. 
When the task is to identify the (two) 
relevant attributes of a concept (rule 
given), Ss perform best if information is 
presented in the form of instances from 
the smaller and more homogeneous of the 
two conceptual categories, positive and 
negative instances. Which of the two is 
smaller, for any given stimulus popula- 
tion, is determined by the rule—the cate- 
gory of positive instances for conjunctive 
and the category of negative instances 
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for disjunctive and conditional concepts, 
in the present experiment. The fact 
that the smaller category gives S fewer 
attribute combinations and fewer dis- 
tinctly different stimuli to keep track of 
is, in all probability, critical in this 
outcome. 

But category homogeneity obviously 
is not the only factor of importance, for 
Ss perform slightly better with positive 
instances in disjunctive concepts, despite 
the fact that this category contains more 
different instances than the negative 
category. While it is difficult to be 
certain of the additional complications 
represented here, the evidence is con- 
sistent with the conclusions of others 
(a) that Ss find it easier and more natural 
to use a positive as opposed to a negative 
focusing strategy (e.g, Bruner et al., 
1956) or (b) that the necessity to convert 
or transform negative information (what 
the attributes are not) to positive form 
contributes a significant additional com- 
plication to the task (Hovland & Weiss, 
1953). Thus, ina disjunctive problem, 
where the homogeneity ratio (HR, in- 
dexing of the relative frequency of posi- 
tive and negative instances for any 
closed stimulus population) only slightly 
favors the negative instances (HR— 5:4), 
the performance balance is shifted in 
favor of positive instances by focusing 
and/or information-transform (and/or 
some other) factors. This result argues 
that increasing intradimensional varia- 
bility from three to five or seven levels 
would have the effect of making negative 
instances relatively more difficult to work 
with in disjunctions, for this variable 
would change HR from 5:4 to 9:16 or 
13:36, respectively. Reducing the vari- 
ability to two levels per dimension (the 
only possible reduction) would, of course 
have the opposite effect (HR = 3:1). 

In conditional problems, negative in- 
stances lead clearly to superior per- 
formance, an outcome which also is 
consistent with the homogeneity rule; 
HR here was 7:2. The negative cate- 
gory for this rule is nearly as small and 
homogeneous as is the positive cate- 
gory in conjunctions. This relationship 
is probably sufficient to overcome any 
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focusing advantage for positive instances, 
Furthermore, increasing intradimensional 
variability should only accentuate the 
effect; with five levels per dimension, 
HR = 21:4, and with seven levels, 
HR = 43:6. 

The trends are different in RL prob- 
lems. Here, performance improves for 
all rules directly with the variety of 
instances shown during training. Thus, 
a mixture of positive and negative in- 
stances is best in all cases, Whether 
Positives only or negatives only are better 
depends on which contains the greater 
variety of instances, a result which 
clearly contrasts with that obtained in 
AI problems. When S knows the rele- 
vant attributes and must determine the 
relation among them, the greater the 
number of different examples of that re- 
lationship the better; a straightforward 
outcome in retrospect, but one which 
again conflicts with the normal general- 
ization about the relative usefulness of 
positive and negative instances. 

Results of earlier research — (e.g., 
Haygood & Bourne, 1965; Hunt et al., 
1966; Neisser & Weene, 1962) have 
shown, for untrained Ss, a consistent 
ordering in difficulty of the rules studied 
here: conjunctive, disjunctive, condi- 
tional, from easiest to hardest. This 
ordering is known to hold for both 
RL and AI problems, when Ss are shown 
a mixture of positive and negative in- 
stances. These results are replicated in 
the present experiment (under a some- 
what different testing procedure). But 
the present data also show that the order- 
ing can be changed, at least in AI prob- 
lems. When only negative instances are 
available to S, the difficulty order of 
rules is precisely reversed: conditional, 
disjunctive, conjunctive, from easiest to 
hardest. Both the "natural" order (ob- 
tained with mixed positive and negative 
instances), which is replicable with posi- 
tive instances only, and the inverted 
order (obtained with only negative in- 
stances) are consistent with the principle 
of class homogeneity and S's utilization 
of a positive focusing strategy. 

In summary, when the task requires 
detection and identification of the rele- 
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vant attributes among a large number of 
variable stimulus characteristics, best 
performance obtains if the presented 
stimuli are limited either to the positive 
or negative category, whichever con- 
‘tains the smaller variety of instances. 
This is true, even though Ss seem to have 
a tendency (probably acquired through 
preexperimental experience) to attend to 
or focus on instances of the positive 
category. When the attributes are 
known, and their relationship must be 
established, performance is best under a 
training routine which maximizes the 
variety of instances shown. This analy- 
sis has testable implications, notably for 
the effects of intradimensional variability. 
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TACTUAL-KINESTHETIC FEEDBACK FROM MANIPULATION 
OF VISUAL FORMS AND NONDIFFERENTIAL 
REINFORCEMENT IN TRANSFER OF 
PERCEPTUAL LEARNING? 


THOMAS L. BENNETT? ann HENRY C. ELLIS 


University of New Mexico 


48 rats were preexposed to circles and triangles in an otherwise 
visually sparse environment, with opportunity to manipulate the 
forms and nondifferential reinforcement being covaried during pre- 
exposure, Ss allowed to manipulate the forms performed superior to 
those not allowed to manipulate in a subsequent discrimination-learning 
task, when both groups received concurrent nondifferential reinforce- 
ment. Manipulation of the forms in the absence of nondifferential re- 
inforcement produced no effect, Ss receiving nondifferential reinforce- 
ment showed superior performance to those who received experience 
alone. The findings: (a) question the adequacy of an "attention- 


getting" hypothesis to account sol 


ely for differences in transfer effects 


of studies using 2- and 3-dimensional stimuli and (b) require dif- 
ferentiation theory to provide an account of the effects of nondifferen- 


tial reinforcement. 


Gibson and her associates have 
shown that early experience with visual 
forms enabled rats to respond differ- 
entially to the forms more rapidly than 
rats not exposed to the forms (Gib- 
son & Walk, 1956; Gibson, Walk, 
Pick, & Tighe, 1958; Gibson, Walk, & 
Tighe, 1959; Walk, Gibson, Pick, & 
Tighe, 1958, 1959). In one study, 
however, Gibson et al., (1959) found 
that if two-dimensional forms were 
used as the preexposed stimuli, the 
animal’s subsequent performance in 
discrimination ^ learning was not 
enhanced even though the to-be-dis- 
criminated stimuli were themselves 
two-dimensional. In contrast, discrim- 
ination learning was enhanced if the 
preexposure stimuli were three-dimen- 
sional. To account for this difference, 
Gibson et al (1959) postulated an 
"attention-getting" hypothesis, a view 


1 This research was supported by National 
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that emphasizes the process of sorting 
out critical visual properties as cues 
from all the influx of stimulation. 
Only three-dimensional stimuli such as 
cutouts made this sorting possible, they 
hypothesized, because these stimuli 
stand out from their background, 
having depth at the edges which en- 
ables the animal to respond to them 
as articulated signals or cues. An 
alternative interpretation to an atten- 
tional or perceptual account of these 
findings emphasizes the possible role 
of tactual-kinesthetic feedback resulting 
from manipulation of three-dimensional 
forms. Conceivably, the positive trans- 
fer obtained with the three-dimensional 
forms may be due to the opportunity 
for the animals to manipulate the cut- 
outs, a feature not present with two- 
dimensional stimuli. A test of this 
account was made by employing three- 
dimensional forms as stimuli during 
preexposure; however, one group of 
animals was allowed to manipulate the 
forms whereas a second group was pre- 
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vented from manipulating the forms. 
A difference in performance in favor of 
the animals allowed to manipulate the 
forms would be expected if a feedback 
account were reasonable; such a find- 
ing would not lead to a rejection of an 
attentional account but would indicate 
its incompleteness and the necessity to 
specify an additional mechanism, feed- 
back, in order to account for the trans- 
fer effects. 

The results of preexposure studies 
have also been used to support a dif- 
ferentiation theory of perceptual learn- 
ing, a position emphasizing experience 
with the to-be-discriminated stimuli 
as the sufficient condition in certain 
situations for perceptual learning to 
occur. This position as reviewed by 
Tighe and Tighe (1966) contends that 
associative processes, drive reduction, 
and reinforcement are unnecessary in 
accounting for perceptual learning. 
Moreover, it has been argued (Walk 
et al., 1958) that feeding in the pres- 
ence of the visual forms during early 
experience (nondifferential reinforce- 
ment) cannot account for the relative 
ease with which they are subsequently 
discriminated. Therefore, if visual ex- 
perience per se is all that is necessary 
for perceptual learning to occur, then 
animals given only experience with 
visual forms should perform as well 
on-a subsequent discrimination learn- 
ing task as those given experience ac- 
companied by nondifferential reinforce- 
ment. A recent study by Kerpelman 
(1965) has indicated that the superior 
discrimination performance of preex- 
posed animals may be due, at least in 
part, to the effects of exposure accom- 
panied by nondifferential reinforce- 
ment. Nevertheless, his study leaves 
unspecified the role of tactual-kinesthe- 
tic feedback from manipulation of the 
forms, as well as the possible inter- 
action of the effects of feedback from 
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manipulation with those of nondifferen- 
tial reinforcement during preexposure. 
Therefore, an additional purpose of the 
study was to investigate this possible 
interaction. 


METHOD 


Experimental design—The experiment 
was conducted in two stages. During Stage 
1, rats were assigned to one of four condi- 
tions of exposure to three-dimensional circles 
and triangles: (a) In Group M-R the ani- 
mals were allowed to manipulate the forms, 
and the forms were present during feeding 
(nondifferential reinforcement); (b) In 
Group M-NR the animals were allowed to 
manipulate the forms, and the forms were 
absent during feeding; (c) In Group NM-R 
manipulation of the forms was prevented 
by enclosing them in a wire mesh box (the 
forms were still quite visible), and the 
forms were present during feeding; (d) In 
Group NM-NR the animals were simi- 
larly prevented from manipulating the forms, 
and the forms were absent during feeding. 
The Ss were 48 albino rats raised from 
birth under conditions similar to the experi- 
ments of Gibson and her associates. Twelve 
rats were assigned randomly to each of the 
four conditions with the restriction that each 
group contain an equal number of male and 
female Ss. Feeding was allowed for two 
l-hr. periods each day after the animals 
reached age 21 days, a procedure that 
allowed for the control of nondifferential 
reinforcement. 

At the age of 90 days, rats entered Stage 
2 in which they were given 5 days of dis- 
crimination pretraining. ^ Following . pre- 
training the animals of all groups were 
given a circle-triangle discrimination prob- 
lem as a test of the effects of preexposure 
conditions. 

Rearing—The animals were born and 
reared in cages made of j-in. wire mesh 
which were placed in compartments measur- 
ing 15 X 20 X 24 in. The sides, bottoms, and 
tops of the compartments were painted flat 
white, and the back of the compartments 
were flush with the white wall of the room. 
A white oilcloth screen was placed in front 
of the compartments. Water was available 
ad lib; Ss nursed ad lib until the age of 
21 days. Each of the compartments was 
illuminated from 7:00 a.m. until 6:00 P.M. 
Thus, during the rearing period, the visual 
field of the animals consisted of the cage, 
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the compartment walls and ceiling, the 
screen, food tray, water bottle, and cage 
mates. 

At 21 days of age the animals were 
weaned, separated as to sex, marked for 
identification, and assigned to one of four 
experimental conditions. After weaning, the 
animals were fed twice daily with a mesh 
of powdered meal mixed with water. The 
food was placed in trays attached to the cage 
wall for feeding and then removed after 1 
hr. The first feeding was at approximately 
8:00 A.M. and the second at approximately 
8:00 p.m. 

Stimulus placement procedures were intro- 
duced at age 21 days since the ad lib 
nursing allowed up to that time prevented 
control of nondifferential reinforcement and 
exposure to the forms. The stimuli con- 
sisted of two circles, 3 in. in diameter, and 
two equilateral triangles, 34 in. per side, 
which were constructed from % in. thick 
black Plexiglas. For those groups that were 
not allowed to manipulate the shapes 
(Groups NM-R and NM-NR), the forms 
were enclosed in a 3-in. mesh box measuring 
14 X 4X53 in. This allowed the animals 
to view the forms despite the fact that they 
could not be manipulated. For those groups 
that were allowed to manipulate the forms 
(Groups M-R and M-NR), the Plexiglas 
was covered with identical wire mesh, flush 
with its surface, in an effort to equate the 
visual field of all Ss. Each form was sus- 
pended 1 in. in front of each cage wall by 
means of a 2 in. wire attached to the cage 
ceiling. The shapes were placed in posi- 
tion at each feeding with a circle or triangle 
alternately placed at the wall where Ss were 
fed. The placement procedures were con- 
tinued throughout the experiment. 

Since the experimental conditions required 
that the forms be absent from the cages of 
Groups M-NR and NM-NR during the two 
daily 1 hr. feeding periods, it was necessary 
to remove also the forms from the cages of 
the animals in Groups M-R and NM-R 
(groups receiving nondifferential reinforce- 
ment) during two 1 hr. nonfeeding periods 
in order to equate the amount of daily 
exposure to the stimuli for all animals. 
Several studies (e.g, Reid & Finger, 1955) 
have demonstrated that the activity level of 
animals on a deprivation schedule is high- 
est immediately before and during feeding. 
Therefore, to equate the activity level of 
all groups during the time when the stimuli 
were absent, the forms were removed from 
the M-R and NM-R cages for the hour 


immediately preceding feeding. When the 
animals were switched to only one feeding 
period per day at 85 days of age, the same 
general stimulus-placement procedure was 
continued throughout pretraining and dis- 
crimination training. 

Discrimination training.—The testing ap- 
paratus was a Grice discrimination box as 
modified by Gibson and Walk (1956). At 
the dge of 90 days, the animals were given 
5 days of pretraining, identical to the pro- 
cedures outlined by Gibson and Walk. Pre- 
training consisted of training the animals to 
push open the panels in the absence of visual 
forms attached to the panels. 

Discrimination training was begun at 
the age of 95 days following procedures out- 
lined by Gibson and Walk (1956) and by 
Kerpelman (1965). The S could obtain 
a pinch of wet mash from one of the two 
stimulus panels, a black circle or a black 
triangle on a white background, by pushing 
open the correct stimulus door. Half of 
each group was trained to circle (+) and 
half to triangle (+). The incorrect stimu- 
lus panel was locked but baited. If S's 
choice were correct, food was obtained, and 
the next trial began. If the choice were in- . 
correct, S was allowed to proceed into the 
other compartment where it was confronted 
with the stimuli in the same orientation. 
The S was allowed three errors per trial. 
Following the third error, the door in front 
of the correct stimulus opened and the ani- 
mal was allowed to eat. The Ss were run 
to a criterion of 18 out of 20 correct re- 
sponses with the last 10 consecutive re- 
sponses correct, or for a total of 300 trials; 
Ss were given 10 trials/day. 


RESULTS 


Discrimination learning.—The basic 
data obtained in the discrimination- 
learning task are shown in Fig. 1, 
which shows a plot of the percentage of 
correct responses as a function of suc- 
cessive trial blocks for the four condi- 
tions of the experiment. Inspection 
of this figure indicates that the slopes 
of the performance curves were 
roughly equivalent after the animals 
departed from chance level of respond- 
ing. Thus it appears that the preex- 
posure conditions affected the point at 
which the animal began to master the 
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discrimination task, and that once this 
point was reached, the subsequent rate 
at which they mastered the discrimina- 
tion was unaffected. 

Inspection of Fig. 1 further shows 
that Group M-R made more correct 
responses, followed by Group NM-R ; 
Groups M-NR and NM-NR made the 
fewest correct responses and performed 
in a similar fashion. These data indi- 
cate that opportunity to manipulate the 
visual forms contributed to perceptual 
learning provided that Ss received con- 
current nondifferential reinforcement. 
In turn, opportunity to manipulate the 
forms in the absence of nondifferential 
reinforcement produced no additional 
gain in perceptual learning compared 
with Ss given no opportunity to ma- 
nipulate the forms. Furthermore, the 
data indicate that nondifferential rein- 
forcement did contribute to perceptual 
learning in addition to whatever effects 
were produced by experience per se. 
"These interpretations are supported by 
the results of an analysis of variance 
of the mean number of correct re- 
sponses to reach criterion. The analy- 
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sis revealed that nondifferential rein- 
forcement was a significant source of 
variance, F (1, 44) =75.4, p <.01. 
Similarly, opportunity for manipula- 
tion of the forms was also a significant 
source of variance, F (1, 44) = 11.3, 
p<.01. Finally, a significant Rein- 
forcement X Manipulation interaction 
was obtained, F (1, 44) — 104, p< 
01. 

Auxiliary performance measures.— 
In addition to mean trials to criterion, 
median trials to criterion, total errors 
to criterion, and standard deviations 
were obtained. Medians were obtained 
because of the nature of the distribu- 
tion of the criterion scores. Since Ss 
who had not achieved criterion at the 
end of 300 trials were stopped, the 
resulting distribution of scores was 
truncated; under this circumstance, 
medians provide a more representative 
measure, Inspection of Table 1, how- 
ever, indicates that the means and 


medians were highly correlated. In 
addition, inspection of Table 1 reveals 
that error scores are similar in pattern 
to correct responses. 
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Discrimination learning curves as a function of preexposure conditions. 
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TABLE 1 


MEANS, MEDIANS, AND STANDARD DEVIATIONS 
OF THE PERFORMANCE MEASURES 
FOR THE FOUR TREATMENT 
CONDITIONS 


Number of Trials 
to Criterion 


Total Errors to 
E Criterion 
Condition 


NM-R | 196 | 196 | 76 | 189 | 189 | 71 
NM-NR | 262 | 271 | 90 | 254 | 251 | 101 


variance performed on the error scores 
indicated that the effects of nondiffer- 
ential reinforcement were reliable, 
F (1, 44) = 652, p < .01, the effects 
of manipulation were reliable, F (1, 
44) — 7.5, p< O01, and the interac- 
tion was reliable, F (1, 44) — 157, 
p< 0l. 


DISCUSSION 


The principal finding of this study was 
the positive transfer in discrimination 
learning, following preexposure to visual 
forms, as a result of the animal’s oppor- 
tunity to manipulate the forms. This ef- 
fect occurred, however, only with the 
nondifferential reinforcement condition, 
indicating an interaction between the ef- 
fects of tactual-kinesthetic feedback from 
manipulation and nondifferential rein- 
forcement during preexposure on subse- 
quent discrimination learning. This find- 
ing is consistent with the proposed view 
that tactual-kinesthetic feedback from 
manipulation of the forms may provide 
an important source of positive transfer 
to the discrimination task. Moreover, 
this finding questions a simple "attention- 
getting" hypothesis, based on differences 
in depth cues, to account solely for trans- 
fer differences between studies using two- 
and three-dimensional stimuli; rather, 
these data suggest that positive transfer 
effects are obtained with three-dimen- 
sional stimuli because the animal typically 
has opportunity to manipulate such stim- 
uli, providing for an additional source 
of stimulation during preexposure. With- 


out the opportunity to manipulate the 
forms, transfer following preexposure is 
minimal. Although no measure was taken 
of the degree to which the animals 
manipulated the stimuli, inspection of the 
animals during the preexposure period 
clearly indicated that they did manipulate 
the forms. Moreover, the poor. visual 
equipment of the albino rat offers further 
support for this emphasis on the impor- 
tance of contact with and exploration of 
the stimuli. More generally, these data 
are consistent with Epstein's (1967) con- 
clusion that the degree of facilitation re- 
sulting in preexposure studies depends 
upon the extent to which S “attends” to 
the stimuli, if it is allowed that the 
operations associated with "attending" 
may be varied by controlling the oppor- 
tunity for animals to manipulate the pre- 
exposure stimuli. 

A second finding of this study was the 
pronounced positive transfer in discrimi- 
nation learning following exposure to vis- 
ual forms when feeding was correlated 
with the presence of the visual forms. The 
data clearly show that exposure to the 
forms accompanied by nondifferential re- 
inforcement contributes to perceptual 
learning in addition to whatever effects 
are produced by simple exposure to the 
forms. This finding is in agreement with 
the results of Kerpelman (1965), and sup- 
ports his contention that differentiation 
theory seems incomplete as an account of 
the positive transfer typically found in 
preexposure studies. This argument 
holds, of course, only if it is agreed that 
differentiation theory implies no addi- 
tional transfer effect due to nondifferen- 
tial reinforcement, an interpretation made 
by Kerpelman (1965) and seemingly im- 
plied by Tighe and Tighe (1966) and by 
Walk et al., (1958). It should be noted 
that no animals were run under a non- 
exposure condition as such; however, 
Kerpelman’s data indicated that exposure 
to the forms in the absence of nondifferen- 
tial reinforcement produced relatively lit- 
tle facilitation compared with a nonex- 
posure control group. A principal prob- 
lem remaining for differentiation theory is 
that of indicating how it can account for 
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the positive transfer effects of nondiffer- 
ential reinforcement. 
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ERRATA 


In the article by Norman S. Braveman and Gloria J. Fischer in the April 
issue, Footnote 3 at the end of column one on page 676 indicates a formula 
for standard deviation which should read: SD — V nz(1— -)/n. 

In the article by Daniel Fallon in the April issue, the first sentence in 
the first paragraph of the first column on page 551 should read: “. . . occur 
upon initiation of the consummatory response, i.e., the first lick on a dipper cup 
was followed, after a CS-UCS interval of 450 msec., by electric shock.” Also, 
the first sentence in the first paragraph of the second column on page 551 should 
read: *. . . tingent upon initiation of the consummatory response, i.e., the first 
lick on a full dipper cup was followed 450 msec. later by electric shock. . . ." 
Finally, the second half of the third sentence in the second paragraph of the 
first column on page 553 should read: “. . . i.e., 450 msec., after the first lick 
on any full cup punishment was presented.” 
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tion continuum a particular sensory observation 


According to signal detection theory 
(SDT) if the sensation magnitude of an 
observation exceeds S's criterion, he reports 
“signal present,” but if the sensation magni- 
tude is less than the criterion, he attributes 
the observation to noise and reports “signal 
absent.” Variability in the magnitude of this 
background noise (N), which may originate 
from either an internal or an external source, 
is hypothesized to form a normal distribu- 
tion on the sensation continuum. The re- 
peated application of a signal against the 
noise background is represented by a signal- 
plus-noise (SN) distribution, also normal in 
form, located higher on the sensation con- 
tinuum. In the detection situation S's task 
is to decide whether a particular observation 
originated from the N or the SN distribu- 
tion. The location on the sensation con- 
tinuum of the criterion by which § makes 
this decision is determined by the prevailing 
conditions of the detection situation such as 
the costs and values associated with the 
various decision outcomes, probability of 
signal occurrence, and signal intensity. 

One of the implications of SDT is that de- 
cisions should become increasingly difficult 
the closer on the sensory continuum an ob- 
servation is to the criterion. As a conse- 
quence, decision times should increase system- 
atically as the magnitude of an observa- 
tion approaches the criterion. Consistent 
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with this notion is the finding that disjunc- 
tive reaction time (RT) increases as the 
psychological distance between stimuli de- 
creases (Festinger, 1943; Kellogg, 1931). 
It was therefore expected that the manipula- 
tion of variables known to influence S's cri- 
terion would have a corresponding influence 
upon RT in the detection situation. The 
purpose of the experiment to be reported was 
to test this prediction by examining the 
effects of signal intensity and probability of 
signal occurrence, p(S), on RT in a detec- 
tion task. 

In SDT it is assumed that an increase in 
signal intensity will decrease the degree of 
overlap between the N and SN distributions 
by relocating the SN distribution at a posi- 
tion higher on the sensation continuum, 
Since S generally sets his signal criterion 
at some point between the means of these 
overlapping distributions, the net effect of 
increasing their separation is to reduce the 
number of both N and SN observations orig- 
inating from points on the sensory continuum 
near the location of S's signal criterion, 
where decisions are difficult, i.e., observations 
originating from the two distributions be- 
come more discriminable. Thus, RT is ex- 
pected to show a general decrease with in- 
creases in signal intensity. With increasing 
signal intensity the RTs for observations 
originating from the SN distribution should 
decrease as a result of decreases in neural 
transmission time as well as increases in 
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criminability between the N and SN dis- 
tributions. However, RTs for observations 
` originating from the N distribution, where 
neural transmission time is constant for all 
levels of stimulus intensity, were expected to 
decrease with increasing signal intensity 
solely as a result of increases in discrimi- 
nability between the N and SN distributions. 
Thus, decreases in RT with increases in 
signal intensity were expected to be greater 
for decisions based upon SN observations 
than for decisions based upon N observations. 
Manipulating p(S) was also expected to 
influence RT in a detection task. According 
to SDT the higher the p(S), the lower S 
will set his criterion on the sensation con- 
tinuum. Therefore in both the N and SN 
situations when p(S) is high (eg, .80) 
and S's criterion is low, he should on the 
average make more rapid "signal present" 
decisions than when p(S) is low (e.g. .20) 
and his criterion is high. As the criterion 
is lowered larger portions of the N and SN 
distributions are located substantially above 
the criterion. However, when the criterion 
is lowered by increasing p(S), smaller por- 
tions of the N and SN distributions are 
located substantially below the criterion 
which should cause the RTs for signal ab- 
sent decisions to increase as a function of 
p(S) for both the N and SN situations. 
Method.—The Ss were three male students 
enrolled in introductory psychology classes 
at Hamilton College. The stimuli were 200- 
eps sinusoidal vibrations applied for 1 sec. 
to S's index fingertip through a vibrator 
(Goodmans V-47). The intensity of the 
signal was regulated by a decade-attenuator 
box which followed an oscillator in the cir- 
cuit. 
With his right index fingertip resting on 
a vibrator contact + in. in diameter S sat 
with his left hand in contact with a bidirec- 
tional telephone switch that could be closed 
to the left or right from its open position. 
A l-sec. white light served as a ready sig- 
nal and was followed 1-2 sec. later by a 
l-sec. observation interval which coincided 
with a second 1-sec. presentation of the same 
light. The S's task was to judge the pres- 
ence or absence of the tactile stimulus as 
quickly as possible following the second light 
onset. The S made his judgment by closing 
the telephone switch to the right to indi- 
cate signal present and to the left to indi- 
cate signal absent. The closing of the 
switch in either direction stopped a timer 
(Standard Electric) activated automatically 
at the onset of the observation interval. Re- 
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action times were recorded from the timer 
to the nearest 5 msec. Knowledge of results 
was given to S after each judgment. 

The experimental design consisted basi- 
cally of a 3 X 9 factorial combination of three 
values of p(S) (20, .50, and .80) and nine 
levels of signal intensity (—10, —8, —5, —3, 
0, 3, 5, 8, and 10 db. relative to each S's 
absolute threshold for vibrotactile stimulation 
determined in advance by the method of 
limits). 

Each session was divided by short rest 
periods into three blocks of 30 judgments 
each, and a different signal intensity was 
employed in each block. Hence, 30 judg- 
ments for each of the nine signal intensities 
were obtained in Sessions 1-3, during which 
the p(S) was always the same—either 
20, .50, or .80. The nine signal intensities 
were presented in a similar fashion at a 
second p(S) in Sessions 4-6, and again at 
the third p(S) in Sessions 7-9. The entire 
procedure was replicated within Ss over 
Sessions 10-18 and again over Sessions 19- 
27. Both the order in which the three sig- 
nal probabilities were employed over the 
three three-session blocks within a replica- 
tion and the order in which the nine signal- 
intensity values were presented over a block 
of three sessions were random for each S in 
each replication. 

Results and discussion—In order to deter- 
mine the applicability of SDT to a situation 
involving the detection of vibrotactile signals, 
receiver-operating characteristic (ROC) 
curves were examined by plotting the pro- 
portion of SN trials producing signal pres- 
ent decisions (y|SN) against the proportion 
of N trials producing signal present decisions 
(y[N). The prediction from SDT of linear 
ROC curves plotted on normal probability 
coordinates was confirmed for five of seven 
signal intensities ranging from —8 through 
8 db. (Fig. 1). The straight lines fitted 
to the 3 db. and 5 db. data are only sug- 
gestive and were drawn to be consistent with 
the slopes of the straight lines for the other 
five intensity levels. Data for the —10 db, 
and 10 db. intensities are not considered 
since in both cases Ss failed to make suf- 
ficient numbers of judgments to yield reliable 
estimates of y|SN and y|N. Consistent 
with SDT, the three successive points along 
each ROC curve were obtained from p(S) 
conditions where p(S) was equal to 20, 
.50, and .80, respectively. 

Figure 2 shows the relationship between 
mean RT and p(S) for each of seven signal- 
intensity values when S’s decisions were cor- 
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rect. The curves constructed by solid lines 
and filled points are based on S's signal pres- 
ent decisions when the signal was presented, 
and the dashed-line unfilled-points curves are 
based on S’s signal absent decisions when 
the signal was not presented. Since fewer 
than 30 correct decisions of signal present 
were obtained under p(S) of .20 for —8, 
—5, —3, and 0 db. signals, reliable estimates 
of RT were not available, Consistent with 
the findings of a vast number of experiments 
(see Woodworth & Schlosberg, 1954), RT 
decreased as signal intensity increased. Of 
greater interest is the finding that RT for 
signal absent decisions on trials where the 
signal was absent also decreased progres- 
sively as a function of signal intensity. Also 
as predicted, the decrease in RT with in- 
creasing signal intensity was greater for 
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TABLE 1 $t 


MEAN RT For “SIGNAL ABSENT” DECISIONS WHEN ' 
SIGNALS oF Four INTENSITIES WERE PRESENTED . 
AT THREE SIGNAL PROBABILITIES * 


Stimulus Intensity 


p(S) 

-8 -5 -3 0 
.20 A48 448 414 461 
+50 ‘476 1464 1484 471 
+80 4492 4485 +490 1492 


correct decisions of signal present than for 
correct decisions of signal absent, presum- 
ably because with increasing signal intensity 
RTs for correct signal present decisions are 
reduced by decreasing neural transmission 
time and by increasing the discriminability 
between N and SN distributions, whereas 
RTs for correct signal absent decisions are 
affected by only the second of these factors. 

Figure 2 shows that RT for correct signal 
present decisions decreased with increasing 
p(S) while RT for correct signal absent 
decisions increased with increasing p(S). 
These relations were found to exist at all 
levels of signal intensity with the single ex- 
ception of the —3 db. intensity, where the 
increase in signal probability from .50 to .80 
led to decreasing rather than increasing RT 
for correct signal absent decisions. The 
present results are in agreement with those 
Of Carterette, Friedman, and  Cosmides 
(1965) who, employing acoustic signals of 
two intensity values, found RT for signal 
present decisions to increase, and RT for 
signal absent decisions to decrease, with in- 
creasing signal probability. The form of the 
relationships between RT and p(S) found 
in this and in the present experiment. sup- 
ports the assumptions that (a) increasing 
P(S) lowers S's criterion, and (b) the closer 
a particular sensory observation is to the 
criterion, the longer S’s decision time will be. 

Mean RTs based on 30 or more judgments 
for incorrect signal absent decisions are pre- 
sented in Table 1. Consistent with the hy- 
pothesis that the closer a sensory observation 
is to S's criterion the longer his decision 
time will be, these RTs were found to in- 
crease with increases in p(S), Presumably 
increasing p(S) reduces the Proportion of 
SN observations substantially below criterion 
and thus increases mean RTs for reporting 
the absence of a signal when one has been 
presented. It was also predicted that RT 
for incorrect signal present decisions would 
decrease with increases in p( S). However, 
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a sufficient amount of data to test this hy- 
pothesis was not obtained in the present 
study. 
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This experiment reexamined Neisser’s finding that a previous (pas- 
sive) exposure had a facilitative effect on set words but not on 


their homonyms. 


In the present experiment, 2 set conditions were 


used, 1 active and 1 passive. The tachistoscopic thresholds obtained 
following the set-induction phase found set words to differ signif- 
icantly from homonyms and from controls, and homonyms to have sig- 
nificantly lower thresholds than did controls. The data are interpreted 
as indicating that response tendencies as well as perceptual sensitivity 
have been influenced by the induction of a perceptual set. 


Among the experimental examinations of 
the frequency effect in tachistoscopic recog- 
nition, most have presented data indicating 
that the superior recognition of high-fre- 
quency words is at least partially due to 
their increased availability as responses, in 
addition to, or instead of, the greater ease 
of perceiving such words. This has been 
indicated by the results of such diverse 
studies as those of Goldiamond and Hawkins 
(1958), Newbigging (1961), Smock and 
Kanfer (1961), Spence (1963), and Win- 
nick and Nachbar (1967, among others. 

An exception is a study by Neisser (1954) 
concerned with the role of preparatory sets. 
By allowing Ss 1 min. to inspect a list of 
words prior to tachistoscopic threshold deter- 
mination, Neisser induced a preparatory set 
for these words. Subsequent comparison of 
word-recognition thresholds for these set 
words, homonyms of the set words, and con- 
trol words bears on the relative contribution 
of the “frequency of saying” vs. “the ease 
of seeing” explanations for the effects of 
word frequency. Since the same gross- 
naming response would be made by Ss to 
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both members of any given pair (set words 
and their homonyms), Neisser reasoned that 
“recognition of the second member will 
be facilitated if the set has operated directly 
on response probabilities, but will not be fa- 
cilitated if the set affects perceptual processes 
[Neisser, 1954, p. 399].” Finding the data 
to show no facilitation of recognition of the 
homonyms by acquaintance with set words, 
he concluded (and based this conclusion on 
acceptance of the null hypothesis about con- 
trol-homonym differences) that the effect of 
the preparatory set was only to facilitate the 
perceptual recognition of the presented ma- 
terial. 

Three considerations dictated the present 
reexamination of the effect reported by 
Neisser (1954). First, a study by Rouse 
and Verinis (1962) has found lower thresh- 
olds for words that are associates of pre- 
viously seen words than for control words. 
That associates, which are both spelled dif- 
ferently and pronounced differently from 
previously seen words, show influence of 
such a set suggests that a similar influence 
should be at work for homonyms, which are 
pronounced (and sometimes spelled) simi- 
larly to set words. Second is a recent 
analysis by Broadbent (1967) of the word- 
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frequency effect, which points out the strong 
influence of response effects upon word 
recognition. 

The third consideration stemmed directly 
from an examination of Neisser's (1954) 
study. It was reasoned that his failure to 
find the same lowered thresholds for ho- 
monyms as for set words may have been re- 
lated to the passive set induced in his Ss, 
which may not have optimally increased the 
availability of responses, The present study 
therefore compared thresholds for set words, 
homonyms of these words, and control words 
in two groups; one group had a passive set 
similar to Neisser's, and the other had a 
more active set induced by overtly pronounc- 
ing the set words, It was predicted that, 
although with a passive set the results might 
replicate Neisser's, an active set would show 
the lowered thresholds for homonyms as 
well as for set words to be expected from 
the response availability hypothesis. 

Method.—Forty-eight students from the 
Queens College introductory psychology 
course served as Ss. They were randomly 
assigned to the two experimental groups, in- 
volving active and passive sets, and within 
these groups to six subgroups, devised to 
counterbalance materials across conditions. 

The experimental materials were the six 
5-word lists used by Neisser (1954, p. 400). 
For each of the various subgroups three of 
these lists were used, one to provide the set 
words, the other two, the homonyms and 
control words. For presentation of words 
in the Gerbrands tachistoscope, words were 
typed with pica type in capital letters cen- 
tered on 83 X 11-in. white presentation cards. 
The 10 words for the set-induction phase 
were typed individually in capital letters in 
the center of 3 X5-in, white index cards. 
A Kodak laboratory timer was used to time 
the presentation. 

During the first (set-induction) phase of 
the experiment the cards comprising the 10 
preliminary words were individually pre- 
sented to each S with an exposure duration 
of 3 sec. and an interval between cards of 
approximately 5 sec. Only one presentation 
trial was used, and word order was randomly 
varied across Ss. The Ss in the passive 
group were told to concentrate on each word 
as it was held in front of them, while Ss 
in the active group were told, in addition, 
to repeat the words aloud. 

For the tachistoscopic threshold determina- 
tions the lights were dimmed immediately 
following the first phase and S was seated 
in front of the eyepiece of the tachistoscope. 


TABLE 1 


MEAN THRESHOLD VALUES (IN HUNDREDTHS OF 
SECONDS) FOR THE THREE KINDS OF WORDS 
AND THE Two SET CONDITIONS 


Active Passive | Word Type 
Set Set Means 
Set Words 21.46 23.12 22.19 
Homonyms 26.46 29.32 27.92 
Control Words 27.71 35.00 31.54 


The words, presented in random order, were 
five from the set list, five homonyms of the 
remaining words on the set list, and control 
words not seen before. Instructions were 
read by E which were modified from those 
of Pierce (1963) and encouraged Ss to 
guess. Threshold determinations were car- 
ried out in accordance with the modified 
method of limits, each trial beginning at 
10 msec. until S had twice reported the word 
correctly. 

Results.—Thresholds were obtained from 
each S for 15 words, 5 each from the three 
word types. Median threshold values were 
used as the individual scores for each of 
word classes. Means of these values for the 
two groups are shown in Table 1, where it 
can be seen that group differences are small, 
while both set and homonym word classes 
have lower thresholds than do the control 
words. 

To evaluate the obtained differences, a 
mixed analysis of variance was undertaken, 
which had set and list combination as be- 
tween-Ss effects and word type as the sole 
within-Ss effect. Of these main effects, only 
the effect of word type was found to be sig- 
nificant: F (2, 72) —2040, p<.01. A fol- 
low-up Newman-Keuls test found all of the 
word-type differences to be significant; set 
words differed from homonyms and from 
control words at »<.01, while homonyms 
differed from controls at p < .05.2 

Discussion.—This replication of Neisser's 
(1954) study predicted that overt pronuncia- 
tion of the set words might reveal a reduc- 
tion in thresholds for homonyms of these 
words that had not been observed from the 
passive set induced in his study. While this 
prediction of a greater effect with the active 
set was not confirmed, the lowered thresholds 
for homonyms (compared to control words) 
found with both active and passive sets lends 
some support to the view that response 
tendencies, as well as the perceptual sensi- 
tization, play some role in the development 
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of a set in the tachistosocopic recognition 
situation. That homonyms had higher thresh- 
olds than did the set words is no doubt 
the effect of past perceptions of these words, 
which had clear-cut differences from the 
originals. 
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SPATIAL ENCODING STRATEGIES IN SEQUENTIAL 
SHORT-TERM MEMORY 


Ricuarp A, Monty * 
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Keeping-track performance was examined as a function of cues de- 
signed to control the spatial encoding strategies adopted by Ss. 
Spatial organization of the material to be remembered significantly 
affected short-term retention and interacted with the characteristics 
of the type of stimulus materials employed, demonstrating that visual 
encoding can play an important role in short-term memory. 


One model of keeping-track performance 
postulates that when Ss are required to 
tally mentally the number of occurrences of 
each of four stimuli presented sequentially, 
they do not rehearse the names of the stimuli 
to be remembered. Instead, they attempt to 
order the stimuli spatially and rehearse the 
quantity of information assigned to each 
spatial location (Monty, Taub, & Laughery, 
1965). Monty, Fisher, and Karsh (1967) 
found that when the stimuli possessed a nat- 
ural or inherent order (such as consecutive 
letters of the alphabet or consecutive num- 
bers), performance was better than when 
the stimuli possessed no such order (such as 
geometric symbols). Presumably, the nat- 
ural order reduced task difficulty by enabling 
Ss to more readily assign each successive 
stimulus to its appropriate spatial location. 

The purpose of the present experiment 
was to examine keeping-track performance 
as a function of cues designed to control the 

1The author gratefull vue the assistan 
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spatial encoding strategies adopted by Ss. 
If in contrast to models such as those of 
Conrad (1964) and Sperling (1963) which 
stress acoustical factors, visual encoding 
strategies are a crucial determinant of per- 
formance, then a presentation method which 
displays the various stimuli in a spatial array 
consistent with that that Ss ordinarily im- 
pose on the material should enhance per- 
formance. 

Method.—The Ss were 114 military en- 
listed men with minimum General-Technical 
(GT) subtest scores of 100 on the AGCT. 

The apparatus and procedures have been 
described more fully elsewhere (Monty, 
Taub, & Laughery, 1965). Stimuli were 
projected one at a time onto the surface of 
rear projection display units. The S's task 
was to observe and tally mentally the num- 
ber of occurrences of each of four different 
stimuli (categories) presented sequentially 
and to record the tally in writing at the end 
of each sequence. 

The Ss were randomly assigned to six 
groups of 19 each, which differed in terms 
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of the stimulus class viewed (letters or sym- 
bols) and the method of presentation. Three 
groups viewed the letters Q, R, S, and T 
and three viewed the symbols "plus sign," 
“triangle,” “square,” and “heart.” Two 
groups with each stimulus class viewed the 
stimuli on a single display (Single condi- 
tion). Two groups viewed them on four 
adjacent displays, with each category always 
appearing on the same display (Ordered 
condition). Letters were presented in the 
order (from left to right) Q, R, S, and T; 
symbols, in the order “plus sign,” “triangle,” 
“square,” and “heart.” The remaining two 
groups also viewed the stimuli on four ad- 
jacent displays, with each category appearing 
on all displays (Scrambled condition). The 
position in which each successive stimulus 
appeared was entirely random. In all cases 
the actual sequence of categories presented 
to each group was identical, and the rate 
of presentation was held constant at one 
stimulus every 2.3 sec. with an .1-sec. stim- 
ulus on-time and 22-sec. interstimulus off- 
time. 

Trial length, defined as the total number 
of stimuli presented in a sequence, was in- 
cluded as a within-Ss variable. All Ss were 
presented with trial lengths of 8, 12, 16, 
and 20 stimuli. Each of the four trial 
lengths was presented twice in random order 
during each of four blocks of eight trials. 
A 2,500 hz. tone signalled the beginning and 
end of each trial. 

Results and discussion—The sum of the 
absolute differences between the number of 
stimuli presented in each category and the 
number reported in each category was used 
as the measure of performance. The abso- 
lute error scores for the two equal trial 
lengths within each block were summed and 
subjected to an analysis of variance. The 
data underlying the significant main effects 
indicated that, in general, performance 
varied inversely with trial length, F (3, 
324) = 194.04 (p<.001), directly with the 
amount of practice or blocks, F (3, 324) = 
17.23 (p «.001), and as a function of pre- 
sentation method, F (2, 108) —25.66 (p< 
.001). The Scrambled condition and the 
Single condition led to approximately the 
same level of performance, while perform- 
ance with the Ordered condition was some- 
what superior (mean errors per trial of 2.91, 
2.88, and 1.28, respectively). Thus, it ap- 
pears that keeping-track performance can be 
facilitated by imposing spatial encoding cues 
on the materials to be remembered. Further, 
the Trial Length X Presentation Method in- 
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TRIAL LENGTH 


Fic. 1. Mean errors per trial as a WM of pre- 
sentation method and trial length. 


teraction, F (6, 324) — 2.88 (p<.01) shown - 
in Fig. 1, indicates that the greater the 
task difficulty (trial length), the greater the 
reduction in error, 

Data underlying the Trial Length X 
Blocks interaction, F (9, 972) =7.43, (p< 
001), indicates that the longer the trial 
length, the greater the improvement across 
blocks. 


STIMULUS CLASS 


Fic, 2. Mean errors trial as a function of 
sentation method od and stimulus class, AN en 
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The Presentation Method X Stimulus 
Class interaction, F (2, 108) 2413 (p< 
.05), is shown in Fig. 2. To further de- 
lineate the contribution of differences be- 
tween the Scrambled and Single conditions 
to this interaction, the analysis of variance 
was repeated on these groups only. Of 
greatest interest was the failure of the Pre- 
sentation Method X Stimulus Class inter- 
action to reach significance, F (1, 72) — 
.92 (92.05), coupled with the significant 
main effect for stimulus class, F (1, 72) — 
5.48 (p<.05), indicating that performance 
with letters was superior to performance 
with symbols, irrespective of which of the 
two presentation methods was used. Thus, 
the Presentation Method X Stimulus Class 
interaction noted in the first analysis can be 
accounted for primarily on the basis of dif- 
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ferences between the ordered vs. unordered 
conditions as a function of stimulus class. 

In summary, these data would seem to 
indicate that in contrast to the models of 
Conrad (1964) and Sperling (1963), visual 
encoding can play an important role in 
short-term memory, providing direct sup- 
port for the keeping-track model proposed 
by Monty et al. (1965, 1967). 
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ANAGRAM SOLUTION TIMES AS A FUNCTION OF INDIVIDUAL 
DIFFERENCES IN STORED DIGRAM FREQUENCIES * 


RICHARD HARRIS anp HENRY LOESS 
College of Wooster 


40 Ss were assigned to 2 groups on the basis of scores on a skeleton- 
word test measuring individual stored digram frequencies. Later all 
Ss were asked to solve 15 anagrams, 5 each with high, medium, and 
low solution-word digram frequencies. Results confirmed findings of a 
previous study (Tresselt & Mayzner, 1965) that high solution-word, 
digram-frequency anagrams are solved significantly faster than low 
solution-word, digram-frequency anagrams. However, the significant 
difference previously obtained between Ss with high-vs. Ss with low- 
stored digram frequencies was not replicated. The discrepancy appears 
related to a difference in the solution times of high-stored Ss in the 


2 experiments. 


Stimulus-response mediational interpreta- 
tions of anagram problem solving typically 
assume, among other things, that when pre- 
sented with an anagram S makes a series of 
implicit responses that involve successive re- 
arrangements of the letters into new combi- 
nations. Furthermore, letter arrangements 
(e.g, digrams) with high frequencies of 
occurrence in language are assumed to occur 
higher in S’s response hierarchy than letter 
Bude eeu 
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arrangements of low frequencies of occur- 
rence; and, therefore, the former should 
occur sooner and more frequently than the 
latter among S’s implicit rearrangements of 
the letters in the anagram. Two studies 
(Mayzner & Tresselt, 1963, 1966) have veri- 
fied that anagrams with high solution-word 
digram frequencies are solved significantly 
faster than anagrams with low solution- 
word digram frequencies, and these results 
have been interpreted as providing indirect 
support for an S-R mediational hypothesis. 
However, only one study (Tresselt & Mayz- 
ner, 1965) has attempted to measure letter 
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frequency storage in individual Ss and to 
relate differences in storage ability to ana- 
gram solution time. The results indicate 
that Ss with high-stored, digram-frequency 
scores (according to a specially devised 
test) solve anagrams significantly faster than 
Ss with low-stored, digram-frequency scores. 
In view of the relative importance of the 
results for an S-R mediational interpreta- 
tion of anagram problem-solving, the present 
experiment was designed as a replication of 
that study. 

Method.—The study was designed as an 
exact replication of the original The Ss 
were 40 introductory psychology students at 
the College of Wooster. None had pre- 
viously served in an anagram study. The 
stimuli consisted of 15 five-letter anagrams 
typed in capital letters on white 3 X 5-in. 
cards. These had been chosen by Tresselt 
and Mayzner to control anagram letter order, 
solution-word frequency, and anagram di- 
gram frequency. In addition, solution-word 
digram frequencies were varied so that the 
solution words of 5 anagrams had low sum- 
med-sequential, digram-frequency values, 5 
had medium summed values, and 5 had high 
summed values. The purpose was to deter- 
mine if the major variable, individual dif- 
ferences in stored digram frequencies, might 
interact with solution-word digram fre- 
quencies. 

Individual differences in stored digram- 
frequency values were assessed by a skeleton- 
word procedure developed previously (Tres- 
selt & Mayzner, 1965). Sixty-one Ss were 
given the 60-item skeleton-word test. The 
number of correct items ranged 10-28 (com- 
pared to 9-27 in the original study). From 
this distribution, 20 Ss with scores ranging 
10-17 correct and 20 Ss with scores ranging 
20-28 correct were selected as representing 
a low-stored, digram-frequency group and 
a high-stored, digram-frequency group, re- 
spectively. In the original study, the ranges 
for these two groups were 9-18 and 19-27. 

Several weeks after the skeleton-word 
test, the 40 Ss returned individually to solve 
the 15 anagrams. Each S received the ana- 
grams in a different random order and was 
allowed 240 sec. to reach a solution. If no 
solution occurred within 240 sec., S was told 
the solution and then given the next ana- 
gram. Solutions were given verbally by 
S and solution times were determined with 
a Hunter “KlockKounter.” 

Results and discussion —As in the original, 
scores were obtained for each S at each 
level of solution-word digram frequency 
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(high, medium, and low) by determining 
the median solution time for the five ana- 
grams at a given level. Table 1 presents 
the means and standard deviations for these 
values as obtained in the present and in the 
original study. The table shows that for 
the two studies the distributions of scores 
are highly similar for Ss with low-stored 
digram frequencies but not for Ss with high- 
stored digram frequencies. On the other 
hand, both studies show the expected inverse 
relationship between solution-word digram 
frequency and mean score for solution time. 
These observations are supported by appli- 
cation of a 2 X 3 mixed analysis of variance 
design to the results shown in Table 1. 
In the present study the stored digram-fre- 
quency main effect was insignificant, F (1, 
38) —.866, whereas in the Tresselt and 
Mayzner (1965) study the effect was sig- 
nificant, F (1, 38) =21.90, p<.001. In 
both studies, the solution-word, digram-fre- 
quency main effect was significant, F (2, 
76) = 5.97, p < .01 in the present study, and 
F (2, 76) =7.30, p <.005 in the original. 
In both studies the interaction term was less 
than one. 

The discrepancy between the two studies 
is further illustrated by an analysis of solu- 
tion times for high- and low-stored, digram- 
frequency groups.  Tresselt and Mayzner 
found that for every one of the 15 anagrams 
the median solution time was less for the 
high-stored, digram-frequency group than 
for the low-stored, digram-frequency group. 
However, the same was true for only 9 of 
15 anagrams in the present study. Applica- 
tion of a sign test showed this difference 
between high- v. low-stored groups in the 
present experiment not to be significant, 
2=1.11, p 7.05. 

In their study, Tresselt and Mayzner pro- 
vided median solution times for all 15 ana- 
grams, making possible a direct comparison 
between the two studies. For the high- 
stored, digram-frequency groups a sign test 
revealed significantly greater solution time 
in the present study than in the original, 
2=3.09, p<.01. However, for the two 
low-stored, digram-frequency groups solution 
time did not differ, z = 28, p <.05. 

Reasons for the significantly greater solu- 
tion time for the high-stored digram-fre- 
quency group in the present study or for the 
failure to obtain a significant stored digram- 
frequency main effect are not clear. The 
fact that solution times were higher in the 
present study for the high-stored group but 
not for the low-stored group tends to ex- 
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TABLE 1 
ANAGRAM SoLUTION TIMES (IN SEC.) FOR SoLUTION-WORD AND 
STORED DiGRAM-FREQUENCY GROUPS 
Solution-Word Digram Frequencies 
Stored 
Digram Present Study Tresselt & Mayzner (1965) 
Frequencies 
High Medium Low High Medium Low 
igh 
High 63.90 89.35 110.21 25.20 36.80 80.15 
SD 70.96 80.90 82.81 33.25 22.52 68.55 
L 
^M 1 85.31 95.90 139.10 85.25 123.30 143.90 
SD 83.28 89.88 72.70 85.28 89.39 86,99 


clude the likelihood of systematic bias intro- 
duced by poor instructions or experimental 
conditions, which should have made both 
groups higher or both groups lower in the 
present study. 

As nearly as could be ascertained, the dis- 
tributions of scores on the skeleton-word 
test were nearly the same in the two studies, 
Furthermore, the stored digram-frequency 
difference, if anything, should have been en- 
hanced in the present study because the 
middle scores of 18 and 19 were not used, 
whereas in the original study scores of 18 
were assigned to the low-stored, digram- 
frequency group and scores of 19 were 
assigned to the high-stored, digram-fre- 
quency group. 

The original directions for the skeleton- 
word test were not available at the time of 
the study, However, the test was taken di- 
rectly from Tresselt and Mayzner (1965) 
and every attempt was made to administer 
and to score the test as correctly as possible. 
Likewise, the original instructions to S' re- 
garding anagram solving were not available. 
However, the authors learned subsequently 
from Tresselt and Mayzner that they in- 
structed their Ss to avoid solutions which 
involved plurals, foreign words, past tenses, 
and derived words beginning or ending in 
er, such as retake and shoer, Such was not 
done in the present study. As a result, sev- 
eral Ss gave alternate solutions to several 
of the anagrams although all 15 had been 
selected as having only one solution. Spe- 
cifically, the anagram “AEBRL” (BLARE) 
was solved by some Ss as ABLER. Like- 
wise, “RBSCU” (SCRUB) was solved as 
CURBS, and “EODNW” (ENDOW) as 
OWNED. None of the above alternates 
was accepted as a correct solution, and this 


might have accounted for the higher solu- 
tion times of some Ss in the present study. 
The Ss sometimes reported that once they 
hit upon one apparent solution it interfered 
with a further search for the correct solu- 
tion. These subjective reports are consistent 
with studies that have shown word anagrams 
to be more difficult to solve than scrambled- 
letter anagrams (Beilin & Horn, 1962; 
Ekstrand & Dominowski, 1965). However, 
no tally was kept in the present experiment 
to determine whether the “alternate solu- 
tion” Ss tended to be in the high-stored or 
in the low-stored digram-frequency group. 

Finally, examination of the median solu- 
tion times for anagrams in each solution- 
word, digram-frequency category (high, 
medium, and low) revealed extreme dif- 
ferences in solution times within each cate- 
gory. This was true in both studies. While 
considerable variation is to be expected, in- 
spection of the anagrams having high and 
low median solution times suggests that an 
additional variable, referred to by Ronning 
(1965) as the ruleout factor, might warrant 
closer attention in future anagram problem- 
solving research. For example, even though 
the anagrams AEBRL and IFNLG were 
approximately equal in anagram letter order, 
solution-word frequency, and in anagram 
digram frequency, the median solution time 
for IFNLG was 11.1 sec. in the present 
study (4 sec. in the original), and the 
median solution time for AEBRL was 
240 sec. (192 sec. in the original). Ac- 
cording to the ruleout principle, AEBRL 
was open to many consonant-vowel-conson- 
ant-vowel-consonant and diphthongal re- 
arrangements, whereas IFNLG could 
scarcely be rearranged in any other way 
than FLING and still be pronounced. In 
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the present study this would not have created 
any systematic bias, but a reduction in dif- 
ferences in the ruleout factor might have 
reduced the overall variability of perform- 
ance in the experiment and increased the 
sensitivity of the statistical tests. 

In conclusion, the present results pro- 
vided additional evidence that high solution- 
word, digram-frequency anagrams are easier 
to solve than low solution-word, digram- 
frequency anagrams, However, results test- 
ing a second assumption that solution time 
is related to individual differences in stored 
digram frequencies, while in the expected 
direction, did not achieve statistical signif- 


icance. 
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CHILDREN'S TACHISTOSCOPIC RECOGNITION OF WORDS AND 
PSEUDOWORDS VARYING IN PRONOUNCEABILITY AND 
CONSONANT-VOWEL SEQUENCE 


HOBEN THOMAS 
Pennsylvania State University 


3 letter words and pseudoword trigrams varying in consonant-vowel 
order and pronounceability were presented tachistoscopically to 90 1st, 
2nd, and 3rd grade Ss. The hypothesis that recognition accuracy 
should be facilitated with trigrams of high pronounceability, inde- 
pendent of consonant-yowel order, was not confirmed. Recognition 
accuracy does not appear to be a simple function of pronounceability 


as has been maintained. 


Gibson and her colleagues have suggested 
that the significant variable in recognition of 
tachistoscopically presented pseudowords is 
the degree of correspondence between the 
pseudoword's spelling and spelling-to-sound 
relations of the English language. Further, 
it is suggested that a measure of this cor- 
respondence is the pronounceability of the 
pseudoword; the more pronounceable, the 
closer the pseudoword's spelling fits English 
spelling-to-sound rules (Gibson, Pick, Osser, 
& Hammond, 1962). For example, consider 
the pseudoword BESKS. It is rated as 
more pronounceable and is easier to pro- 
nounce than SKSEB. Thus, BESKS 
should correspond more closely to English 
spelling-to-sound rules than SKSEB. There- 
fore, BESKS should be, and is, more easily 
recognized than SKSEB when presented to 
adults tachistoscopically (Gibson et al, 
1962). . 


Suppose it is of interest to determine when 
children acquire spelling-to-sound corre- 
spondence rules. If children, like adults, 
more accurately recognized pronounceable 
than unpronounceable pseudwords it could be 
inferred that they had learned spelling-to- 
sound correspondence rules, However, if 
children, unlike adults, did not show recog- 
nition differences between pseudowords dif- 
fering in pronounceability, then it could be 
inferred they did not know the rules, Such 
was the reasoning of Gibson, Osser, and 
Pick (1963). They showed tachistoscopi- 
cally simple and familiar three letter words 
(eg. PUT) and two pseudoword trigrams 
of the same letters as each word. One tri- 
gram was pronounceable, as determined by 
Underwood and Schulz ( 1960) ratings (e.g., 
TUP); the second trigram was unpro- 
nounceable (eg, PTU). Their Ss, first 
and third graders, were asked to spell what 
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they saw after each word or trigram was 
flashed for M5 sec. onto a screen. Third 
graders were expected to read familiar words 
best, pronounceable trigrams second best, 
and unpronounceable trigrams least well, 
presumably because they should have re- 
ceived instruction in phonics and thus should 
know some correspondence rules. Their data 
supported this expectation. However first 
graders showed the same relative success ; 
they also read pronounceable trigrams more 
accurately than unpronounceable trigrams. 
Thus, the authors tentatively concluded that 
first grade readers acquire spelling-to-sound 
correspondence rules without the benefit of 
instruction in phonics, normally postponed 
until later in the child’s schooling. 

This paper reports an extended replication 
of the Gibson et al. ( 1963) study but with a 
control for a variable which could have 
accounted for their findings. Their words 
and pronounceable trigrams were CVCs, 
Their unpronounceable trigrams were all 
CCVs. Since children experience in their 
books considerably more CVC three letter 
words than CCVs, (actual samplings of 
several primary textbooks revealed ratios 
as high as 15:1) they may be relatively un- 
familiar with CCVs. Some CCVs may ap- 
pear strange to them; and, therefore, they 
might error in their reports of CCVs, not 
because of their unpronounceability but be- 
cause of their relative unfamiliarity with 
CCV letter groups. 

The study to be reported investigated 
consonant-vowel order in the recognition of 
words and trigrams varying in four levels of 
rated pronounceability. If pronounceability 
is the major determinant of tachistoscopic 
recognition of pseudowords, the performance 
of the children should be ordered in cor- 
respondence with the pronounceability of the 
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Method.—Ninety Ss, 15 boys and 15 girls, 
were selected from each of the first three 
grades in a large elementary school! that 
groups the children on the basis of their 
scholastic ability. Children from average 
classrooms participated. $ 

The stimulus words and the three sets 
of trigrams used are shown in Table 1, 
The 8 words were selected from among the 
10 words used by Gibson et al. (1963) and 
were common words familiar to first graders. 

Each of the eight words and the 24 tri- 


i 


grams were reproduced in a random order on | 


a mimeographed sheet and were given, 


together with Underwood and Schultz (1960) fi 


pronounceability rating instructions, to .59 


college students. Table 1 reports the mea. | 


pronounceability ratings for each of the 3 


stimuli and the overall means and standar 


deviations for the word and 3 trigram condi- 


tions. A within-Ss analysis of variance of the _ 
pronounceability conditions was significant, | 


F (3, 147) =238, p <.001. As can be seen 
in Table 1 the stimuli represented almost 
perfectly ordered sets with respect to their 
pronounceability ratings. In order of de- 


creasing pronounceability ratings, words, as ` 


expected, were most pronounceable then 
CVCs, followed by the two CCV sets, The 
probability that this ordering could have 
occurred by chance is small, P «.01 when 
evaluated by Jonckheere's (1954) test. 
Each of these stimuli was reproduced in 
black capital letters with a guide (Leroy 
240). They were photographed on copy film 
and the negatives mounted as 35-mm. slides, 
The procedure closely duplicated Gibson 
et al. (1963). Data were gathered for each 
S separately. The stimuli were projected 
onto a screen 6 ft, in front of S who was 


stimuli, i " 2 Thanks to Ott Kantner and his staff at Bellefonte 
d » independent of consonant-vowel Elementary School for allowing their children to be 
order. used as subjects in this study. 
TABLE 1 
WORD AND TRIGRAM SETS AND MEAN PRONOUNCEABILITY RATINGS 
Words PR cvc PR ccv PR CCV: PR 
PIG 2.24 HAW 3.30 WHA 4.30 HWA 7.58 
GUN 2.52 DIR 4.12 DRI 3.86 RDI 7.98 
PUR 1.94 TAS 3.14 STA 4.20 TSA 6.88 
HUN 3.08 TOH 5.00 THO 4.76 HTO 7.66 
N 2.22 RUF 3.74 FRU 4.62 RFU 7.84 
CAN 2.04 LEP 3.86 PLE 4.72 LPE 8.24 
PET 2.80 RET 3.88 TRE 3.82 RTE 7.56 nm 
SUN 2.46 RAC 3.40 CRA 4.60 RCA 7.38 
Overall 2.41 3.80 4, 
7 1.65 1.92 Ed 185 


Note.—Small numbers represent high, 
values from 1 to 9, 


large numbers low Pronounceability ratings (PR), Ratingémay take | - 


e 


p 
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sitting beside E. The bright projected image 
was 2 in. tall. 

The S was told some letters would be 
flashed on the screen in front of him. His 
task was to spell what he saw. The E 
announced "ready" and four trial stimuli 
were exposed for ló5 sec. to acclimate S$ 
to the task. Instructions were clarified if 
necessary, then the 32 stimuli were presented 
for Vg sec. each. The E recorded S's re- 
sponses. Each S was presented with an 
independently determined random order of 
the 32 stimuli. 

Results and discussion.—Scores were de- 
termined for each S for each of the four 
treatment conditions, ie. for the words and 
geach of the three trigram sets. Only cor- 
"erect spelling orders were counted. Data 
‘analysis took the form of a within- and 
"between-Ss analysis of variance design. Be- 
.'tween-variables were grade and sex, while 

treatments was the within-variable. Only 

treatments reached significance, F (3, 252) 

— 9,69, p<.001. Differences among grades, 

the sex variable, and all interactions were 

nonsignificant. 

Of more interest are the differences among 
words and trigram sets. Table 2 shows the 
means and standard deviations for each 
condition for each grade. There are slight 
grade differences in the expected direction. 
Since neither the grade variable nor any 


TABLE 2 


Means AND SDs FOR EACH GRADE AND FOR GRADES 
COMBINED FOR WORDS AND TRIGRAM SETS 


pic | maw | WHA | HWA 
Grade 1 6.70 | 643 640 | 623 
5 119 | r7 ria | 150 
2 zair | 663 | em | 6.17 
e i21 | 149 | 133 | 163 
: 3 vss | 7.00 | 660 | 6.60 
oes ‘ea | ris | 1.36 | 120 
Combined | 714 | 668 | 6.41 | $33 
7 ais | 142 | 129 | 147 
Note.—PIG = words, HAW = CVCs, WHA = high 


pronounceable CCVs, HWA = low pronounceable 
CVs. All combined means not under! ined by same 
solid line are significantly different from each other, 
s Q5 >p 2.001. For no grade was WHA significantly 


different from 


ae 
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interactions were significant, the grades 
were combined and the treatment means 
compared by Newman and Keuls procedure 
discussed by Winer (1962). Only the two 
CCV sets were not significantly different 
from each other. Other means were sig- 
nificantly different as indicated in Table 2 

The finding of central interest is that even 
with sizable differences in pronounceability, 
for CCV sets there were no differences in 
spelling accuracy. It would appear that 
Gibson, Osser, and Pick's results could be 
accounted for in terms of consonant-vowel 
order, and not necessarily in terms of pro- 
nounceability. It could still be maintained, 
however, that the only real test of whether 
differences between CVC and CCV trigrams 
can be attributed to differences in order and 
not pronounceability would be to hold pro- 
nounceability constant, However, as Table 1 
indicates, CVCs are generally more pro- 
nounceable than the same letters as CCVs. 
Certainly, a conservative conclusion is that 
pronounceability, when used as an independ- 
ent variable in studies of tachistoscopic 
recognition, must be considered in relation 
to consonant-vowel order, at least for chil- 
dren. 

The fact that words were more accurately 
recognized than were CVCs is consistent 
with the Gibson et al (1963) study, 
although the word and CVC sets are not 
directly comparable since different letters 
were used and letters are not equal in their 
discriminability (Gibson, 1965). 
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ROC CURVES FOR DISCRIMINATION OF LINEAR EXTENT' 


C. DOUGLAS CREELMAN anp WAYNE DONALDSON 2 
University of Toronto 


Receiver Operating Characteristics were obtained for identifying 1 of 
2 possible lines as the longer of the pair, for 2 different pairs, and for 
2 different levels of monetary payoff. A priori probability was varied 
between experimental runs at 11 different levels. Monetary reward 
affected criterion, but did not affect detectability; and contrary to an 
earlier report, a priori probability did not affect measured discrimina- 
tion. The data suggest that a decision criterion was set to match 
response proportions to the a priori stimulus probability. 


In a study of discrimination of linear 
extent (Creelman, 1965) the data were 
mostly consistent with a simplified signal 
detectability model (Green & Swets, 1966). 
The single exception was where unequal 
probabilities of signal presentation (.75 and 
.25) seemed to yield higher detectability than 
any of the other conditions. An attempt was 
made to extend and to replicate this finding. 

The stimuli, the experimental setup, and 
procedures were the same as reported earlier, 
Two highly trained male undergraduates were 
paid an hourly rate plus a bonus based on 
performance. Stimuli for a single experi- 
mental run consisted of 100 3 X 5-in. index 
cards, each with a horizontal line that could 
be one of two lengths. The number of occur- 
rences of each length within each run was 
determined by randomly selecting one of the 
experimental conditions, with added varia- 
tion determined by the expected binomial 
variability; i.e., the pack of 100 cards which 
was announced as having 95% longer lines 
seldom contained exactly 95 longer and 
5 shorter stimuli. The Os were fully aware 
of the procedures used to make up the packs 
of stimuli. Experiments intermixed packs 
containing lines 6.3 or 6.5 cm, with ones con- 
taining either 3.2 or 3.4 cm. 

In the first experiment Os were paid 1¢ 
for a correct response, and fined the same 
amount for an incorrect response. In sepa- 
rate conditions the probability of a longer 
line was .05, .10, .20, .30, .40, .50, .60, .70, 


1The data were collected while the senior author 
was under United States Public Health Service Post- 
doctoral Fellowship No. MH12, 140. Partial support 
was from Contract No. NONR-4010(03) between the 
Office of Naval Research and the Johns Hopkins 
University, Assistance from the National Research 
Council of Canada, Grant NRC-APB-111 aided in the 


analysis. zs 3 
?Now at the University of Pittsburgh. 


.80, .90, or .95, and 800 to 900 observations 
were collected under each. In the second 
experiment payoffs and penalties were in- 
creased tenfold, day-to-day earnings were 
posted on the wall of the experimental room, 
and other less formal pressures were exerted 
in an attempt to optimize performance. Not 
all a priori conditions were run in the second 
experiment, since data collection was quite 
expensive; each O was given the two most 
extreme a priori probabilities with each 
stimulus pair and three other conditions, 
either .9, 7, and .5 or .1, .3, and .5, making 
five conditions in all. The results are 
shown in Fig. 1. The ROC plots were fit 
to a logistic model with a maximum likeli- 
hood procedure described by Ogilvie and 
Creelman (1968). The computer program 
which accomplishes the fits also tests whether 
the obtained slope yields an apparent vari- 
ance ratio different from unity, and goes 
on to refit the data on the assumption of 
unit slope, if the test fails. The lines plotted 
represent the best fit, with the assumption 
of equal variance of the underlying distribu- 
tions. Ellipses are drawn around some 
points at two standard error confidence 
limits to give the reader some feeling for 
the inherent variability in the data. It is 
difficult to show variability on an ROC plot 
but ellipses are one way of attempting to 
do so. Two conclusions can be drawn from 
the data. The earlier finding of apparent 
higher detectability for differing a priori 
probability does not hold up. The difference 
is most likely that in this study performance 
was asymptotic before the experiment began, 
whereas there was a suggestion in the earlier 
data of lower initial performance under 
equiprobable conditions. Increasing the 
monetary rewards and penalties moves de- 
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Fic. 1. Probability of rting "Long" when the stimulus was in fact the longer of two alternative 
simnl, as a function of pri lity of reporting “Long” incorrectly. (Results from two stimulus pairs for 


O: JON 


— PREDICTED 3.2 vs 3.4 
o DATA WITH LOW PAYOFF 
€ DATA WITH HIGH PAYOFF 
---- PREDICTED 6.3 vs 6.5 
A DATA WITH LOW PAYOFF 
A DATA WITH HIGH PAYOFF 
——- OPTIMAL EXPECTED 


CRITERION -6 


`~ 


eS 9 lula 2 324r155.6 7 8 9 0 


A-PRIORI PROBABILITY 


Fic. 2, Predicted and obtained likelihood-ratio criterion values for each O; response probability 
matching model and optimal expected value. 
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cision criteria closer to optimum than is 
usually found (cf. Green, 1964), although 
performance still remains far from optimiza- 
tion of expected value under extreme a priori 
probability, 

Rather than operating as an inefficient or 
cautious "ideal observer," the data of Fig.2 
suggest that O adjusts his criterion so he 
will indicate the longer line to be present on 
a proportion of trials equal to the a priori 
probability. For example, if the a priori 
probability is .10, then O will respond 
"longer" approximately 10% of the time. 

The predicted criterion (8) under this 
assumption can be calculated from the a 
priori probability and the obtained d’. For a 
specific value of d', values of Pi(L) and 
Ps(L) were found such that 


P(L) Px(L) +[1—P(L)] Ps(L) =P(L), 


where P(L) is the a priori probability of a 
longer line, and the other terms are the 
axes of Fig. 1. According to this model, 
the resultant likelihood ratio criterion, 8, is 
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not independent of d' For a priori prob- 
ability less than .50, the predicted B in- 
creases with d' and for values greater than 
-50, B decreases as d' increases. 

The predicted and obtained values of B 
are shown in Fig. 2 along with the prediction 
based on optimal expected value. It is clear 
that the data are more consistent with pre- 
dictions from the probability matching 
model, and that high payoff increased the 
closeness of the data to the model, 
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A VECTOR MODEL FOR PSYCHOPHYSICAL JUDGMENT * 


JOHN ROSS ax» VINCENT Dr LOLLO 


University of Western Australia 


27 groups of 20 Ss each made magnitude estimates of 2 successive 
series of weights (light, heavy, very heavy) using 1 of 3 standards 
presented once only before judgments began. The design was a 
3X3X3 factorial, The shape of the judgment functions varied with 
the standard, as did the effects of upward and downward series shifts. 
The main results and supplementary results failed to follow power 
functions, A variable basis judgment model, proposed to explain the 
main results, was tested with follow-up experiments on shift effects 
and on the accuracy of judgments. The variable basis model implies 


that there is no single psychophysical function for lifted weight. 


The traditional form of the psycho- 
physical problem has been to find the 
function relating psychological magni- 
tude to physical magnitude, each being 
treated as a single-valued scalar quan- 
tity. The two main contenders for the 
title of psychophysical function have 
been the Weber-Fechner logarithmic 
law and the Stevens (1958) power law. 

The suggestion made and examined 
here is that the determinants of psycho- 
physical judgment are generally vec- 
tors, i.e., collections of several values, 

1 This research was mainly supported by 
the Air Force Office of Scientific Research, 
Office of Aerospace Research, United States 
Air Force, under Grant No. AF-AFOSR- 
968-66 to the second author. Some support 
was also provided from the Australian Re- 
search Grants Committee under Grant No. 
ARG 15-226 to the first author. 

The authors express thanks to the Educa- 
tion Department of Western Australia and 
to the principals of the high schools in which 
the data were collected. 
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not single valued scalar quantities. A 
model is proposed for magnitude esti- 
mation of lifted weight based on as- 
sumptions about the collection of values 
acting as determinants of judgment of 
weight and the manner in which they 
are combined. The model is simliar to 
one proposed independently by Hake, 
Faust, McIntyre, and Murray (1967) 
to account for contextually produced 
errors in visual size estimation. It has 
often been suggested that stimulus at- 
tributes other than weight affect judg- 
ment of heaviness. Babington Smith 
(1965) reported that judged heaviness 
frequently correlates more highly with 
density than weight for heterogeneous 
assortments of objects (keyhole saws, 

shot, horse’s teeth, and sheep 
shanks). Helson (1964, p. 254) has 
also suggested that the size-weight il- 
lusion depends on a confounding of 
weight and density. 
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' The experiments to be reported ex- 

. amine magnitude estimation of lifted 
weight with repeated trials, with dif- 
ferent standards, with different series, 
and with shift from one series to an- 
other. Magnitude estimation was 
chosen rather than category scaling 
since other studies had shown that 
category systems were seriously af- 
fected by stimulus range and by shift 
(Parducci, 1965; Ross & Di Lollo, 
1968; Stevens, 1958). Constant stim- 
ulus methods were discarded on the 
grounds that any set of standards used 
for comparison might be subject to the 
same influences as the stimulus set 
and might obscure effects of interest, 
especially the sharpness of any contrast 
effects produced by shift. 


Tur MODEL 


The model is built on four assump- 
tions : 


Assumption 1: Representation of 
Stimuli 


Stimulus objects may be represented 
as points in a space of several dimen- 
sions. Each dimension represents ef- 
fective, as distinct from physical, mag- 
nitude of an attribute (eg. weight, 
density). The location of each point 
represents a collection of values, each 
being an effective magnitude for a 
stimulus object on an attribute dimen- 
sion. 


Assumption 2: Effective Attribute 
Dimensions 


The mapping from a space repre- 
senting the physical magnitudes of an 
object to the space representing effec- 
tive magnitudes is nonlinear. A con- 
jecture for two sets of stimulus objects, 
considering only weight and density as 
attributes, is illustrated in Fig. 1, and 
is explained below. 


Assumption 3: Combination of Values 


To form a judgment of magnitude, 
the values represented by the location 
of a point in the space of effective mag- 
nitude dimensions are combined so as 
to maximize discrimination within the 
set of objects judged. 


Assumption 4: Recency Principle 


Experience etablishes the set of ob- 
jects in a multidimensional space. 
Changes in the stimulus objects ex- 
perienced produce changes in the repre- 
sentational set: stimuli experienced 
more recently being more prominent, 
or better established, than those ex- 
perienced longer ago. 

The first assumption says that a psy- 
chophysical judgment is determined 
from a vector, that is a collection of 
values, representable as a point in a 
multidimensional space. The second 
says that different attributes assume 
greater salience, or contribute more 
strongly to discrimination, at different 
ranges of physical stimulus values. As 
an assumption it has some immediate 
plausibility. To lift a heavy small 
weight is to be impressed with its den- 
sity. To lift a light then a heavy weight 
is to be impressed with a difference in 
kind as well as a difference in mag- 
nitude or degree. 

The third assumption says that the 
values of a stimulus vector are com- 
bined so as to maximize discrimination 
within a set of stimuli. If a set lies 
within a range where density differ- 
ences are readily discriminable, but 
weight differences are not, the assump- 
tion implies that the effective density 
value will count more heavily than the 
effective weight value; and conversely 
if weight is the more readily discrim- 
inable. The fourth assumption does no 
more than allow for changes in the 
stimulus set and for change in the basis 
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of combination of values with change 
in the stimulus set. It invokes a re- 
cency principle without developing any 
strong form for it. 

The effect of the assumptions is il- 
lustrated in Fig. 1. On the left, three 
series of weights, L (light), H 
(heavy), and V (very heavy) are 
represented as points in a space of 
physical stimulus magnitudes. Each 
weight series appears twice, once in the 
medium of large tin cans of low density, 
and once in the medium of small plastic 
containers of high density. The three 
series in both types of medium are 
used in the experiments to be reported. 
On the right of Fig. 1 the straight lines 
from the left hand side are shown as 
curved lines because of the nonlinear 
relationship (Assumption 2) between 
the physical space and the judgmental 
Space. A tentative assumption, made 
to explain the results of the experi- 
ments to be reported is that, as a de- 
terminant of judgment, weight becomes 
progressively less salient and density 
progressively more salient as density 


increases. The five points shown as 
circles on the right hand side of Fig. 1 
represent the five weights from Series 
H in the plastic container medium in 
the (judgmental) space of effective at- 
tribute magnitude (Assumption 1). 
The S making estimates is assumed to 
set up a judgment line in the space 
passing through the origin and having 
a slope which permits maximal dis- 
crimination between stimulus points in 
terms of a linear combination of 
their component attribute values. The 
straight line shown passing through the 
five Series H points in the right hand 
portion of Fig. 1 has a slope maximiz- 
ing discrimination. The line labeled 
judgment line is the line passing 
through the origin and having the same 
slope. The distance from the origin to 
the projection of any point onto the 
judgment line forms the magnitude for 
the stimulus located at that point. The 
projections for the five Series H points 
are shown by arrows on the judgment 
line. As the judgment line varies in 
slope, the magnitude for any stimulus 
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will vary, since the distance from the 
origin to its projection on the line will 
vary. 

The slope of the judgment line de- 
pends upon: the series of weights 
judged, since they primarily determine 
the relevant set of objects ; the standard 
employed, since it forms part of the set 
of objects established by experience; 
and any series judged prior to another, 
since a shift in series will usually re- 
quire a progressive readjustment of the 
slope as experience with a new series 
establishes a new set of stimulus ob- 
jects in place of the old. 

The effects of series, standard, and 
prior series on the judgment line are 
illustrated in Fig. 2. A standard of 
200 gm. is used for illustration. The 
labels 200L and 200H refer to the con- 
ditions employing a standard of 200 
gm. and requiring repeated estimates of 
Series L by one group of Ss and Series 
H by another group of Ss, with no shift 
in.series. The labels 200LV, 200HV, 
and 200VV refer to two shift and one 
control condition, with. standard 200 
gm., a preshift Series L, H, or V, and 
postshift Series V. The judgment 
lines labeled are final judgment lines at 


the end of the experiment for each con- 
dition. 

The three sets of five points labeled 
Series L, Series H, and Series V repre- 
sent the five weights in each series. 
The isolated point labeled STD repre- 
sents a standard of 200 gm. It is pre- 
sumed to be at this weight rather than 
at the midweight of Series L (also 200 
gm.) because of some change over time 
in the representation of a standard pre- 
sented once and never again which was 
the procedure followed in the experi- 
ments reported below. 

The points indicated by arrows on 
the judgment lines represent the pro- 
jections of the 200-gm. standard and of 
the midpoints of the appropriate series ; 
distances along the judgment line from 
the origin to the projections specify 
magnitudes on the last trials of each 
condition. The first three values in the 
last row of the table inset on Fig. 2 
give the ratios of series to standard 
magnitudes. These ratios represent an 
account of the terminal preshift estima- 
tion levels (average judgment) for 
Cond. 200 LV and 200 HV and of the 
200 VV control, since standard and 
series are presumed to be compared in 
terms of their projections on the judg- 
ment line established by the standard 
and preshift series experienced by each 
group. 

For the sake of demonstration it is 
supposed that postshift estimates given 
by Ss are the product of the preshift 
estimation levels just referred to, and 
the ratio of postshift to preshift series 
magnitudes on the final postshift judg- 
ment line. Following a shift in series, 
the judgment line shifts from the pre- 
shift position to a position determined 
by experience with the postshift series 
as well as by preshift experience. Ra- 
tios of postshift to preshift series mag- 
nitudes are determined by projections 
on the terminal postshift judgment line. 
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The last two values in the last row 
of the table inset of Fig. 2 give the 
products of preshift series-standard 
ratios and postshift to preshift series 
magnitudes, as just described, for 
Cond. 200 LV and 200 HV. The value 
for the Shift Cond. 200 LV is greater 
than for the Control Cond. 200 VV, 
but the value for 200 HV is less than 
the control value. The model, supple- 
mented with a special assumption about 
the estimation of relative magnitudes 
under shift conditions, predicts positive 
contrasts for one upward shift, 200 LV, 
but negative contrast for another up- 
ward shift, 200 HV. This effect is 
demonstrated in Exp. I. 

Tt should be emphasized that the 
central features of the model are the 
assumption of multidimensional repre- 
sentation of the stimulus as a vector 
and of a judgment line which varies 
with standard and context (subjective 
magnitudes depend on both). The as- 
sumptions made about the comparison 
of magnitudes in order to arrive at 
estimates, especially under conditions 
of shift, and about the representation 
of the stimuli used in the experiments 
reported are tentative. They are used 
to show how a variety of effects, some 
seemingly paradoxical, may be derived 
Írom the central assumptions of the 
model. 

Whatever supplementary assump- 
tions are made about the estimation of 
relative magnitudes, the consequences 
of the model are that judgment lines 
will be affected by the standard and by 
any prior series and will change over 
repeated trials with one series as the 
series comes to dominate the repre- 
sentational set. Some important ex- 
perimental consequences follow: 

1. Since the slope of the judgment 
line will depend both upon the series 
and the standard used, the level of esti- 
mates should not increase in the same 


proportion from series to series with 
different standards. The standard, in 
other words, should have effects other 
than a change in modulus in which esti- 
mates are expressed, and thus should 
affect the shape of estimate functions. 

2. Since the slope of the judgment 
line changes with repeated trials in a 
direction more favorable for discrim- 
ination within a series, the level of 
estimation for the series should change 
with repeated trials. 

3. Shifts from one series to another 
should produce contrast effects which 
vary in magnitude and direction with 
the standard (see Fig. 2). 


EXPERIMENT I 
Method 


Subjects.—The Ss were 540 boys, 16-17 yr. 
old, who were selected haphazardly from 
the fourth-year high school population in 
the metropolitan area of Perth, Western 
Australia. 

Apparatus—The stimulus objects consisted 
of 4-oz. polythene containers, 5.3 cm. in 
diameter and 7.3 cm. high, filled with lead 
shot and paraffin wax to the weight required. 
There were three standard weights, 200, 500, 
and 800 gm., and three stimulus series, each 
consisting of five weights spaced 50 gm. 
apart: Series L (100-300 gm.), Series H 
(400-600 gm.), and Series V (700-900 gm.). 
It should be noted that each standard weight 
is the middle of one of the three series. 

Procedure.—The task throughout was to 
judge the heaviness of the weights in a series 
relative to one of the three standards. At 
the beginning of a session S was seated at a 
table and asked to put his hand through a 
curtain hiding the weights from his view. 
The E then read the following instructions: 


You will be presented with a series of 
weights. The weights will be presented 
. one at a time. Each time I put a weight 
in your hand I would like you to lift 
it once and then put it down. You 
should lift the weight with a motion of 
the hand and wrist only, leaving your 
forearm resting on the table. After you 
put the weight down I would like you 
to tell me your judgment of the heavi- 
ness of that weight. In making your 
judgment, I do not want you to use 


ounces or pounds or grams or any other 
standard measure of weight. Instead, 
I would like you to use numbers. The 
very first weight that you judge you will 
call 100, and your judgment of all sub- 
sequent weights should be proportional 
to the first weight. Thus if the second 
weight feels twice as heavy as the first, 
you will call it 200. If another weight 
feels half as heavy as the very first 
weight you lifted, you should call it 50, 
and so on. 

Remember: each time I put a weight 
in your hand, you are to lift it only 
once and then put it down. In lifting 
the weight move only your hand and 
wrist and leave your forearm resting on 
the table. After you put down each 
weight you are to tell me your judg- 
ment of its heaviness, but I do not want 
you to use ounces, or grams or any other 
standard measure of heaviness. Instead, 
you should call the first weight 100 and 
then judge every other weight accord- 
ingly. 


The S' was presented with a standard, once 
only, at the beginning of the session and was 
then asked for judgment of one stimulus 
series (L, H, or V) for 10 trials (preshift 
stage) and of a second stimulus series for a 
further 10 trials (postshift stage). Each trial 
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consisted of the presentation of all five * 
weights in the series in random order. The 
order was independently randomized for each 
trial but was used for all Ss in all groups. 

The 540 Ss were allocated to 27 groups of 
20 Ss each. The 27 groups were divided into 
three sets of 9 groups: one to each of the 
three standards (200, 500, or 800 gm.). 
Each of the 9 groups in each set was 
allocated to one of nine conditions: LL, LH, 
LV, HL, HH, HV, VL, VH, VV (where 
the first letter refers to the preshift series 
of weights and the second to the postshift 
series). The design was thus a 3 (standard) 
X3 (preshift series) X 3 (postshift series) 
factorial with 20 replications in each cell, 
Each of the three standards was thus paired 
with all nine possible combinations of the 
three series. 

The experiment proceeded smoothly from 
the preshift to the postshift stage without any 
interruption or warning to S. All testing 
was done by female experimental assistants 
who were not aware of the theoretical pur- 
pose of the study. 


Results 


Level and slope of judgments prior 
to shift—The average magnitude esti- 
mates over Trials 6-10 for the weights 
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Fie. 3. Mean magnitude estimate for each weight in each standard-series 
combination over Trials 6-10, 
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* in each series (L, H, or V) are shown 
in Fig. 3 separately for each standard. 
The plots are in terms of untransformed 
coordinates, not the log-log coordinates 
typically employed with magnitude esti- 
mation data. Each point shown repre- 
sents the average response over five 
trials (Trials 6-10) and over three 
groups (groups with identical standard 
and series during the preshift stage). 
Since » — 20 for each group in the ex- 
periment, each point in Fig. 3 repre- 
sents the mean for 300 observations. 

The nine plots are uniformly linear 
and, if the power law holds, the expo- 
nent is in every case clearly close 
enough to unity to treat the data as a 
set of linear functions describable in 
terms of level and slope. 

Figure 4 shows the mean level of the 
judgments for each of the nine com- 
binations of standard and series during 
the preshift stage. Each point shown 
represents the average response over all 
five weights in the series, over five 
trials (Trials 6-10), and over the three 
groups which experienced identical 
standard and series during the preshift 
stage. Since n = 20 for each group in 
the experiment, each point in Fig. 4 
represents the mean of 1,500 observa- 
tions. 

"Thé mean level is consistently great- 
est for Series V and least for Series L, 
and for each series the mean level is 
greatest for the 200 standard and least 
for the 800 standard. 

Table 1 shows the average slope of 
the judgment functions for each of the 
nine combinations of standard and ser- 
ies during the preshift stage. As in 
Fig. 4, each value reported is based on 
1,500 observations. While the slope 
for each series is greatest for Standard 
200 and least for Standard 800, as was 
the case with levels, slope does not vary 
with series in the same’ way for all 
standards. There is a significant inter- 


Standard 
200 gm. 
500 gn. 
800 gn. 


MEAN MAGNITUDE’ ESTIMATE 


t H " 
SERIES 
Frc. 4. Mean magnitude estimate for 


each standard-series combination over Trials 
6-10. 


action effect in the determination of 
slope by series and standard, F (4, 
531) = 3.02, p < .02, which is not sim- 
ply due to multiplicative function. The 
interaction suggests that more than a 
change of modulus takes place with 
variation of the standard used for judg- 
ments of a given series. 

Accuracy of judgments prior to shift. 
—On the assumption, still to be jus- 
tified, that the estimates for each series 
are linear functions of the weights in 
the series, the product-moment correla- 
tion coefficient can be used to measure 
the accuracy of judgments in each case. 


TABLE 1 


MEAN SLOPE OF THE JUDGMENT FUNCTIONS 
FOR EACH STANDARD-SERIES COMBINA- 
TION OVER TRIALS 6-10 


Series 
Standard 
L H v 
200 1.297 1.358 .929 
500 -118 -156 .614 
800 519 549 527 
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TABLE 2 


MEAN CORRELATION BETWEEN WEIGHT AND MAGNITUDE EsTIMATE (BOTH 
RAW AND STANDARDIZED) OVER TRIALS 6-10 AND (STANDARDIZED 
EsTIMATES ONLY) TRIALs 16-20 


Preshift Correlation 


Postshift Correlation 


Preshift Series 
Preshift Raw Standard Postshift 
Series $ r Series 
L H v 

200 

L 91 93 L 92 .90 91 

H .81 86 H 79 87 82 

V .66 74 Vi 70 13 74 
500 

L .89 91 L 93 90 90 

H 78 85 H 82 .82 85 

V 65 74 V 66 .68 7" 
800 

L .89 92 L 93 -90 91 

H AT 83 H 74 .85 81 

V «63 74 V 66 .69 68 


Correlations were computed between 
weight and judgment for each S, taking 
all 5 judgments for each weight over 
Trials 6-10, or 25 judgments in all. 
Table 2 shows the correlations, aver- 
aged, as with slopes, over Ss, groups, 
and conditions. The striking fact is 
the huge effect of the series on correla- 
tion and the lack of effect of standard, 
or of an interaction between series and 
standard as in the case of slope. 

It is the standard that has the large 
effect on slope, and series that has the 
large effect on correlation, F (2, 531) 
= 10.81, p < .001, the standard not af- 
fecting correlation at all, F (2, 531) < 
1. If slope and correlation were both 
dependent on sensitivity to weight dif- 
ference, both would respond to the same 
variables. The fact that they do not 
makes it impossible to interpret both as 
dependent on changing sensitivity to 
difference at different weight ranges, 
without the introduction of a second 
factor governing one and not the other, 
or separate factors for each. 

Correlations were recomputed stan- 
dardizing the mean judgment on Trials 
6-10 to remove one factor that might 


affect correlation but not slope, namely 
variation in the level of judgments over 
trials due possibly to a time-order ef- 
fect. 

The results are also 
Table 2. 

Effect of shift on level of judgments. 
—Mean levels of judgment are shown 
in Fig. 5 for alternate trials, for each 
of the 27 groups in the main study. 
Each point is an average of all weights 
presented on a given trial and of all Ss 
in the group. Each point is thus based 
on 100 observations. 

In comparison with the control 
groups, which had the same standard 
but no shift at Trial 11, it is mostly 
true that a downward shift produces 
some underestimation of the postshift 
series, and an upward shift produces an 
overestimation. Two marked excep- 
tions to the general trend are Groups 
200 HV (Standard 200, Series H 
Trials 1-10, Series V Trials 11-20) 
and 800 LV. Both received upward 
shifts, but in both cases the postshift 
judgments for Series V are below the 
control level (see Fig. 2 and the ac- 
companying comments). 


shown in 
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Whether the shift was upward or 
downward, and whether the postshift 
estimates are under- or overestimates, 
the level for the series increases for all 
groups except one (800 LH) from 
Trial 11 to Trial 20, in marked con- 
trast to findings typical with rating 
scale methods (Di Lollo, 1964). In 
rating scale studies an initial under- 
estimate is followed by a rise in level 
over trials, but an initial overestimate 
is followed by a decline. The effect of 
the rise in level over the postshift trials 
in the present study is to diminish 
undershooting where it occurs, but to 
increase overshooting. The rise in level 
is most marked for Cond. 200 LV, 
which shows overestimation on Trial 11 
and increasing overestimation on sub- 
sequent trials. 

Effect of shift on slope and accuracy. 
—Figure 6 shows the average judg- 
ment function on the first (Trial 11) 


STO = 2009m. 


ESTIMATE 


MEAN MAGNITUDE 


and the last (Trial 20) postshift trials 
for nine representative groups in the 
study. Each point represents a group 
average based on one judgment by each 
S for each weight in the series. 

In all shift conditions the judgment 
function is higher on Trial 20 than on 
Trial 11. In some cases the whole 
judgment function is displaced upward, 
in other cases judgments for the lower 
weights in the series are displaced up- 
ward only slightly, but judgments for 
the higher weights are displaced 
sharply upwards. 

The more striking fact is a char- 
acteristic bending of the judgment func- 
tion on Trial 11 away from the linear, 
most clearly observable in Cond. 200 
LV and 500 HV. Within each series 
of five weights, the points for Weights 
1 and 4 are displaced downwards, 1 
more than 4, and the points for Weights 
5, 3, and 2 are displaced upwards, 2 
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Frc. 5. Mean magnitude estimate for each group on alternate trials. 


10 JOHN ROSS AND VINCENT D: LOLLO 


$29,090 99 (1890, 1499 //499.1; 6900 | 090; ] 900 


*00 
v 
STO OOpa. 
V meee SE em 


ea ce 


WEIGHT (Grems) 


Fic. 6. Mean magnitude estimate on 
Trials 11 and 20 for each weight for nine 
representative groups. (Circles and triangles 
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more than 3, and 3 more than 5. The 
order of presentation of the five weights 
in each series on Trial 11 was 1, 4, 5, 3, 
2. For example, for Series L the oder 
was 100 gm., 250 gm., 300 gm., 200 gm., 
and 150 gm. Thus, the earlier a weight 
is presented on Trial 11 the more it is 
judged below the linear regression line, 
and the later it is presented the more 
itis judged above. The bending effect 
is detectable in almost every shift con- 
dition, upward or downward, but is 
most pronounced for Cond. 200 LV. 
In this extreme condition the judg- 
ments for Weights 2, 3, and 4 (750 
gm., 800 gm., and 850 gm.) are nega- 
tively correlated with weight itself on 
Trial 11. 

A comparison of Fig. 5 with Fig. 6 
will show that the characteristic bend- 


ing of judgments on Trial 11 away 
from the linear is greatest in just those 
cases where the mean level for the 
whole series rises most sharply from 
Trial 11 to Trial 20. A reason for 
the shape of the judgment function on 
Trial 11 and the connection of the 
amount of departure from linearity with 
rise in mean level from Trials 11-20 is 
suggested later. 

Accuracy was measured by correla- 
tions between judgments and weight on 
Trials 16-20, with the mean of the 
judgments standardized by trials. The 
results are shown in Table 2. Once 
again the major effect is due to the 
series itself, with minor effects due to 
standard and preshift series. 

Shape of preshift judgment function. 
—The judgment functions for separate 
series (Fig. 3) look linear, except on 
Trial 11 (Fig. 6), where neither a 
straight line nor a power function of 
any exponent provides an adequate fit. 
To determine if the linearity of the 
functions might be an artifact of aver- 
aging across Ss, correlations were cal- 
culated between log judgments and log 
weights for individual Ss using data 
from Trials 6-10. A ratio was com- 
puted, for each S, of the log-log cor- 
relation to the correlation for the un- 
transformed values. If individual Ss’ 
regressions tended to follow a power 
law, even with exponents varying from 
condition to condition and S to S, the 
average ratio should exceed unity for 
all conditions. 

The average ratio for each condition 
of the main study is given in Table 3. 
There is no evidence that the log-log 
regression is more linear overall than 
the raw value regression. The log-log 
values produce lower average correla- 
tions slightly more often than they pro- 
duce higher correlations, 

Figure 3 showed not only that the 
judgment functions are linear for each 
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TABLE 3 


AVERAGE Ratio or Loc-Loc TO LINEAR 
Propuct-MomMENT CORRELATION BETWEEN 
JUDGMENT AND WEIGHT FOR ALL 27 
PRESHIFT CONDITIONS, BASED ON 
RESULTS FOR TRIALS 6-10 


Series 
Standard 
L H y 
1.002 -985 1.000 
200 1.003 997 992 
1,000 984 .982 
1.022 991 1.000 
500 1.013 .989 1.002 
1.032 .982 .985 
1.012 .985 .989 
800 1.029 .998 1.005 
1.006 -970 -993 


Note,—Average ratio for all conditions = .998, 


series with each standard, but that the 
slope of the function changes sharply 
from series to series for each standard. 
A subsidiary study was carried out to 
determine whether (a) the linearity 
within the range of each series and 
(b) the shift in slope from series to 
series depended upon the fact that sep- 
arate groups worked with each series. 


EXPERIMENT II 


Two series of weights were used: 
Series U15 consisted of 15 weights 
representing the three short series (L, 
H, and V) combined. Series U17 con- 
sisted of the same 15 weights plus a 
weight of 350 gm. and a weight of 750 
gm., so that it ran from 100 gm. to 900 
gm. in 50 gm. steps, with no gaps. Six 
groups (n=10 in each case) were 
employed, each judging one of the two 
series, and using one of the three stan- 
dards, 200, 500, or 800 gm. 

As in the main study, the standard 
was presented once only, and never 
again. Each group then judged the 
series for 10 trials, a trial consisting of 
the presentation of all weights in the 


series once in a randomly determined 
order. 

The judgment functions for standard 
200 based on averages over Trials 6-10 
are shown in Fig. 7. It can be seen 
that the slopes are linear within the 
short ranges, and that the slope changes 
sharply from range to range. The 
presence or absence of a gap at 350 
and 650 gm. makes little difference. 

The chief difference between the re- 
sults of the long series (U15 and 
U17) and the results of the short series 
(L, H, and V) is that the former show 
a monotonic increase of judgment with 
weight, whereas the latter show some 
departure from monotonicity from the 
end of one series to the start of the next 
(Fig. 3). In all cases (short series, 
long series with a gap of 350 gm. and 
650 gm., and long series with no gap) 
the judgment functions show a double 
inflection over the range 100-900 gm., 
which is inconsistent with the power 
law (cf. Luce & Mo, 1965; Pradhan & 
Hoffman, 1963). The double inflection 
is found with all standards, indicating 


900 
SERIES UIS 
mof S 0-309 =a 
soo p 
w 
g 
p 
5 
"oo 
: 
Š voo 
í SERIES UI7 
zZ 700 $T0*200 ga. 
$ 
500 
300 
id AU 
WO 200 300 400 500 $00 700 $00 900 
WEIGHT (Grams) 
Fic. 7. Mean magnitude estimate over 


Trials 6-10 for each weight in Series U15 
and U17 with the 200-gm. standard. 
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that it is not an artifact due to some 
self imposed upper or lower limit on 
the numbers used by Ss to indicate 
weight judgments. 


Discussion 


The features of the first study needing 
explanation are: (a) the variation in the 
shape of the magnitude estimation func- 
tion with the standard, (b) the almost 
invariable upward drift of judgments over 
trials following a shift from one series to 
another, and (c) the “bending” phenom- 
enon on Trial 11 and the fact that it is 
directly related to the drift in judgments 
over trials. A fourth feature, namely the 
control of the standard over the magni- 
tude and direction of contrast effects, has 
already been discussed in the explanation 
of Fig. 2. 


Variation of Estimate Functions with the 
Standard 


If Fig. 2 approximates a true picture of 
the subjective representation of the 
weights used in the main study, and if 
the assumptions offered are correct, the 
judgment line for each standard-series 
combination will be distinct, and so the 
relative magnitude of the series to the 
standard will be established on a different 
basis with each series-standard combina- 
tion. ` Thus the level of magnitude esti- 
mates will not automatically increase from 
series to series in the same proportion for 
each standard, as would be the case if 
the standard did no more than determine 
the modulus in which the judgments are 
expressed. Similarly, the difference be- 
tween judgments for pairs of weights 
within a series will depend upon the 
judgmental basis; and so the slope, like 
the level, will show an interaction between 
series and standard, 

It is thus possible to describe the con- 
trol of the standard over the shape of 
the magnitude estimate function. To ex- 
plain why each function has exactly the 
shape found and to explain the character- 
istically sharp variation in slope from 
range to range requires precise estimates 
of the representation of each of the stimuli 
used, and these are not available. 
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Upward Drift over Trials 


Since the judgment line is assumed to 
shift toward a direction more favorable 
to discrimination between the weights in 
the postshift series, it should normally be 
true after a shift that the judgments for 
weights within the series will increase, 
whether the shift was upward or down- 
ward. 

Consider first Cond. 200 LV, in which 
there is a shift from Series L to Series V. 
Figure 2 shows the hypothetical judg- 
mental basis line pre- and postshift, As 
the judgment line shifts over trials from 
the preshift position (200 L) to the post- 
shift position (200 LV), the ratio of the 
magnitude defined by the projection of 
the midpoint of Series L to the magnitude 
defined by the midpoint of Series V in- 
creases. On the assumption that the mag- 
nitude of the ratio of the postshift to the 
preshift series mediates the level for the 
postshift series, there should be an up- 
ward drift over trials. In the converse 
case, 200 VL, the ratio of the projection 
for the preshift Series V to the projection 
for the postshift Series L also increases 
so that, once again, there should be an 
upward drift over trials. The almost in- 
variable upward drift in estimates over 
trials is thus explained very naturally. 
It is possible to imagine sets of stimulus 
objects for which a shift of judgment line 
in the direction of better discrimination 
would lower the magnitude for the set as 
a whole, and when estimates are available 
it may be possible to design a series of 
stimulus objects which shows a downward 
rather than an upward drift following a 
shift. 

An obvious alternative explanation for 
the upward drift is that there is a time- 
order effect causing a decrease in the 
effective magnitude of the preshift series 
and thus causing the postshift series to 
seem to increase in comparison. The de- 
fect in this alternative is that it seems to 
be unable to account for the fact that the 
magnitude of the upward drift varies con- 
siderably from condition to condition and 
varies with the standard used. If the 
time-order effect is more severe for heav- 
ier objects, as is commonly assumed, the 
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TABLE 4 
Summary or Desicn or Exp. III 
Group 
Stimulus Description 
I II III IV 
Standard 
Material Plastic Tin Plastic Tin 
Weight (grams) 200 200 200 200 
Density (gm/cm?) 1.24 15 1.24 A5 
Preshift Series 
Material Plastic Tin Plastic Tin 
Weight (grams) 100-300 100-300 100-300 100-300 
(Series L) (Series L) (Series L) (Series L) 
Density (gm/cm?) 1.24 TE 1.24 45 
Postshift Series 
Material Tin Tin Plastic Plastic 
Weight (grams) 700-900 700-900 700-900 700-900 
(Series V) (Series V) (Series V) (Series V) 
Density (gm/cm?) .60 . 460 4.97 4.97 
Mean shift in Weight (grams) 600 600 600 600 
Mean Shift in Density (gm/cm*) —.64 45 3.73 4.82 


upward drift over postshift trials should 
be greatest for shifts from heavier stan- 
dard-series combinations. The greatest 
effect found is with Cond. 200 LV. An 
explanation in terms of a change in basis 
seems to accommodate this fact. 


Bending on Trial 11 


The bending effect on Trial 11 (Fig. 
6), and the fact that it is greatest where 
the postshift upward drift in judgments 
is greatest, follows from the change of 
basis explanation if it is realized that the 
change takes place within the first post- 
shift trials as well as between trials, As 
the judgment line shifts, the weights pre- 
sented on Trial 11 are judged on a basis 
which progressively changes to increase 
the comparative magnitude of all weights 
in the postshift range. The weights pre- 
sented first on Trial 11 are therefore 
judged on a lower level than those pre- 
sented later on Trial 11. The results il- 
lustrated in Fig. 5 and 6 are consistently 
and almost perfectly in accord with the 
explanation. Where a large change in 
basis is involved the bending effect is pro- 
nounced within the first postshift trial and 
there is a pronounced upward drift in 
level over all postshift trials. 


EMPIRICAL VERIFICATION OF SOME 
PREDICTIONS 


Two subsidiary experiments to test 
some further consequences of the model 
were carried out and are reported 
below. 


Experiment III 


The strongest contrast effect found 
in the main study was in Cond. 200 LV. 
Estimates for Series V were higher 
than in the Control Cond. 200 VV. 
The mean estimate for Series V rose 
substantially over Trials 11 to 20 and 
the bending of the judgment function 
on Trial 11 was most pronounced. 
(See Fig. 5 and 6). 

If all effects are due to a pronounced 
change in the combinatorial basis with 
a shift from a region of low weight and 
density to a region of high weight and 
density, the effects should all increase 
if the weight shift is the same but the 
upward density shift is made greater. 
Similarly the effects should all decrease 
if the weight shift is the same but the 
density shift is made smaller. A more 
detailed derivation of this prediction 
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can easily be obtained by reference to 
Fig. 1 and 2, 

Method.—The five plastic Series L and 
the five plastic Series V. weights were re- 
produced in the medium of large tin cans 
(10.9 cm. in diameter and 142 cm. in 
height). The required weight was achieved 
by cylindrical inserts which avoided the “bot- 
tom heaviness” of the plastic containers used 
in the main study. The tin cans thus varied 
less from Series L to Series V both in 
density and in certain inertial characteristics, 
while differences in weight remained the 
same. 

The four conditions of the experiment are 
shown in Table 4, 

The weight shift is the same in all cases, 
but the upward density shift increases from 
Cond. I to Cond. IV. Each condition was 
represented by 12 Ss, all school boys drawn 
from the same population as in the main 
study. 

The procedure was the same as in the 
main study. 


Results. and. discussion —The mean 
judgments for each trial are presented 


in Fig. 8. The judgment functions for 
each condition on Trial 11, the first 
postshift trial, and Trial 20, the last 
postshift trial, are presented in Fig. 9. 

It is apparent that with weight shift 
held constant the contrast effects are 
modulated by the magnitude of the 
density shift. All four conditions are 
sharply separated in Fig. 8 and the 
postshift levels are in the order pre- 
dicted. The increments in level from 
Trial 11 to Trial 20 are also ordered 
as predicted. Finally, as Fig. 9 shows, 
the bending of the judgment function 
on Trial 11 is most pronounced in 
Cond. IV, heavily pronounced in Cond. 
III, and scarcely if at all present in 
Cond. I and II. 


Experiment IV 


Given a shift in the judgmental basis, 
the judgments themselves should also 
change. It follows that accuracy of 
judgment should be enhanced by a 
stable judgmental basis and adversely 
affected by a constantly shifting basis. 
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Other factors may affect accuracy (e.g., 
sensitivity to weight difference), but 
for a fixed series of weights the accu- 
racy with which they are judged should 
be higher with a context that forces 
little change in basis and lower with a 
context that forces considerable change 
in basis. Assumptions 2 and 4 imply 
that judgment within a short range of 
weights should produce a stable basis, 
but that judgment over a long range 
should produce a shifting basis. 


Method.—Two series of plastic weights 
were constructed, both of which included the 
five weights comprising Series H of the 
main study (400, 450, 500, 550, and 600 gm.). 
Each series consisted of 17 weights. One, 
Series S, stretched from 100 to 900 gm., 
with weights at 50 gm. intervals; the other, 
Series D, was densely packed in the range 
400 to 600 gm. with weights at 12.5-gm. 
intervals. Series S was designed to produce 
instability in the basis of judgment, Series 
D to produce a relatively stable basis of 
judgment. The weights 400, 450, 500, 550, 
and 600 gm. were common to both series. 

Two groups, of 10 Ss each, were used. 
Both groups used the same standard of 500 
gm. and were given 10 trials. Each S in 
both groups followed the same randomly de- 
veloped order of presentation. The Ss were 
drawn from the same school-boy population 
used in the main study. 


Results and — discussion.—Correla- 
tions were calculated between the five 
weights common to both series and 
mean estimates on Trials 6 to 10. The 
correlations for Ss in each group are 
presented in Table 5. 

There is little overlap between the 
two groups. The densely packed Ser- 
ies D produces correlations substan- 
tially higher, F (1, 22) — 17.14, p< 
-001, as predicted. It is by no means 
obvious that this result is to be ex- 
pected on other grounds. The dense 
crowding of weights in Series D might 
be expected to produce confusion and 
increase task diffculty to the point 
where accuracy is seriously affected. 
Similarly, the clear separation of 


TABLE 5 


CoRRELATION BETWEEN WEIGHTS COMMON 
TO BorH SERIES AND JUDGMENTS OVER 
TRraLs 6-10 For EACH S IN 


EacH Group 
Series 
D Ei 
.89 .68 
88 67 
.83 .62 
18 .59 
17 58 
-75 57 
14 56 
-13 .54 
.68 53 
«65 446 
54 42 
52 5H] 
M = .73 155 


weights in the stretched out Series S 
might be expected to lend the weights 
in the series a distinctiveness which 
facilitated accuracy. 


Some CONCLUDING REMARKS 
Psychophysical Functions 


If it is true that psychophysical judg- 
ments of lifted weights are determined by 
a variable combination of stimulus at- 
tributes, it follows that there is no single 
psychophysical function. 

The psychophysical question now be- 
comes not to discover the form of the psy- 
chophysical function for a given attribute 
but to determine the subjective representa- 
tion of the objects judged and the basis 
determining judgments. 


The Nature of Contrast 


The explanation put forward for the 
results of the main study and of the other 
experiments reported is one which holds 
that contrast effects are genuine, rather 
than artifact of the use of numbers in 
magnitude estimation, The assertion is 
that the impression of heaviness of a 
lifted weight depends upon a basis which 
is determined by experience with weights. 
Magnitude estimation is interpreted as a 
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task in which S uses numbers to indicate 
the ratio of two magnitudes both of which 
are present as impressions by virtue of the 
basis which has been established. 

In an experiment on contrast by Wil- 
liams, Ross, and. Di Lollo (1966) using a 
recognition task, Ss who had made com- 
parative judgments on weights in differ- 
ent ranges differed markedly in the 
weights they recognized as matching pre- 
viously well learned standards. Since no 
explicit magnitude estimation was called 
for, the task being to recognize a familiar 
weight, the experiment offers some sup- 
port for the assertion that the impression 
of heaviness varies with basis. 

The interpretation is similar in some 
respects to Helson’s (1964) notion of 
adaptation level, but unlike Helson’s ac- 
count of judgment under contrast, it does 
not involve the assumption of an “anchor” 
or implicit reference value, The adapta- 
tion assumed is an adaptation to a basis 
favorable for discrimination, and it bears 
some formal similarities to physiological 
conceptions of adaptation, in which the 
activity of receptor mechanisms is as- 
sumed to be modified under control from 
higher centers, in response to the input 
from the environment, 


Further Tests of the Theory 


A sharper test of the theory can be 
made when it is possible to estimate the 
representation of stimuli by individual Ss, 
It will then be possible to make specific 
predictions about levels, slopes, and the 
effects of shift, 

One method for obtaining the necessary 
estimates is multidimensional scaling of 
apparent differences between pairs of ob- 
jects of different weights in a given me- 
dium. Preliminary work has already been 
carried out with Ss making numerical 
magnitude estimates of weight difference. 
The results indicate that the structure is 


at least two-dimensional, and inconsist- 
ency with the detailed propositions ex- 
hibited in Fig. 1 and 2 and the discussion 
accompanying them is not apparent. 

Tt is possible to attack other stimulus 
properties such as pitch, loudness, vibra- 
tion, brightness, and velocity with the 
methods described in this study and thus 
to determine the generality with which 
the theory presented will apply. 
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measure was obtained following 
thematic pass- 
A chunking or unitizing 
ig was assumed, with acquisition 
number of relatively stable core 
lexical units are added with each 
derlie learning of both 
Since the core units for 


present in the input material, these 
es for the prose and random-word 


trials. The corresponding recog- 


due to the fixed effects of 


matching-to-theme in recognition following prose input and incre- 


mental effects of information proc 


quirements of the recognition test and rando 
were consistent with the expectations from the hypo 


expectations that are loosel 
inquiry. 


Subjective organization (SO) is re- 
flected by word-order regularities dur- 
ing recall that may be independent of 
the order of word presentation during 
training. Recall probability for indi- 
vidual words covaries with the degree 
of SO during the repeated-input test 
Cycles of a typical free-recall experi- 
ment (Tulving, 1964). Where train- 
ing consists of repeated presentations 
of randomly ordered words, without 
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essing compatibility between re- 
m inputs. Obtained data 


thetical ideas, 


y derived from theory at this stage of 


interpolated test trials, SO may pro- 
ceed relatively slowly and acquisition 
is demonstrably protracted (Lachman 
& Laughery, 1968). SO of random 
words perhaps reflects an implicit edit- 
ing process that occurs mainly during 
the output phase. Free recall of con- 
nected discourse, in contrast, advances 
rapidly even in the absence of inter- 
polated test trials. Prose, as a result of 
preexperimental language habits, is 
already organized into couplets, 
phrases, and sentences with powerful 
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syntactical linkages between adjacent 
words as well as semantic, associative, 
and thematic relationships among non- 
adjacent words. Input trials alone, 
therefore, may be sufficient to enhance 
SO, i.e., yield increases in the number 
of subject units (S units) which for 
prose reflect the input word order. 
The assumptions are advanced that 
conventional free recall reflects at least 
two underlying processes: (a) reten- 
tion of individual words achieved by 
recency, postremity, rehearsal, etc., and 
(b) retention of S units of two or more 
words (Tulving, 1968). It is post- 
ulated that for random-string inputs, 
output trials are an important condi- 
tion for SO, for the formation of S 
units. Without output trials, the num- 
ber of words in the S units will remain 
constant or increase slowly. The S 
units for connected discourse, in con- 
trast, reflect input word order and 
probably grow in size without output 
trials. Random and syntactic strings 
in the absence of interpolated test 
trials, therefore, will both equally re- 
flect the first process, acquisition of 
individual words. But only prose will 
precipitate a trial-by-trial increase in 
S units and a correlated exponential 
increase in word-unit measures of free 
recall. If these speculations are viable, 
divergent recall acquisition functions 
should be obtained for random and 
syntactic strings across early input 
trials. The result would be a signif- 
icant interaction between training ma- 
terials and trials. 

Recognition measures of retention, 
on the other hand, may be expected to 
yield fundamentally different outcomes. 
One recognition procedure (Lachman 
& Tuttle, 1965) requires that train- 
ing words be randomly mixed with new 
words or distracters. Test words are 
exposed singly for a binary recognition 
decision; one word is judged old or 
new every 1.5 sec. Since one word 
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is presented at a time randomly mixed 
with distracter words, S units and syn- 
tactical units are broken up. When S 
attempts to reconstruct a syntactical 
unit or S unit from a recognition word, 
he may of necessity search his memory 
store for the larger S unit to which the 
test word belongs. Processing infor- 
mation in this fashion might lead to a 
successful recognition provided that 5 
has sufficient time for the search. The 
S is deliberately provided insufficient 
time to prevent the confounding of re- 
call with recognition: implicit recall 
followed by matching-to-sample. Both 
syntactical and subjective organization, 
therefore, are mostly useless if the 
recognition test is paced. Semantic 
factors, instead, are important deter- 
minants of successful retention. Pompi 
and Lachman (1967), for example, re- 
port that following the presentation of 
prose, Ss in a recognition test are 
likely to be comparing many of the 
test words to the main ideas or theme 
of the passage. Thus, a training pas- 
sage with a military theme will result 
in Ss classifying words such as “rifle” 
as old words whether or not the word 
is actually recognized as having been 
previously presented in the passage. 
Moreover, if the central theme of the 
passage is unambiguous, thematic 
recognition is likely to add a fixed in- 
crement to the probability of correct 
recognition, an increment that is con- 
stant across number of presentations. 
For example, if the theme of the con- 
nected discourse is about police activ- 
ity, “handcuff” is as likely to be recog- 
nized after 1 trial as after 10. It is 
assumed that binary recognition re- 
flects at least two processes: (a) re- 
tention of individual words, or parts of 
words, and (b) recognition decisions 
achieved by matching some or all test 
words to surrogate processes that are 
initiated during training. The sur- 
rogate process may be  superordi- 
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nate category names, images, a theme, 
or other summaries of the inputs 
depending on characteristics of the 
training words. Although input trials 
of prose are assumed to yield higher- 
order S units such as phrases and sen- 
tences, these are incompatible with the 
information-processing requirements of 
the recognition test: S is required to 
recognize words, not phrases or sen- 
tences. Inputs of random strings, in 
contrast, force the processing of indi- 
vidual words which later are presented 
as such for recognition. The functions 
relating recognition of words from 
connected discourse to degree of train- 
ing, therefore, will reflect the fixed 
effects of matching-to-theme and will 
yield a gentle slope: a slow rate of 
increase of response probability that 
started from a relatively high level, 
For words presented in random order 
during training, a lower probability of 
correct recognition should be obtained 
initially, reflecting the absence of cor- 
rect recognitions due to matching-to- 
theme. But any additional presenta- 
tions of the training list are likely to 
precipitate relatively larger increases in 
correct recognitions in the random- 
order training group due to informa- 
tion-processing compatibility. The ex- 
pected outcome is convergent acquisi- 
tion functions with a resulting inter- 
action, 


METHOD 


Design.—All treatment conditions were be- 
tween-S. A 4X2X2X2 factorial con- 
tained the following treatments: two, four, 
six, or eight training trials; two sets of 
training lists; presentation in random or 
syntactic-thematic order; and a single recog- 
nition or recall test. Eight Ss were ran- 
domly assigned to each of the 32 treatment 
cells. 

Subjects—The Ss were 256 students of 
introductory psychology at the State Uni- 
versity of New York at Buffalo, who were 
fulfilling a course requirement. 

Verbal materials.—Two lists were chosen 
from six used in the Lachman and Tuttle 
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(1965) study. A compromise was sought 
between the needs of generality and the real- 
istic demands of the size of the experiment 
by choosing the two paragraphs (sniP and 
TRIP) that gave the most dissimilar results 
previously. Each consisted of 100 different 
words drawn írom the Thorndike-Lorge 
(1944) 1,000 most-frequent words, Four 
syntax words were repeated, so that each 
list consisted of 104 words that told a 
coherent story. For the syntactic-thematic 
condition the words were in their correct 
grammatical order, while for the random 
condition these same words were presented 
in random order. A separate random order 
was generated for each appropriate S' by a 
computer algorithm. 

Each word was printed in the left center 
of an otherwise blank IBM card. A cover 
card on each training deck told $ whether 
he would be learning a paragraph or random 
words. The same training deck in the same 
order was used for repeated trials with a 
given S. The test deck for recognition con- 
tained the 100 words írom the training 
list plus 100 randomly intermixed distracters. 
These distracter words were randomly se- 
lected from those of the Thorndike-Lorge 
1,000 most-frequent words that were not 
used in constructing the paragraphs, 

Procedure—The Ss were run in groups 
of from four to eight with the two lists and 
two orders of presentation always varied. 
Four Ss were tested by free recall and four 
by recognition when the group contained 
eight Ss. It was decided beforehand in a 
nonsystematic way how many training trials 
the group would receive. An experimental 
session consisted of either two, four, six, or 
eight input trials followed by a single reten- 
tion test. 

The Ss were instructed in both the train- 
ing and testing procedures before list presen- 
tation began. The instructions and the 
materials on the desk in front of him in- 
formed S as to which type of retention test 
he would take. The Ss were given no par- 
ticular strategy to employ in learning the 
words. 

A training trial was defined as one ex- 
posure to each of the 104 words in the train- 
ing deck. The Ss turned over and read 
each word every 1.5 sec. in time with tape- 
recorded clicks. Immediately upon com- 
pletion of a trial, Ss joggled the cards while 
maintaining their original order. When 
everyone was ready, E said, "begin" and 
they continued with the next trial. The 
intertrial interval was approximately 15 sec. 

Immediately following training, Ss were 
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tested for how many words they could re- 
member. The Ss with recognition test decks 
sorted the cards into "recognize" and "don't 
recognize" categories at the rate of one card. 
every 2.5 sec. in time wtih clicks. The free- 
recall Ss wrote down all the words they 
could remember in any order. 


RESULTS 


Recall responses were scored using 
two different criteria. By a strict cri- 
terion, only words exactly reproduced 
were counted correct. A liberal cri- 
terion involved counting a response 
correct if the only error occurred in 
the inflectional suffix of the word. 
Recognition data were also compiled in 
two ways. Uncorrected scores were 
simply the number of training words 
correctly recognized, while corrected 
scores consisted of the number correct 
minus the number of false-positive 
errors (C—E). Four analyses of 
variance were performed on the data 


using the various combinations of cri- 
teria. There were some slight shifts 
in F ratios (? levels in .01—001 range) 
when the C — E correction factor was 
employed, but the different recall cri- 
teria made no noticeable difference. 
For all analyses, however, rejection 
decisions were identical in that all re- 
sults to be discussed were significant 
beyond the .01 level in all four analy- 
ses. Detailed reporting of data, there- 
fore, is limited to raw recognition 
scores and recall scores based on exact 
word reproductions. 

The analysis of variance for these 
data yielded p<.001 for all main 
effects: degree of training (T), word 
order, syntactic vs. random (WO), 
training list (L), and retention test 
(RT). The F ratios were 547 (df — 
3, 224) for T, 68.2 (df = 1, 224) for 
WO, 187 (df=1, 224) for L, and 
304.7 (df = 1,224) for RT. The fol- 
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Fic. 1. Correct (upper) and false-positive (lower) recognitions and recall as a function 
of number of input trials for words presented in random order or in syntactic-thematic 
order. (Each data point for words correct represents an indépendent group of Ss.) 
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lowing interactions were significant be- 
yond .001: WO x L, F= 237, WO 
x RT, F=241, TX RT, F— 58, 
LXRT, F=20.6, and Tx WO x 
RT, F=11.6. The WOxLxRT 
interaction yielded F= 5.2, p <.05. 
Means for main effects can be in- 
ferred from the cell means plotted 
in Fig. l. The finding of prin- 
cipal interest is the Random vs. Syn- 
tactic X Degree-of-Training X Reten- 
tion Test interaction. Figure 1 shows 
the interaction as it occurred separately 
for each list. It should be noted that 
Several statistically significant effects 
occurred that are due to training lists: 
a main effect and three interactions. 
Nevertheless, the anticipated result of 
convergent acquisition curves (syntac- 
tic vs. random) for recognition and di- 
vergent acquisition curves for free re- 
call were obtained for both lists inde- 


pendently. 
An analysis of variance for false- 
positive errors yielded significant 


effects for trials, F (3, 224) — 8.92, 
b < .001, and for the type of reten- 
tion test, F (1, 224) = 10.21, p <.01. 
Errors appear to decline fairly uni- 
formly with increased training. 


Discussion 


The information-processing model that 
generated this research views the acquisi- 
tion of strings of natural language to be 
essentially a unitizing or chunking pro- 
cess, Accordingly, learning consists of 
the formation of a small number of rela- 
tively stable core units around which 
words or other lexical units are added 
with each input trial. Core units in con- 
nected discourse are of the order of 
phrases and sentences with the theme or 
central ideas of the passage functioning 
in the manner of an executive program 
(Deese, 1968; Pompi & Lachman, 1967). 
The observations of McNulty (1965) to 
the effect that high approximations to 
English are learned as units are con- 
sistent with the present interpretations. 
Moreover, the core-unit hypothesis is also 
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supported by Glanzer’s (1962) finding 
that function words embedded in nonsense 
syllables appear to produce a unitizing 
effect. These formulations taken with the 
recall protocols suggest that phrases from 
connected discourse increase in size over 
trials as words build up around core se- 
quences. Under conditions where these 
events are measured by free-recall pro- 
cedures with the word as the dependent- 
variable unit, trial-by-trial increments in 
learning are sizable and relatively con- 
stant. But conversely, word-recognition 
measures following presentations of con- 
nected discourse are incongruent with the 
implicit information-processing and core- 
unit organization that results from train- 
ing. Consequently, the recognition of 
words taken from connected discourse is 
only slightly influenced by repeated pre- 
sentations. 

The presentation of prose in this study, 
in certain respects, is similar to block 
presentation of conceptually related words 
in the clustering experiments reported by 
Cofer, Bruce, and Reicher (1966), and 
Dallet (1964). A preexperimental and 
efficient basis for organizing the inputs 
in both clustering and prose experiments 
is made apparent to S, and SO of the 
type described by Tulving (1962, 1964) 
is precluded. On the other hand, ‘the 
acquisition of words presented in random 
strings is largely contingent upon SO: 
the core units around which individual 
words are organized are determined by S. 
The range of stimulus characteristics 
around which S is likely to organize his 
outputs is large and frequently idiosyn- 
cratic. Bousfield and Bousfield (1966), 
for example, describe 11 classes of rela- 
tionships among adjacent items that are 
found in free-recall protocols. Since 
many specific forms of SO invoked by S 
in the absence of output trials are in- 
efficient, the free-recall scores for the 
random strings of words presented, as 
expected, rise very slowly across input 
trials. Recognition, in contrast, rises 
relatively sharply, probably because vari- 
ous dimensions of each word must be 
implicitly examined by S in formulating 
the categories and core units developed 
in SO. As McNulty (1965) has demon- 
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strated, recognitions can be successfully 
achieved on the basis of partial learning— 
some part characteristic of a word can 
be used to identify that word during 
recognition, 

In the random-strings groups, it is 
possible that the number of Ss who recog- 
nize the thematic nature of certain words 
increases over trials. If S, for example, 
sees “rifle,” “patrol,” “colonel,” etc., at 
various points in the training list, he 
might begin to suspect the semantic- 
thematic content of the material. The 
result of this state of affairs would be 
consistent with the steep slope of the 
random-order recognition curve presented 
in Fig. 1. Moreover, the early flattening 
of the syntactic-order recognition curve 
may be due to the fact that effects of the 
theme of the passage on recognition are 
more or less constant over trials. Appre- 
hending the main ideas or theme, in short, 
is all or none, and once it is apprehended 
matching-to-theme occurs in recognition. 
There is danger, however, in ascribing too 
much weight to this explanation without 
further evidence. In neither this study 
nor in numerous others employing the 
binary recognition task did Ss obtain 
mean scores above 80% correct. This is 
true of experiments in which 50-word 
random strings and a 50-word paragraph 
were administered for 128 input trials 
(Lachman, Laughery, & Field, 1966). 
An unexplained ceiling effect may be 
constraining recognition, and any hard 
and fast inferences concerning the ex- 
perimental variables are precluded. Nev- 
ertheless, binary recognition measures of 
retention in connected discourse studies 
should not be abandoned. Critical pro- 
cesses can be demonstrated by differential 
error rates (Pompi & Lachman, 1967), 
and free-recall errors are often too few 
and beyond the control of E. 
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DISTANCE PERCEPTION AS A FUNCTION OF 
AVAILABLE VISUAL CUES' 


TEODOR KÜNNAPAS 


University of Stockholm 


4 reduced-cue conditions and 1 full-cue condition were used to 
investigate distance perception as a function of available perceptual 
cues. It was found: (a) that accommodation does not permit any 
accurate perception of distance, not even at distances of 1 or 2 meters, 
(b) that retinal-image size is one of the most important cues for the 
judgment of distances, (c) that with successive increase of the 
number of perceptual cues, the range and the discrimination of 
perceived distances increase and improve in accuracy, and (d) that 
with successive increase of numbers of cues the certainty of the 


judgment increases. 


In their classical experiment Holway 
and Boring (1941) investigated ap- 
parent size as a function of physical 
distance for four conditions. They 
found that the slope of the function re- 
lating apparent size to distance dimin- 
ished as the mode of viewing the stim- 
uli changed from binocular observation 
to monocular observation, monocular 
observation through a small artificial 
pupil, and  monocular observation 
through the artificial pupil and a re- 
duction tunnel. As the number of 
extraneous cues was diminished, the 
slope of the function approached zero 
as a limit, i.e., it approached the law of 
the visual angle. 

The purpose of the present research 
was: (a) to investigate whether ac- 
commodation alone provides for the 
estimation of the distance of objects, 
(b) to determine how distance percep- 
tion improves with the successive addi- 
tion of retinal-image size, binocular dis- 


1 This is a report from the Perception and 
Psychophysics Unit of the Psychological 
Laboratories. The investigation was sup- 
ported by a grant from the Swedish Council 
for Social Science Research. The author is 
greatly indebted to Jan Lindell, who per- 
formed the experiments and assisted in the 
computational work, and to John C. Baird 
for comments on the manuscript. 


parity, and convergence, or all these 
three cues simultaneously, and (c) to 
study the similarity between these re- 
duced-cue conditions and a full-cue 
condition. 


METHOD 


Experimental conditions.—Five conditions 
were used, one in each of five experiments. 
The conditions are summarized in Table 1. 
The experiments under Cond. Rl, R2, R3, 
and R4 were conducted in a completely dark 
room in which the perceptual cues of 
distance were reduced considerably. The 
restriction of cues was greatest in Cond. 
R1. Using monocular regard and a constant 
visual angle, all distance cues except accom- 
modation were eliminated. ^ Possibly, con- 
vergence in the contralateral eye may have 
been an additional cue (Heinemann, Tulving, 
& Nachmias, 1959). In Cond. R2, R3, and 
R4, retinal-image size, convergence, binocular 
disparity, and, finally, all three cues were 
added to accommodation. These reduction 
experiments were complemented by a full- 
cue condition, F, in a well-lighted room. 

Stimuli—Circular areas, painted in phos- 
phorescent color, were used as stimuli. They 
were presented in a completely dark room so 
that only the exposed variable stimulus area 
together with the standard area was visible. 
Stimuli were presented at seven different 
distances: 25, 45, 75, 115, 195, 295, and 395 
cm. The standard area had a distance of 
115 cm. 

In the conditions involving a constant 
visual angle the size of the stimulus area was 


523 


524 TEODOR KÜNNAPAS 


variable so that the retinal-image size was ing the wire caused the circle to return to 
constant at all distances. Corresponding to — the horizontal position. Wires from all 
these distances, the diameters of the circu- stimuli were collected at a steeringboard 
lar areas were: 5, 9, 16, 24, 41, 62, and 82 where E was seated. 


mm. The diameter of the standard area was 


The position of the eye was marked by a 


24 mm. sight consisting of a metal ting with a 

In the variable visual-angle conditions, all ^ diameter of 3 cm. attached to S’s end of 
stimuli had the same size (24 mm.), and the metal rail at the same height as the 
they produced different retinal sizes when stimulus circles, when they were presented. 
located at different distances. The Ss saw the stimuli through the sight. 

Apparatus.—A special apparatus was con- In this way all stimuli were presented at 
structed in order to make presentation of the same eye level and on the same line of 
stimuli constant for all distances and Ss, and sight, 


to eliminate all possible acoustic cues. The 


The standard stimulus area was attached 


apparatus consisted of a horizontal rail of to a floor stand and was visible during the 
thin metal (400 cm.), one end of which entire experiment. The stand was placed 
was supported by a wall stand approximately at a distance of 115 cm. The position of 
25 cm. above the eyes of S. The lower the standard was opposite to that of the 
part of the stand served as a support for variable stimulus. When the latter was 
$'s forehead and chin, The stand could be in the left position, the stanadrd was in the 
regulated in height so that the eye position right, and vice versa, 


was kept constant for all Ss, The rail ex- 


Procedure—The S was seated on a com- 


tended in distance away from S and was fortable chair with a support for the back 
always at the same height. The rail could of his head, and another for his forehead, 
be placed in each of two positions—left or He was instructed not to move his head 
right—35° apart, with S at the vertex of and to keep his eyes shut when stimuli 
this angle. were changed. 


The stimulus circles were fastened to indi- 


The method of magnitude estimation was 


vidual steel wires, 25 cm. in length, attached ^ used. The Ss were instructed to rate the 
to the metal rail. The steel wire was held standard distance as 10 and to estimate the 
in a neutral horizontal position by a small variable distances in relation to this stand- 
spring, thus occluding S’s view of the circle. ard, one variable at a time. 


The stimulus device could be released from 


Left and right positions as well as left 


the spring by pulling a wire at the position and right eye (monocular regard) were 
of E. This permitted the circle to drop varied randomly. 


down into S’s direct line of sight. Releas- 


Subjects —Three different groups of Ss 


TABLE 1 
EXPERIMENTAL CONDITIONS 


Experiment | Condition Regard Visual Angle | Dark or Light Cues 
I R1 Monocular | Constant Dark Accommodation 
II R2 Monocular | Variable Dark Accommodation + 
Retinal-Image size 
HI R3 Binocular | Constant Dark Accommodation + 
Convergence + 
f Binocular disparity 
IV R4 Binocular | Variable Dark Accommodation -+ 
Retinal-image size + 
Convergence ++ 
Binocular disparity 
V F Binocular | Variable Light Accommodation + 


Retinal-image size + 
Convergence + 
Binocular disparity + 
Light and shade + 
Clearness etc. 


IMPORTANCE OF AVAILABLE CUES IN DISTANCE JUDGMENT 
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z TABLE 2 
SUBJECTIVE DISTANCE IN DIFFERENT EXPERIMENTAL CONDITIONS 
Subjective Distance 
OEC MAN 
Opica Distance Ri R2 R3 R4 F 

R SD R SD R SD R SD R SD 
0.25 2.2 12.5 | 5.00 2.2 | 0.71 1.9 | 0.54 2.0 | 0.63 1,9 | 0.45 
0.45 3.9 13.8 | 4.19 4.0 | 0.97 3.5 | 0.69 3.6 | 0.70 3.9 | 0.64 
0.75 6.5 12.7 | 6.44 6.9 | 1.56 6.3 | 0.84 6.8 | 0.69 7.2 | 0.61 
1.15 10.0 11.2 | 3.71 | 10.8 | 2.09 | 10.1 | 0.92 | 10.6 | 0.91 | 10.8 | 0.75 
1.95 17.0 11.4 | 3.52 | 17.3 | 4.43 | 17.0 | 1.99 | 19.1 | 2.20 | 20.5 | 1.81 
2.95 25.7 10.2 | 2.80 | 24.6 | 3.46 | 24.1 | 2.71 | 288 | 3.38 | 30.2 | 2.42 
3.95 34.3 13.9 | 4.01 | 26.8 | 3.78 | 30.8 | 3.44 | 37.5 | 3.11 | 39.8 | 2.98 


^ Given in meters, 


were used, one group for Exp. I and II, 
another group for Exp. III and IV, and a 
third group for Exp. V. These groups con- 
sisted of 10, 8, and 10 Ss, respectively. All 
Ss had normal visual acuity. In the first 
group 5 Ss began with Exp. I, and later 
participated in Exp. II; the other 5 Ss 
participated in these two experiments in the 
reverse order. Likewise, the order of ex- 
periments was rotated in the second group. 
Each S estimated each variable distance four 
times in each experiment (two stimulus posi- 
tions, right and left eyes). 


RESULTS AND DISCUSSION 


The main results are summarized in 
Table 2 and presented graphically in 
Fig. 1. Each scale value is the arith- 
metic mean of 40 estimates in Cond. 
R1, R2, and F, and of 32 estimates 
in Cond. R3 and R4. The intraindivid- 
ual variability is expressed as a stand- 
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Subjective distance as a function of objective distance. 


ard deviation in subjective units. 
Table 2 summarizes these values for 
subjective uncertainty. Each SD is 
the square root of the mean intraindi- 
vidual variance. 

It is seen that accommodation ( Cond. 
R1) did not provide sufficient informa- 
tion for accurate estimation of distance. 
All distances are judged to be approxi- 
mately equal, but slightly larger than 
the standard distance. Subjective un- 
certainty is very large. A number of 
experiments since the time of Wundt 
(eg. Heinemann, Tulving, & Nach- 
mias, 1959) have established the fact 
that, while accommodation may serve 
as a cue for perception of depth differ- 
ences, it is severely limited in this 
capacity. It is probably of little value 
at observation distances greater than a 


40 
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distance (m) 


(“A” represents Cond. R1 


and R2, “B” represents Cond. R3 and R4, and “C” represents Cond. F.) 
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meter. Discriminations of depth based 
on accommodation are neither precise 
nor accurate over distances greater 
than a meter or even less. This experi- 
ment shows that Ss do not accurately 
estimate distances up to one or two 
meters. 

Retinal-image size (R2) is sufficient 
to produce rather satisfactory estimates 
of distance. Also, in all other experi- 
mental conditions Ss gave relatively 
good estimates of distance. Tt is inter- 
esting to note that retinal-image size 
alone provides approximately the same 
depth estimation as binocular disparity 
together with convergence (R3). 
Moreover, retinal-image size has not 
very much less power than a full-cue 
condition (F). "Therefore, the retinal- 
image size must be regarded as one of 
the most important perceptual cues in 
the estimation of distance, at least un- 
der the conditions of this study. Also, 
binocular disparity is an important cue 
for depth, since convergence is usually 
a weak determiner of perceived distance 
(Biersdorf, 1966; Gogel, 1962; Heine- 
mann et al., 1959), 

All curves in Fig. 1 except that of 
Cond. R1 represent power functions 
fitted to the experimental data by the 
method of least squares. The fit is 
very satisfactory if the highest value in 
Cond. R2 is omitted. Thus, subjective 
distance is a power function of objec- 
tive distance in all four conditions 
(Table 3). The entries in Table 3 are 
based upon 


R=k (S—a)", [1] 


By least squares solutions the expo- 
nents for Cond. R2, R3, R4, and F 
vere in general agreement with the re- 
sults of Baird and Biersdorf (1967) 
vho found that the exponent is less 
han 1.0 for distance. In another 
tudy Baird (1965) found that distance 
stimates were close to the theoretical 
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TABLE 3 


SUBJECTIVE DISTANCE AS A Function 
oF PERCEPTUAL CUES 


Experimental 


Condition Equation 
R1 — 
R2 R = 0.998(S — 0.10).& 
R3 R = 0.972(S — 0,09)0.% 
R4 R = 1.008(S — 0.08)» 
F R = 1.033(S — 0.09)» 


straight line function (exponent = 1.0), 
but the empirical function was slightly 
concave upwards, 

The total range of subjective dis- 
tance is expressed as the ratio between 
the maximum and the minimum scale 
values given in Table 4. In Cond. R1 
the range is only 1.1, which means that 
all mean scale values are very close to 
the standard and that there is little dis- 
crimination of distances, Depth 
discrimination begins first with Cond. 
R2 and increases successively with sub- 
sequent conditions up to full cues 
(Cond. F). The latter condition 
yielded results in agreement with those 
reported earlier by Künnapas (1960). 
The range may be expressed also in 
percentage of the maximal range of 
Cond. F. In this regard there is a 
large difference between Cond. R1 
and R2. In the latter case the range 
suddenly increases to 58.2% and grows 
with each successive condition up to 
full cues. From Table 2 it is seen that 
this systematic range increase occurs 
only for values above the standard, 


TABLE 4 
RANGE OF SUBJECTIVE DISTANCE 


Experimental Rang Range in Per- 


Condition (Rotax. /Ratin) | Cone of 
R1 11 5.3 
R2 12.2 58.2 
R3 16.2 714 
R4 18.8 89.5 
p 20.9 100.0 


IMPORTANCE OF AVAILABLE CUES IN DISTANCE JUDGMENT 


while the values below the standard do 
not change beyond random error. 

The intraindividual uncertainty of 
judgments is shown in Fig. 2 where 
the coefficients of variation (100 SD/ 
R) are plotted against the scale values 
R. Condition R1 has the largest sub- 
jective uncertainty, and the uncertainty 
decreases successively with each con- 
dition. It is smallest in Cond. F where 
it reaches a value close to 876 for the 
longer distances. With an increase in 
the number of perceptual cues, the 
certainty of distance judgments in- 
creases steadily. 

The individual results are shown in 
Fig. 3, where individual mean esti- 
mates are plotted against objective 
stimulus distances. Each row repre- 
sents an experimental condition. It 
is seen from the graphs that there is 
a considerable variation between the 
individual mean judgments. The 
interindividual differences are largest 
in the most reduced condition (R1). 
These differences are rather great in 
Cond. R2, and decrease successively 
with each subsequent condition up to 
full cues. Also, in Cond. R4, which 
was the least reduced, the individual 
differences are small and nearly the 
same as with full cues. Thus, the 
interindividual agreement increases 
with the increase of the number of dis- 
tance cues. 

In Fig. 4 the present results are 
compared with those of Holway and 
Boring (1941). These authors plotted 
apparent size against objective dis- 
tance (Fig. 4A). The top broken line 
represents the prediction from the law 
of size constancy, while the bottom 
broken line represents the prediction 
from the law of the visual angle. Ap- 
parent size is a function of distance 
under four sets of conditions: binoc- 
ular regard (B), monocular regard 
(M,), monocular regard with artificial 
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Coefficient of variation 


o 10 20 30 40 
Distance scale value (R) 


Fic. 2. Coefficients of variation as a func- 
tion of distance scale values in experimental 
Cond. R1, R2, R3, R4, and F. 


pupil (Mj), and monocular regard 
with artificial pupil and reduction 
tunnel (M,). The slope of the func- 
tion relating apparent size to objective 
distance diminishes as the number of 
cues is reduced. Under the most se- 
vere reduction, the function approaches 
the law of the visual angle. On the 
other hand, the condition with the 
most cues (binocular) produced an 
overconstancy effect. 

The present results are shown in 
Fig. 4B, where subjective distance is 
plotted against objective distance. The 
top broken line represents the predic- 
tion of accurate distance estimates, and 
the bottom broken line represents the 
prediction of equidistance. Subjective 
distance is a function of objective dis- 
tance for five sets of conditions: full- 
cue, and four reduced-cue conditions. 
The slope of the function relating sub- 
jective distance to objective distance 
diminishes as the number of available 
cues diminishes. Under the most se- 
vere reduction the function approaches 
equidistance. On the other hand, the 
two conditions with the greatest num- 
ber of cues showed an overconstancy 
effect. Thus, the present results with 
subjective distance (Fig. 4B) coincide 
with those of apparent size (Fig. 4A), 
although the experimental situations 
were quite different. For size, Hol- 
way and Boring (1941) always used 
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distance 


Subjective 


Objective distance (m) 


Fic. 3. Individual mean estimates plotted against objective stimulus distances. (Each 
^w represents one experimental condition: the first row (A, B, and C) represents Cond. Ri, 
ie second row (D, E, and F) represents Cond. R2, the third Tow—Cond. R3, the fourth— 
ond. R4 and the fifth—the Cond. F. The parameters are individual Ss.) 


— = 
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Apparent 


Objective 


Subjective distance 


distance (m) 


Fic. 4. A: Apparent size as a function of objective distance under four sets of conditions 
(after Holway and Boring, 1941) : binocular regard (B), monocular regard (M3), monocular 
regard with artificial pupil (Ms), monocular regard with artificial pupil and reduction 
tunnel (Ms). (The top broken line represents the prediction from the law of sise constancy, 
while the bottom broken line represents the preditcion from the law of the visual angle.) 

B: Subjective distance as a function of objective distance for five sets of conditions: 


R1, R2, R3, R4, and F. 


(The top broken line represents the prediction of accurate distance 


estimates, and the bottom broken line represents the prediction of equidistance.) 


the shortest distance as the comparison 
and their largest distance was 12 times 
as great as the standard. In the pres- 
ent experiments the standard was one 
of the middle distances, and the largest 
distance was only about 3.5 times as 
great as standard. In the present ex- 
periment, the standard was fixed in 
distance. In the Holway and Boring 
study, the comparison was fixed in 
distance. Also, in the present situation 
some distances were closer than the 
standard. Nonetheless, the similarity 
of results in Fig. 4A and 4B support 
the hypothesis that size and distance 
judgments are closely related. 
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In 2 experiments, number of food pellets (reward magnitude held 
constant) was varied orthogonally with 5 partially or continuously 


rewarded acquisition trials. 


Significant acquisition occurred. With 1 


or 2 pellets, there was a tendency towards greater resistance to extinc- 
tion after continuous reinforcement; however, with either 10 or 24 
pellets, a significant partial reinforcement effect (PRE) was obtained. 
These data were discussed in terms of their relevance to previous 
demonstrations of the PRE after minimal acquisition, and to a frustra- 


tive nonreward interpretation 


The last five years have witnessed 
the accumulation of a sizeable body of 
experimental data concerned with the 
partial reinforcement effect (PRE) 
after a very limited amount of training. 
These data are said to have important 
implications for a frustration theory of 
the PRE (Amsel, 1958), because they 
reflect increased persistence of partially 
rewarded Ss after a number of train- 
ing trials seemingly insufficient to con- 
dition reward expectancy (rr) and 
countercondition rp — sp to running. 
There has, however, been some lack of 
agreement on the basic finding, de- 
pending primarily on the nature of the 
rewarding condition. Studies using 
wet mash reinforcement (e.g., Capaldi 
& Deutsch, 1967; McCain, 1966) or 
multiple food pellets (e.g, Bowen & 
McCain, 1967; Padilla, 1967) have 
consistently obtained the PRE after 
as few as 2-5 training trials. On the 
other hand, experiments using one or 
two large-size food pellets as reward 
(Surridge, Rashotte, & Amsel, 1967) 
have found no PRE-CRF extinction 
differences after four acquisition trials, 

1 Supported by Research Grants GB-3772 
rom the National Science Foundation and 
APB-72 from the National Research Coun- 
il of Canada. 


of partial reinforcement phenomena. 


whether or not these were preceded by 
extensive CRF training. 

Some sense can be made of these 
apparently contradictory findings by 
relating them to some recent experi- 
ments which have shown that the PRE 
can be produced in a runway apparatus 
without partially reinforcing the run- 
ning response itself. If all running 
trials were continuously reinforced, the 
PRE was nevertheless obtained if S 
was nonrewarded for short approach 
responses to the food cup on goal box 
placement trials (Trapold & Doren, 
1966; Trapold & Holden, 1966), or if 
the rat merely inserted its head into a 
liquid-dispensing magazine (Wilton, 


1967). Rats typically consume multis... 
ple food pellet rewards one pellet at a | 


time, moving back and forth from the 
food cup, and making several goal ap- 
proaches. of the "Trapold" variety on 
a single "trial"  C' observation is 
also that they, ez oy n this . 
manner, rathe. 

uously out of the fo 

cats do. It is therefore 

five trials with single-pellet . / 

really five trials, while five ti 

wet mash or multiple-pellet ri 

are, functionally, many trials. 
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Accordingly, number of food pellets 
(total magnitude equated) was varied 
orthogonally with percentage of re- 
ward in two experiments. Relative to 
one- or two-pellet conditions, a larger 
PRE was expected with multiple pel- 
lets. 


METHOD 


Subjects —Forty naive, male Wistar albino 
rats, 80-90 days old at the beginning of 
pretraining, were used in Exp. I. The Ss 
were habituated to a 10-gm. food deprivation 
schedule for 18 days prior to pretraining. 
In Exp. II, 60 similar S's were used and were 
maintained on 15 days of deprivation prior 
to the start of experimental training. 

Apparatus.—The experiments were con- 
ducted in two versions of the same appa- 
ratus. ` Both were constructed of plywood 
painted flat black and covered with clear 
Plexiglas and had identical dimensions: an 
ll-in. entry box, 11-in. start box, and 39-in. 
runway all were 2% in. wide X 3& in. high. 
Guillotine-type doors separated the entry and 
start chambers from the runway proper 
and formed a 13-in. goal box (GB) area. 
The food cup was a semicircular sheet 
metal trough, extending the width of the 
GB, fixed on the end wall of the GB. The 
proximal lip of the food cup was approxi- 
mately 3 in. above the floor and concealed the 
pellets. Photoelectric circuitry was used 
to obtain three 1-ft. measures of running 
time. 

Design.—In Exp. I, four groups of 10 Ss 
each were arranged factorially for continu- 
ous (C) or partial (P) reward and 2 500- 
mg. or 24 45-mg. Noyes pellets on each R 
trial. In Exp. II, six groups of 10 each 
`£ were given C or P acquisition with 1 1,000- 
mg., 10 97-mg., or 24 45-mg. pellets. 

Procedure.—Daily food rations were given 
at least 30 min. after training throughout 
the entire experiment. The Ss were handled 
for 1 min. daily while;t-abituating to depriva- 
tion. Pretrainin- sed of three 5-min. 
daily =~ odiustment during 

. _ vs ‘explored’ simul- 

jording circuitry opera- 
food in the runway or in 
duing. 
jae five acquisition trials was 
om the others by a 24-hr. ITI. 
youps were given an RNRNR se- 
An both experiments. The Ss were 
-sueyed from GB immediately upon com- 
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pletion of eating (60-90 sec.) and were 
confined for 20 sec. on N trials. No S was 
discarded for failure to run or eat. 

Eleven extinction trials were also sepa- 
rated by 24-hr. ITIs, and confinement to 
GB was again 20 sec. 


RESULTS 


Separate speed measures (reciprocal 
time) were obtained for each runway 
segment. The data presented here are 
total alley speeds based on running 
time over a 3-ft. distance. Analyses 
of variance on the individual measures 
gave virtually identical results in Exp. 
II. In the first experiment, between- 
group differences were smaller in the 
start segment than in either the run 
or goal measures. 

Acquisition—Training data appear 
in the left-hand side of Fig. 1. In- 
creased speed over the five trials of 
acquisition was significant in Exp. I, 
F (4, 144) = 89.07, p < .01. A large 
Percentage X Trials interaction, F (4, 
144) = 12.82, p < .01, resulted from 
the divergence of the P and C groups 
to different terminal speeds. The CRF 
groups ran reliably faster on Trial 5, 
F (1, 36) = 8.78, p < .01. No other 
main or interaction effects exceeded 
chance. 

In Exp. II, there was again a sig- 
nificant increase in speed over the first 
five trials, F (4, 216) = 153.09, p < 
01. By the end of training, the C 
groups were again running faster than 
their respective P groups, F (1, 54) — 
6.58, p < .05, which was also reflected 
in a significant Percentage x Trials 
effect, F (4, 216) = 438, p < .05. 

Extinction: Percentage X Pellets 
effects.—Both experiments indicated 
that the direction of the percentage 
effect was strongly dependent on num- 
ber of pellets. The Percentage x Pel- 
lets interaction was significant in both 
experiments, F (1, 36) = 472, p< 
05, and F (2, 54) = 15.93, p < .01, 
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MEAN SPEED (Ft. /sec.) 


Fic. 1. 


respectively. In Exp. I there was a 
suggestion that the CRF group was the 
more persistent after 2-pellet reinforce- 
ment ; however, a significant PRE was 
present in the case of the 24-pellet 
groups, F (1, 36) 2681, p< .05. 
These trends were reliable in Exp. II. 
In the single-pellet condition, the CRF 
Ss were significantly more resistant to 
extinction than those given PRF train- 
ing, F (1, 54) 2676, p< 01. On 
the other hand, the data of both the 
10-pellet, F (1, 54) = 15.03, p < .01, 
and the 24-pellet groups, F (1, 54) — 
21.87, p < .01, reflected a PRE. 

The number-of-pellets-effect was de- 
termined separately for each percent- 
age condition. In Exp. I, Group 2C 
tended to be more resistant to extinc- 
tion than Group 24C, and Group 24P 
tended to be more persistent than 
Group 2P, but neither comparison was 
significant. In the second experiment, 


L LEE a E 
LE A a E a 7.15" 16 


DAYS (TRIALS) 


Mean total alley speeds during acquisition and extinction. 


there were significant differences 
among the partial groups, F (2, 54) = 
11.90, p < .01, and among the continu- 
ous groups, F (2, 54) = 3.84, p <.05, 
as a function of number of pellets. 
Group 1C extinguished more slowly 
than Group 10C or Group 24C (New- 
man-Keuls p < .05), while the latter 
two groups did not differ. After PRF 
training, there were no differences be- 
tween the 10- and 24-pellet groups, 
although both were more persistent 
(Newman-Keuls p < .05) than Group 
TB 


Response speed decreased signif- 
icantly over the 11 extinction trials 
in both experiments, F (10, 360) — 
41.98, p<.01, and F (10, 540) = 
51.23, p « .01, respectively. Consider- 
ing between-group main effects, only 
the percentage effect in the second ex- 
periment exceeded chance, F (1, 54) 
= 1182, p< .01. 
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Extinction: Percentage x Pellets x 
Trials effects.—These interactions am- 
plify the Percentage X Pellets effect 
and indicate differences in rate of ex- 
tinction, which depend jointly upon 
number of pellets and percentage re- 
ward. The triple interaction was not 
significant in Exp. I. This would sug- 
gest that extinction, which was none- 
theless dependent on the interaction of 
percentage reward with number of 
pellets, did not proceed at differential 
rates. In Exp. II, the Percentage x 
Pellets x Trials interaction was strong, 
F (20, 540) = 2.90, p < .01. For this 
reason, the Percentage X Trials effect 
was calculated separately for each num- 
ber-of-pellet condition. In the 1-pellet 
groups, the Percentage X Trials effect 
was not significant. This confirms the 
fact (see Fig. 1) that although there 
was an overall difference in the ex- 
tinction performance of Groups IC and 
1P, this reflected terminal acquisition 
levels, and the groups did not ex- 
tinguish at differential rates. With 
both 10-pellet reward, F (10, 540) = 
3.04, p < .01, and 24-pellet reward, F 
(10, 540) = 7.20, p < .01, there were 
significant Percentage X Trials effects. 
Referring again to Fig. 1, the curves 
for the 10C and 24C conditions are 
negatively accelerated, while the 10P 
and 24P curves are linear or posi- 
tively accelerated. 


Discussion 


The two experiments taken together 
indicate rather clearly that CRF vs. PRF 
extinction, after a small number of ac- 
quisition trials, is critically affected by the 
type of reinforcer used. Both studies 
gave evidence for the dependence of per- 
sistence upon the joint effect of per- 
centage reward and number of pellets, 
In addition, there was some evidence that 
this interaction was sufficiently strong to 
produce extinction gradients having dif- 
ferent curvature, i.e, consistent decrease 
as opposed to an initial period of un- 
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diminished, or even increasing, response 
strength. These effects were completely 
absent in the acquisition data. 

Observation of the 10- and 24-pellet 
groups, which did display the PRE, indi- 
cated that S made multiple approaches to 
the food cup while eating the indicated 
number of pellets. (This is in keeping 
with our observation that while rats are 
consuming wet mash from a dish they also 
make back-and-forth responses.) Remem- 
bering that both Trapold (1966a, 1966b) 
and Wilton (1967) have demonstrated the 
PRE (without nonreward of the instru- 
mental response during acquisition) by 
simply allowing S to make a short ap- 
proach to the food cup, or to nose into a 
liquid-dispensing magazine, the fact that 
the PRE was obtained in the authors’ 
10- and 24- pellet conditions and in prior 
wet-mash and multiple-pellet reward 
studies is not surprising. 

The present data can be reconciled with 
prior investigations which have—as well 
as those which have not—obtained the 
PRE after a very limited amount of train- 
ing, and with studies of goal-box approach 
behavior, in terms which are compatible 
with the frustrative nonreward analysis. 
Multiple-pellet rewards could result in 
faster conditioning of rg than rewards of 
one or two pellets. The faster CRF ex- 
tinction after 10- or 24-pellet acquisition 
can be taken to reflect differentia] strength 
of rg as a function of number of pellets, 
Since Ry depends on some minimal 
amount of rp, it might be a factor in 
acquisition only in the multiple pellet 
PRF groups; consequently, only the 10P 
and 24P groups would be trained in the 
presence of rp-sp, and only in these 
groups could the running response be- 
come counterconditioned to sp. Follow- 
ing Spence’s (1956) theory of instru- 
mental reward learning, this countercon- 
ditioning would develop according to a 
contiguity principle. Even if rp were 
occurring on only the fifth trial of the 
RNRNR sequence, sp would be present 
on about 10 goal approaches in condition- 
ing of one group and 24 in another, not 
to consider further conditioning of this 
sort that should occur on early extinction 
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trials. (Note, again, that the PRF curves 
continue to increase on early extinction 
trials in the multiple-pellet groups.) The 
implication of the present hypothesis is a 
dual role for multiple-pellet rewards: (a) 
to allow for sufficient rp so that an N 
acquisition trial results in Rp; and (b) 
to produce multiple goal approaches on 
acquisition trials, so that the strength of 
sp APPROACH can be increased by 
repetition. 

As a final comment, these data have 
implications for studies concerned with 
conditioning and extinction as a function 
of magnitude of reward (e.g, Armus, 
1959; Ison & Cook, 1964), where mag- 
nitude of reward is defined by number of 
pellets. In these cases, it may be that 
magnitude effects (total weight of the 
reinforcer) and number of "trials" are 
completely confounded. There would 
seem to be some basis for urging that 
experiments of this sort define differential 
magnitude in terms of different sizes of 
single pellets. 
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Thresholds for the detection of the direction and the rate of frequency 
change for pure tones were determined. With constant initial fre- 
quencies of 125 to 1,000 Hz. for durations of .5-4 sec, the threshold 
frequency difference for detection of direction of frequency change 
closely approached that obtained in a conventional frequency discrim- 
ination test. The threshold rate of frequency change was inversely 
proportional to the duration of the frequency change. Tests with a 
variable starting frequency suggest that listeners can respond directly 
to frequency changes over the duration of the swept frequency tone. 
Tests with a relatively "pure" measure of rate of frequency change 
suggest that, at extremely short transition times, sensitivity to change 
in the rate of transition is nearly directly proportional to the frequency 
range subtended; at long transition times, sensitivity to change in 
the rate of transition is nearly independent of the frequency range 


subtended. 


The revival of interest in sensory 
gradients may be credited largely to 
Bekesy (1960, 1967) and Ratliff 
(1965). These investigators have 
pointed out the primary role played by 
inhibitory processes in sensory func- 
tion, especially in vision. Extensions 
to audition have run into the difficulty 
of determining the most appropriate 
analog of the visual phenomena. The 
most likely analog to Mach-band 
brightness gradients might be expected 
to be loudness gradients (Pollack, 
1963). Such an examination, how- 
ever, has not been revealing because of 
the difficulty of measuring differential 
loudness effects precisely in time. A 
more satisfactory method of demon- 
strating such effects appears to be 
short-term masking effects, as in the 
Zwicker (1965) “overshoot” phenom- 
enon, where masking is enhanced at 
temporal edges. 


1This research was supported by the 
National Science Foundation Grant 2894. 
Robert Parnes was responsible for running 
Exp. I-III. 


Because the cochlea performs a tem- 
poral-to-spatial transformation upon 
auditory information, it is tempting to 
seek frequency gradient effects as the 
analog of the visual spatial gradients. 
Indeed, Small and Daniloff (1967) 
have shown that the cutoff frequency 
of a moderately-sharply filtered band of 
noise (35 db/oct beyond the half-power 
points) can be discriminated by listen- 
ers. In addition, Zwisloki (1965) has 
shown how, on a theoretical basis, the 
Bekesy approach to gradients might 
throw light on the frequency selectivity 
of the ear. But the primary impetus 
for examining frequency gradient ef- 
fects has come from the neurophysiolo- 
gist. Whitfield and Evans (1965) 
have demonstrated that certain cortical 
neurons do not respond to steady tones, 
but respond vigorously to modulated 
tones. This result had been noticed 
earlier by Thurlow, Gross, Kemp, and 
Lowry (1951). Analogous findings 
have been reported for visual cortical 
neurons by Hubel and Wiesel (1959). 
Whitfield and Evans also found other 
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single cortical neurons which respond 
to specific direction of frequency 
change. Frequency changes in one di- 
rection, e.g., from high to low, will fire 
a unit, but frequency changes in the 
opposite direction will not. These find- 
ings have recently been confirmed and 
amplified at the level of the inferior 
colliculus by Nelson, Erulkar, and 
Bryan (1966). These investigators 
also found that some units responded 
to frequency modulated stimuli at fre- 
quencies lying completely outside of 
the "response area” of the unit. Suga 
(1965) examined cortical neurons of 
echo-locating bats, which are believed 
to extract information from frequency 
modulated signals. Of principal inter- 
est was that a unit's response to the 
direction of frequency modulation could 
be predicted from the shape of its re- 
sponse area. Inhibitory mechanisms 
are strongly implicated by Suga. 

Because the neurophysiological ob- 
servations suggest important "building 
blocks" for a conceptual picture of the 
processing of auditory information, a 
psychophysical study of thresholds for 
the detection of the direction of fre- 
quency change for pure tones was 
initiated. The author found an excel- 
lent study by Sergeant and Harris 
(1962) who determined thresholds of 
frequency change at a single frequency 
(1,500 Hz.) over a wide range of 
durations. 


METHOD 


Experimental variables—Three variables 
of a swept frequency tone were systematically 
manipulated: (a) the initial starting fre- 
quency; (b) the rate of change of frequency 
with duration; and (c) the duration of 
presentation. 

Apparatus.—A. Wavetex voltage-controlled 
oscillator, driven externally by a dc circuit, 
produced a linear increase or decrease in 
frequency over a 33-sec. period. The initial 
and terminal frequency were determined by 
a digital frequency counter. A programming 
circuit randomly scrambled the polarity of a 


dc ramp to provide either a positive or a 
negative frequency change. The duration of 
the tone was controlled by a Grason-Stadler 
electronic switch with a rise-time of 10 msec. 
A comfortable listening level, about 70-db. 
SPL, was employed with binaural earphones. 

Procedure.—In a one-interval test, listeners 
were exposed either to a rising frequency 
ramp or to a falling ramp. The listener was 
instructed to indicate the direction of fre- 
quency change. Except for control tests, the 
initial frequency was held constant within an 
experimental session. The rate of the fre- 
quency change was varied to encompass the 
75% correct identification level. A typical 
test involved 30 frequency changes at each of 
6-8 rates of frequency change. Following 
termination of the auditory signal, a voting 
period of 1.5 sec. was provided before pre- 
sentation of the subsequent signal. It is 
noted that, for short duration ramp tones, 
the perceived pitch tends to be identified with 
the terminal frequency. This phenomenon 
has been studied by Brady, House, and 
Stevens (1961). 

Subjects—Listeners were primarily music 
majors from the University of Michigan. 
Extensive training over several weeks pre- 
ceded data collection. Several crews of 
listeners were employed: 10 listeners par- 
ticipated in Exp. I; 8 of the listeners of 
Exp. I then participated with 1 new listener 
in Exp. II; 2 listeners participated in Exp. 
III; and 1 listener participated in Exp. IV. 


RESULTS 


Experiment I. Role of duration and 
initial starting frequency —The role of 
the duration of the swept frequency 
tone is examined in Fig. la. The 
ordinate is the median rate of fre- 
quency change required for 75% cor- 
rect detection; in Fig. la, the abscissa 
is the duration of presentation, the 
parameter is the initial starting fre- 
quency. The results for the individual 
listeners closely parallel the results for 
the median listener. 

A least-squares solution revealed that 
the experimental data were adequately 
fitted by a slope of —1, ie, by a 
direct inverse relation between thresh- 
old rate of frequency change and dura- 
tion. Stated differently, the total fre- 
quency change (rate of frequency 
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change multiplied by duration) was in- 
dependent of duration. Sergeant and 
Harris (1962) obtained a slope of 
about —1.25 on the double logarithmic 
coordinates of Fig. la from 75 msec. 
to nearly 10 sec., reflecting that smaller 
total frequency changes were associated 
with the longer duration ramp tones. 
Over the common range of durations 
(.25-5 sec.), their trained listeners 
yielded a slope of —14 (Sergeant & 
Harris, Table III, p. 1627). 

The abscissa and parameter of Fig. 
la are interchanged in Fig. 1b in 
order to examine the role of the initial 
starting frequency. Consistent differ- 
ences in the threshold rates of change 
are observed as a function of the initial 
starting frequency. The required slope 
is approximately directly proportional 
to the initial starting frequency. Stated 
differently, the relative threshold 
change, in percentage terms, is inde- 
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Fic. 1. Threshold detection of direction 
of frequency change for pure tones with a 
constant initial starting frequency. (The 
ordinate is the threshold rate of frequency 
change required to detect a positive from a 
negative frequency change of the same mag- 
nitude. In Fig. la, the abscissa is the dura- 
tion of signal presentation; the parameter is 
the initial starting frequency. The abscissa 
and parameter have been switched in Fig. 
lb. Each point is based upon a psycho- 
metric function of about 200 observations for 
each of 10 listeners. The dashed line repre- 
sents the threshold for steady tones, expressed 
in Hz, rather than Hz/sec.) 


pendent of the starting frequency. To 
be more exact, the slopes of the curves 
in Fig. 1b tend toward that of the 
steady tone (Exp. II), which has a 
slope of .88. 


Experiment II, Comparison with 
tones of fixed frequency.—In Exp. II, 
the author sought to “calibrate” the 
listeners and equipment in terms of 
frequency difference thresholds for 
tones of fixed frequency. The dc ramp 
generator was replaced by a dc battery. 
In a one-interval test, the polarity of 
the battery was programmed to yield 
steady tones above and below the refer- 
ence frequency. 

The effect of varying the duration of 
a steady tone with a reference fre- 
quency of 500 Hz. is presented as the 
dashed line in Fig. la. The ordinate 
is the threshold frequency difference, 
expressed in Hz., rather than Hz/sec. 
In contrast to the findings with swept 
tones, little effect is observed with 
steady tones as a function of duration 
between .5 and 4 sec. 

The effect of varying the frequency 
of the steady tone is presented in Fig. 
lb as the dashed line. There is a 
systematic lowering of the frequency 
discrimination thresholds with lower 
initial frequencies. The slope of the 
relationship is probably not different 
between steady and swept frequency 
tones. 

As a result of the extensive findings 
of Shower and Biddulph (1931), most 
summaries of the literature, e.g., that 
of Licklider (1951, Fig. 18) show a 
leveling off of the differential frequency 
threshold, Af, for reference frequencies 
below 1,000 Hz. This empirical func- 
tion is of more than passing interest 
because it plays a keystone role in the 
validation of the mel pitch function and 
in the validation of the unified concept 
of the critical band for masking by noise 
(cf. Licklider, 1951). 


538 


Recent experiments suggest that with 
well-trained listeners, Af may not be 
constant at lower frequencies, but 
rather decrease proportionally with f. 
For example, Small and Brant (1963) 
found that between 250 and 2,000 Hz., 
Af/f —.87. This slope is nearly in 
exact agreement with the slope of .88 
for the steady tones in Fig. lb. Simi- 
larly, some masking results demon- 
strated that the size of the critical ratio 
decreases proportionally with the mask- 
ing frequency down to 250 Hz. (Green, 
McKey, and Licklider, 1959). Thus, 
the close relation between frequency 
discrimination thresholds and fre- 
quency selectivity against masking 
noise, which is the basis of the unified 
concept of the critical band, may be 
intact. One simply must reconsider the 
shape of the function at lower sound 
frequencies. 


Experiment III. Effect of shifting 
the initial starting frequency.—The 
close correspondence between the re- 
sults for swept and steady frequency 
tones suggested that the listeners in 
Exp. I were responding to the fre- 
quency differences attained in the latter 
portions of the swept frequency tones, 
rather than to the direction of frequency 
change, per se. In order to check on 
this possibility, the author repeated 
some of the tests of Exp. I, but shifted 
the initial starting frequency at random 
on successive presentations. The as- 
sumed effect of this operation was to 
confuse a listener who was making his 
judgments of frequency change by com- 
paring the terminal frequency to the 
often-repeated constant initial fre- 
quency. 

The results of these tests suggest that 
listeners can discriminate direction of 
frequency change other than by com- 
paring the terminal frequency to the 
mean initial frequency. For a 500-Hz. 
swept tone subtending durations be- 
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tween .5 to 2 sec., one listener re- 
quired a 1.3-fold change in the thresh- 
old extent of frequency change (.7- 
-9 Hz) as the starting frequency var- 
ied from 0 to 10 Hz. Another listener 
required a 1.8-fold change (1.3-2.3 
Hz) under the same conditions. In 
both cases, the required threshold 
change was small relative to the shift 
in the starting frequency. When the 
initial starting frequency was varied 
over a still wider range, thresholds did 
not uniformly increase. These results 
suggest, then, that the listeners must 
have responded to the frequency change 
over the duration of the swept tone. 

In partial summary, thresholds for 
frequency differences were found to be 
nearly identical for swept frequency 
tones and for steady frequency tones. 
On the basis of tests with swept tones 
with a varying starting frequency, it 
was nevertheless concluded that listen- 
ers can respond to frequency changes 
over the duration of the swept tone. 


Experiment IV. Discrimination of 
"pure" rate of frequency change.—1t 
is difficult to achieve a pure measure of 
frequency change. Consider two tones 
that are swept at different rates of fre- 
quency change. If the duration of two 
swept tones is held constant, the swept 
tone with the greater rate of frequency 
change will subtend a wider frequency 
range. If the frequency range is held 
constant, the swept tone with the 
greater slope will subtend a shorter 
duration. In the first instance, dis- 
crimination may be based on frequency 
differences; in the second instance dis- 
crimination may be based on time dif- 
ferences. 

A preliminary approach to a pure 
measure is schematically illustrated in 
the insert to Fig. 2. A swept tone con- 
sisted of three segments: an initial 
steady-state segment; a terminal 
steady-state segment; and a transition 


DETECTION OF RATE OF CHANGE IN AUDITORY FREQUENCY 


segment spanning the steady-state seg- 
ments. If the total duration of the 
steady-state segments is long relative 
to the duration of the transition seg- 
ment, the principal variable will be the 
slope of the transition segment. 

In a one-interval test, the listener 
responded to whether the transition 
slope was relatively fast or slow. In a 
given set of trials, the transition time 
was varied and the frequency difference 
was held constant. A center frequency 
of 707 Hz. was employed. The total 
duration of the three segments was 5 
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Fic. 2. Threshold discrimination of rates 
of frequency change. (The insert graph 
schematically represents a tone with three 
frequency regions: a steady initial and 
terminal region and a transition between the 
two. The ordinate is the difference in transi- 
tion time between a reference signal, t, and 
the just-discriminable variable signal, te The 
abscissa is the frequency difference between 
the initial and terminal frequency regions. 
The parameter is the transition time of the 
reference signal. Each point is based upon 
a psychometric function of about 200 observa- 
tions by a single listener. The dashed refer- 
ence lines represent two extreme quantitative 
relationships.) 
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sec. The transition segment started 1 
sec. after the initial segment. The 
listener responded before termination 
of the tone so that the duration of the 
final segment did not serve as a dis- 
criminable cue. 

The experimental results are pre- 
sented in Fig. 2. The abscissa is the 
frequency difference subtended by the 
two steady-state segments; the param- 
eter is the duration of the transition 
segment of the reference signal and the 
ordinate is the additional duration of 
the transition segment of the variable 
signal required for 7596 correct dis- 
crimination. The coordinates are loga- 
rithmic. 

Two extreme hypotheses are repre- 
sented by dashed reference lines. The 
horizontal reference line represents 
transition sensitivity which is inde- 
pendent of the subtended frequency dif- 
ference. This reference condition is ap- 
proached for the longest reference tran- 
sitions, where the threshold transition 
change is about 80% of the reference 
transition. The —45° reference line 
represents sensitivity to transition 
changes which is inversely proportional 
to the frequency difference subtended 
by the initial and terminal segments. 
This reference condition is approached 
by the most abrupt transition (60 psec. 
transition smoothed by the earphones 
to about .1 msec). Despite the rela- 
tively high variability, intermediate 
conditions tend to sweep out results 
between the two extremes. 

The parameter and abscissa of Fig. 
2 are interchanged in Fig. 3. The 
dashed reference lines represent cor- 
responding hypotheses. The horizontal 
line represents threshold transition dif- 
ferences which are independent of the 
transition time. This condition is ap- 
proached for the smaller frequency dif- 
ferences. The 45° reference line repre- 
sents transition sensitivity which is a 
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Fic. 3. Threshold discrimination of rate 
of frequency change. (This graph represents 
a replotting of Fig. 2 with the abscissa and 
parameter interchanged.) 


constant proportion of the reference 
transition. This condition is ap- 
proached at the larger frequency dif- 
ferences. Intermediate conditions tend 
to sweep out results between the ex- 
tremes. 

Additional tests demonstrate that the 
sensitivity for detection of transition 
differences is, over a range of 60 db., 
nearly independent of sound level. At 
sound levels near the absolute thresh- 
old, however, the click introduced by 
a rapid frequency change (Neustadt, 
1965) becomes inaudible and sensi- 
tivity suffered for abrupt frequency 
gradients. 


DISCUSSION 


The picture painted by the results of 
the several experimental series suggests : 
excellent auditory discrimination for the 
direction of frequency change (Exp. I) 
which is nearly equal to that achieved for 
steady tones (Exp. II) ; good discrimina- 
tion of direction of frequency change de- 
spite changes in the starting frequency 
(Exp. III) ; and, two different ways of 
sensing differential rates of frequency 
change—for sharp transitions, threshold 
temporal gradients are proportional to the 
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frequency difference subtended; for slow 
transitions, threshold temporal gradients 
are proportional to the temporal difference 
subtended (Exp. IV). As in the case of 
pitch, the auditory system apparently 
solves the problem of successfully process- 
ing an extremely wide range of transition 
conditions by the employment of different 
modes of operation. 

Auditory theorists, notably Gabor 
(1946), have attempted to specify the 
trade off between a frequency analysis and 
a temporal analysis. In particular, Gabor 
postulated a basic uncertainty relationship 
between the precision of each form of 
analysis (At-Af — 1). An exquisitely 
sharp frequency analysis is simultaneously 
associated with a coarse time analysis, 
For example, for small frequency differ- 
ences in Fig. 3, temporal gradient sensi- 
tivity is uniformly poor over a wide range 
of transition times. An exquisitely sharp 
temporal analysis is simultaneously asso- 
ciated with a course frequency analysis. 
For example, for large frequency differ- 
ences in Fig. 3, temporal gradient sensi- 
tivity is sensitive to the transition time 
over a wide range. (A discussion of 
Gabor's mathematical development may be 
found in Littler, 1965.) Most relevant 
here is the postulation of a neural scan 
with a scanning period of up to 250 msec. 
which operates upon a cochlear frequency 
analysis. The change in the empirical 
functions in Fig. 2 between 200 msec., 
where gradient discrimination is depen- 
dent upon the subtended frequency differ- 
ence, and 400 msec., where gradient dis- 
crimination is, over a wide range, 
independent of the subtended frequency 
difference, may be suggestive of Gabor's 
postulated scan. It remains to be deter- 
mined whether other features of the basic 
uncertainty relationship can be revealed 
by an analysis of frequency gradients. 
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RESPONSE-CLASS SIMILARITY AND FIRST-LIST RECALL 
WITH MIXED AND UNMIXED TRANSFER DESIGNS: 
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When interlist similarity of responses in the A-B, A-C paradigm is 
varied with unmixed lists, differences in first-list recall may be at- 
tributed to the unlearning of responses which occurred during inter- 
polated learning (IL) or to competition between responses at recall. 
The mixed-list transfer paradigm eliminates the potential effects of 
similarity on unlearning. Retention of List-1 responses followed by 
same-class or different-class responses in IL was investigated with 


mixed- and unmixed-list designs. 
in retention loss as a function of 


utable to differences in elicitation 
during IL rather than to competiti 


In the A-B, A-C paradigm, retention 
decrement on an unpaced test of recall 
is assumed to reflect a relatively perma- 
nent reduction in response strength 
(unlearning) rather than a temporary 
blocking or confusion between re- 
Sponses (competition) which occurs 
during recall (Howe, 1967; Postman, 
1961). An important distinction be- 
tween competition and unlearning is 
the assumed locus of retention decre- 
ment: at the time of recall vs. during 
interpolated learning (IL). It is pos- 
sible that retention loss which has been 
attributed to unlearning may instead 
reflect specific competition between re- 
sponses which have been learned to the 
same stimulus. The dual association 
of A with B and with C might impede 
the retrieval of B, even on an unpaced 
test of recall. The process has been 
conceptualized as a blocking or crowd- 
ing of one response by another. Pre- 
sumably, both responses remain in 
storage but only one, or neither, is 
readily accessible for recall. 

If List-1 responses occur overtly or 
covertly during IL and are incorrect, 
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The results indicate that differences 
response-class similarity are attrib- 
and unlearning of List-1 responses 
on at recall. 


it is assumed that they are weakened 
or unlearned (Postman, 1961). An 
increase in the frequency of elicitation 
of List-l responses during IL should 
lead to an increase in unlearning. 
Underwood and Schulz (1960) have 
shown that Ss tend to restrict their 
responses to items from the categories 
Which are represented on the list. It 
has been assumed that a change in the 
class of responses from List 1 to List 
2 will facilitate response restriction 
(Postman, Keppel, & Stark, 1965). 
The occurrence of List-1 responses 
during IL will be more frequent when 
List-2 responses are from the same 
than when they are from a different 
category. In accordance with the elici- 
tation hypothesis, retention loss on an 
unpaced test of recall is greater when 
form class of List 1 and List 2 re- 
sponses is identical than when form 
Class is different (Friedman & Rey- 
nolds, 1967; Postman et al, 1965). 
These results may also be explained in 
terms of competition at recall. That 
is, the difference in List-1 recall be- 
tween groups which have the same and 
groups which have a different class of 
responses in List 2 may reflect greater 
difficulty of retrieval of the B member 
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of a pair when C is similar to B, rather 
than greater elicitation and unlearning 
of B during IL. 

The purpose of the present study was 
to identify the locus of retention decre- 
ment in the A-B, A-C paradigm when 
response-class similarity is manipu- 
lated. The effects of this variable were 
investigated under two conditions, one 
which permitted and one which pre- 
vented the use of similarity as a basis 
for the elimination of errors during IL. 
With an unmixed-list design, all re- 
sponses in IL were either from the 
same or from a different class of items. 
The change in response class from 
List 1 to List 2 under the "different 
class" condition provided a reliable 
basis for the elimination of List-l re- 
sponses as errors. With a mixed-list 
design, half of the responses on List 2 
were from the same and half were from 
a different class. In this case, response- 
class similarity does not provide a basis 
for the elimination of List-1 responses. 
There is no list rule of exclusion which 
may be applied (Postman, 1966). If 
the unpaced test of recall reflects un- 
learning of A-B which occurred dur- 
ing IL, then similarity of B and C will 
influence recall only when List-2 re- 
sponse class is homogeneous. On the 
other hand, if the difficulty of retrieval 
of B at the time of recall is a function 
of the similarity between B and C, 
there should be no interaction between 
the procedure used in IL (mixed vs. 
unmixed list) and similarity of re- 
sponse class. 


METHOD 


Design.—Four groups learned a first. list 
of 10 paired associates with numbers as 
stimuli and letters as responses. IL con- 
formed to the A-B, A-C paradigm for all 
groups, with B and C either from the same 
class (letters) or from different classes 
(letters and adjectives). At the end of IL, 
Ss were asked to recall the List-l response 
to each stimulus on an unpaced test. There 
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were two groups which learned an unmixed 
list (UL) and two groups which learned a 
mixed list (ML) in IL. The 16 Ss in Group 
UL-S learned 10 number-letter pairs (S- 
pairs), and the 16 Ss in Group UL-D 
learned 10 number-adjective pairs (D-pairs). 
A mixed list (ML) consisting of 5 S-pairs 
and 5 D-pairs was learned by a total of 32 
Ss. The ML Ss were divided into Groups 
ML-S and ML-D. The groups received the 
same treatment, and were differentiated only 
on the basis of the set of 5 List-l pairs 
which was to be used in the comparable-set 
analysis of recall scores. For the com- 
parable-set analysis, each S who learned a 
mixed list contributed a score on 5 pairs 
from OL, all of which were followed by 
either S-pairs in IL (Group ML-S) or by 
D-pairs in IL (Group ML-D). With this 
procedure, the analysis of recall data may be 
done by either (a) adjusting the recall score 
of each S in the UL groups to the base five 
or (b) selecting a set of 5 OL pairs for 
each UL S to correspond with the 5 OL 
pairs used in the analysis for ML Ss. The 
latter procedure was employed, and the set 
of pairs to be used in the comparable-set 
analysis was determined randomly for each 
S prior to the running of the experiment. 
Materials—The stimuli in all lists were 
the numbers 1-10. Twenty letters of the 
alphabet, excluding the five vowels and the 
letter W, were used as responses. The letters 
were divided randomly into two sets of 10 
with the following restrictions: each set con- 
tained five letters from the first half and five 
from the second half of the alphabet; no 
more than two consecutive letters of the 
alphabet appeared in the same set. The 
letters in Set 1 were used as responses in 
OL and the letters in Set 2 were used as 
responses for the S-pairs in IL. The re- 
sponse terms in OL and IL (S-pairs) were 
double letters (RR, GG, etc.) in order to 
equate pronunciation times for letter and 
adjective responses. Responses for the D- 
pairs in IL were 10 two-syllable adjectives 
which had minimal similarity of meaning. 
Two different pairings of stimuli and re- 
sponses were used to form two lists for OL. 
The pairings were random, with the exclu- 
sion of rhymes and obvious associations. 
Similarly, two lists of 10 S-pairs and two 
lists of 10 D-pairs were constructed for 
Groups UL-S and UL-D. If the C term 
formed a rhyme or an alphabetical sequence 
with the corresponding B term from OL, 
the pairing was excluded. Each of the UL 
lists was divided independently and randomly 
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into two sets of five pairs. The sets of S- 
pairs and D-pairs were combined appropri- 
ately to form two mixed lists. The arrange- 
ments permitted each set of five items from 
OL to be followed by A-C learning with a 
particular set of S-pairs or D-pairs by an 
equal number of Ss in the ML and UL 
groups. The experiment was thus replicated 
over two different OL and IL lists and four 
different sets of items for analysis of recall. 

Procedure.—Lists were presented on a 
Stowe memory drum at a 2:2-sec. rate with 
a 4-sec. intertrial interval. There were four 
different orders of presenting the pairs in OL 
and IL. Each order was used equally often 
as a starting order within each group. List-1 
learning was carried to a criterion of 5 cor- 
rect. The interlist interval was 2.5 min. 
Prior to the first trial of IL, Ss were in- 
formed of the nature of the responses on 
List 2. List-2 learning was continued for 
15 trials, and was followed by a test of recall 
of List-l responses. The test began 15.8 
min. after the last trial of OL. The stimuli 
were presented one at a time in the drum 
window ; the correct List-2 response was also 
Shown. The S’s task was to call out the 
List-1 response that went with each stimulus, 
and the test was paced by S. The stimuli 
were presented in one of the four orders 
which had been used in IL. 

Subjects.—Sixty-four students at the Uni- 
versity of California served as Ss in fulfill- 
ment of a course requirement. The Ss were 
assigned to groups in blocks of four in order 
of appearance at the laboratory. Each block 
contained one S from each group. Particular 
lists, combinations of lists, starting orders, 
and comparable sets of items were balanced 
within groups. 


REsuLTS AND Discussion 


Original learning.—The mean num- 
ber of trials to the criterion of $ cor- 
rect on List 1 was 10.2.. The means 
ranged 8.5-124. A three-way analysis 
of variance of number of trials to cri- 
terion in OL yielded no significant 
main effects or interactions (all p’s > 
.10), indicating that the lists were of 
equal difficulty and the groups were of 
comparable ability on the learning task. 

Interpolated learning.—The analyses 
of List-2 learning are based on a single 
set of five pairs for each S in the ML 
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and UL groups. The set which is in- 
cluded for each S is composed of the 
A-C items which correspond to the A-B 
items used in the analysis of recall. 
This provides independent measures 
for a direct comparison of performance 
on S and D pairs under ML and UL 
conditions. Two analyses of variance 
of number of correct responses in IL 
were done, one for Trials 1-5 and one 
for the entire course of IL. The re- 
sults of the analyses were essentially 
the same, and only the latter will be 
reported. There was, of course, a sig- 
nificant increase in number of correct 
responses over trials, F (14, 784) — 
143.2, p < .01. The effects of Type of 
Pair (S vs. D) and Type of List (ML 
vs. UL) were not significant, and none 
of the interactions were significant (all 
p’s > .10). The results of both analy- 
ses may be summarized as follows: 
there was no difference in number cor- 
rect on S-pairs and D-pairs under ML 
or UL conditions, and the rate of 
learning S- and D-pairs in the ML 
and UL groups was the same. Since 
the conclusions of the study were based 
on the nature of the interaction of Type 
of Pair with Type of List in recall of 
List-1 responses, it is important to 
note that the course of IL for S-pairs 
and D-pairs is the same in ML and 
UL groups. It seems clear that po- 
tential competing responses, either S- 
terms or D-terms, are learned equally 
well under ML and UL conditions. 
Similarly, the number of opportunities 
for the occurrence of List-1 responses 
as errors is comparable for ML and 
UL groups. 

Intrusions.—The pattern of intru- 
sions during IL, supports the assump- 
tion that the frequency of elicitation of 
List-1 responses is a joint function of 
similarity of response class and homo- 
geneity of type of response on List 2, 
In the UL groups, where a difference 
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in response class provided a basis for 
elimination of List-l responses, there 
were no List-l intrusions in Group 
UL-D while there were 16 List-l in- 
trusions contributed by nine Ss in 
Group UL-S. In the ML groups, there 
were 10 and 7 List-l intrusions in 
Groups ML-S and ML-D, respectively, 
contributed by five Ss in each group. 
List-1 intrusions occurred with approx- 
imately equal frequency in place of the 
correct S and D responses in the ML 
groups. 

Recall.—In order to answer the main 
question which is raised in this study, 
it is necessary to evaluate the inter- 
action of similarity of response class in 
IL with mixed vs. unmixed lists. For 
the comparable-set analysis, only the 
preselected set of five A-B pairs for 
each S in the ML and UL groups is 
used. The mean number of responses 
recalled in Groups UL-S, UL-D, 
ML-S, and ML-D was 2.8, 3.9, 3.8, 
and 3.0, respectively. A three-way 
analysis of variance, including replica- 
tions with different lists, yielded a sig- 
nificant interaction of Type of Pair in 
IL (S vs. D) with Type of List (ML 
vs. UL), F (1, 56) = 12.03, p < .01. 
The other main effects and interactions 
were not significant, p’s > .10. With 
an unmixed list, retention of A-B is 
lower when C is from the same re- 
sponse class than when C is from a dif- 
ferent response class; with a mixed list 
the relationship is reversed. The re- 


sults in the UL groups confirm prior 


findings on the effects of response-class 
similarity. 

Considering all 10 items for each S, 
the mean recall (adjusted to the base 
5) in Groups UL-S and UL-D was 
2.9 and 4.0, respectively. These re- 
sults are in close agreement with the 
means based on five pairs per S in the 
comparable-set analysis. Considering 
all pairs learned by 32 Ss in the ML 
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groups, the mean number of List-1 re- 
sponses correct at recall is 3.6 for those 
A-B items which were followed by S- 
pairs in IL and 3.6 for those A-B items 
which were followed by D-pairs. The 
equivalence of recall for this within-Ss 
comparison strongly suggests that the 
difference between ML-S and ML-D 
which was found with a between-Ss 
comparison (comparable-set analysis) 
is a chance difference. The magnitude 
of the interaction between Type of 
Pair and Type of List in the com- 
parable-set analysis is, of course, in- 
creased by the direction of the differ- 
ence between ML-S and ML-D. The 
significance of the interaction is con- 
firmed, however, by the equivalence of 
recall in the ML condition (within-Ss 
comparison) as contrasted with the sig- 
nificant difference between UL-S and 
UL-D based on all 10 items per S, F 
(1, 30) = 10.56, p < .01. 

The results clearly indicate that simi- 
larity of response class influences recall 
of List-1 responses only when the pro- 
cedure used in IL provides a basis for 
the elimination of List-1 responses as 
errors, ie. in the unmixed-list condi- 
tion. It may be assumed that the 
change in response class in Group 
UL-D functions to reduce the fre- 
quency of elicitation of B during IL, 
and hence reduces the unlearning of 
A-B. In the ML condition, where 
List-l responses canont be eliminated 
on the basis of a change in response 
class, recall of each List-1 response is 
independent of the similarity of the re- 
sponse which was learned in IL. The 
results are thus in complete agreement 
with the assumption that unlearning 
rather than competition accounts for 
the effect of response-class similarity 
on recall. 

The fact that the level of List-1 re- 
call in ML was intermediate between 
UL-S and UL-D requires further in- 
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vestigation. The introduction of a new 
class of responses in ML may reduce 
intertask similarity, thereby reducing 
the overall rate of elicitation of List-1 
responses in ML relative to UL-S. In 
these terms, the results of studies of 
contextual change and RI (Bilodeau & 
Schlosberg, 1951; Greenspoon & Ran- 
yard, 1957) are consistent with the 
finding of better recall in ML than in 
UL-S, and may reflect the same under- 
lying mechanism. The present results 
add to the increasing range of phenom- 
ena which are encompassed by assump- 
tions about the process of unlearning. 
The specification of variables which in- 
fluence the occurrence of first-task re- 
sponses during the learning of sub- 
sequent tasks promises to be a useful 
analytical tool for the future study of 
forgetting. 
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WITHIN-SUBJECTS PARTIAL REINFORCEMENT EFFECTS 
VARYING PERCENTAGE OF REWARD TO THE 
PARTIAL STIMULUS BETWEEN GROUPS: 


KAREN GALBRAITH, MICHAEL E. RASHOTTE, ann ABRAM AMSEL 


University of Toronto 


3 groups of 10 rats were run 4 trials a day under split-alley discrim- 
ination conditions. Ss in all groups were continuously rewarded in one 
runway and reward in the other runway was varied (50, 25, or 0%) 
between groups. All groups were then extinguished to both runway 
stimuli. The acquisition data replicated earlier within-Ss data obtained 
in simple black and white runways. All groups showed a generalized 
partial reinforcement effect in extinction. 


Recently, experiments in which par- 
tial (PRF) and continuous (CRF) 
schedules of reinforcement are pre- 
sented to single organisms (within-Ss) 
in relation to differential stimuli in a 
runway have indicated that running 
speed is influenced by PRF and CRF 
within-Ss in a manner similar, but not 
identical, to that in the between-Ss case 
during acquisition; in extinction the 
findings are quite different in the two 
cases (Amsel, MacKinnon, Rashotte, & 
Surridge, 1964; Amsel, Rashotte, & 
MacKinnon, 1966; Brown & Logan, 
1965; Henderson, 1966; Ludvigson, 
1966; Spear, 1964; Spear & Pavlik, 
1966; Spear & Spitzner, 1967). Be- 
tween-Ss experiments have routinely 
shown the partial reinforcement extinc- 
tion effect (PRE), greater resistance to 
extinction (higher speeds and lesser 
slopes) after PRF training than after 
CRF training; within-Ss conditions, 
using black and white alleys as dis- 
criminanda, have shown what Brown 
and Logan (1965) have termed the 
generalized partial reinforcement effect 
(GPRE), equal PRE-like performance 
to both stimuli. 

1 Supported by research Grants GB-3772 
from the National Science Foundation and 
APB-72 from the National Research Council 
of Canada. 


The present experiment was intended 
primarily to furnish extinction data 
after within-Ss partial reinforcement 
training under several conditions of 
percentage of reward to the partial 
stimulus. It is similar to an experiment 
by Henderson (1966) who, employing 
black and white straight runways as 
discriminanda, compared the acquisition 
performance of within-Ss groups given 
100% reward to one stimulus and 
either 100, 50, 25, 12, or 0 to the other. 
However he switched his groups to 
discrimination reversal rather than 
within-Ss extinction training. 

Kolesnik and Amsel (1966) have 
demonstrated that split alleys (half of 
each runway is painted white and half 
black, the positions of the halves being 
reversed in two runways) can serve as 
effective discriminanda for a successive 
discrimination and have suggested that, 
since the split alleys are equated for 
absolute brightness, such discriminanda 
would eliminate the Brightness x Rein- 
forcement-Schedule interactions found 
in some within-Ss PRF experiments 
which employed simple black and white 
runways (e.g. Amsel, Rashotte, & 
MacKinnon, 1966). A secondary fea- 
ture of this experiment is that it was 
conducted in such split alleys. 
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METHOD 


Subjects—The Ss were 30 male albino 
rats obtained from Woodlyn Farms, Guelph, 
Ontario, when approximately 90 days of age 
and were approximately 120 days old at the 
beginning of the experiment. 

Apparatus—Two parallel runways were 
employed, either of which could be aligned 
with a gray entry-box-start-box unit. The 
apparatus was identical to that described by 
Amsel, Rashotte, and MacKinnon (1966) ex- 
cept that the floor, walls, and metal retrace 
doors of each runway were painted as a “split 
alley”, in the manner described by Kolesnik 
and Amsel (1966): for runway BW the 
left half was black and the right half white, 
and the reverse was the case for runway 
WB. Briefly, the runways were 39 in. long, 
2% in. wide, 3% in. high and covered with 
clear Plexiglas. A metal guillotine door, 
134 in. from the end wall of the runways, 
served to confine S to the goal area. Time 
to traverse three successive 1-ft. segments 
of the runways was recorded on each trial 
by means of a system of clocks and photo- 
cells. 

Procedure.—Three weeks before the be- 
ginning of the experiment, Ss were placed 
on a 23-hr. 10-gm. food deprivation schedule. 
During this period each S had access to water 
ad libitum and was handled for a few min- 
utes every day. There was no habituation 
or prefeeding in the apparatus. 

The experiment consisted of 168 acquisition 
and 48 extinction trials. Each S received 4 
trials a day, 2 in the percentage runway (S1) 
and 2 in the CRF runway (Sz). The Ss 
were assigned to Groups 50-100, 25-100, 
and 0-100 in a quasirandom fashion. The 
hyphenated numbers in the group desig- 
nations refer to the percentages of reward 
received in the S; and S; runways, respec- 
tively, in every 2-day block of trials. Thus, 
Group 50-100 received 50% reward in S; 
and 100% in Ss Each group was divided 
into two subgroups of five Ss. The BW run- 
way was S, and the WB runway S; for one 
subgroup and the runway-reinforcement con- 
tingency was reversed for the other. This 
arrangement insured counterbalancing of 
stimulus-reinforcement contingencies within 
each group. 

The sequence of presenting the S; and Ss 
stimuli during the four daily trials was de- 
termined, separately for each S, by randomly 
assigning one of the following four-trial se- 
quences to each of the days in successive 
four-day blocks for the duration of the ex- 
periment: S,$:$:$s, S:S2S2S, SsS:S:Ss, and 


S.S,S,Si. For the 50-100 group reward was 
presented on only one of the S; trials, ran- 
domly chosen, on each day; for the 25-100 
group reward was presented on one of the S, 
trials in every two-day block; all Si trials 
were nonrewarded for the 0-100 group. Re- 
ward was one 500-mg. Noyes pellet and Ss 
were removed from the goal as soon as they 
stopped eating, or on nonrewarded trials, 
after 15-20 sec. 

The Ss were run in two squads of 15, with 
5 from each group randomly assigned to 
each squad. During extinction S was re- 
moved from the runway if it failed to 
traverse any 1-ft. segment within 60 sec. 
which was the score assigned to succeeding 
segments on such occasions. Other details 
of experimental procedure were identical to 
those reported by Henderson (1966). 


RESULTS AND DISCUSSION 


The three successive 1-ft. time mea- 
sures taken on each trial (start, run, 
and goal times) were transformed to 
speeds (1/time). Figure 1 shows 
mean speed in the S, and S, runways 
during acquisition and extinction for 
each group. 

All groups appeared to run faster in 
the S, runway in both the run and goal 
measures early in training. Analyses 
of variance conducted separately for 
each group and measure over Blocks 
1-5 confirmed these differences (sig- 
nificant reinforcement effect, p < .01 
for all but Group 25-100 analyses 
where p < .05). The only other sig- 
nificant effects yielded by these analy- 
ses were a main effect of Day in each 
analysis and one Subgroup x Rein- 
forcement interaction which occurred in 
the analysis of goal speeds for the 50— 
100 group, F (1, 8) = 26.00, p < .01. 
Subsequent comparisons indicated that 
this interaction resulted from sub- 
stantially higher S, speed for the sub- 
group with the BW runway as S,. 

Amsel (1967) has provided a sum- 
mary of data from other within-Ss ex- 
periments in which faster speeds in the 
low-percentage runway were obtained 
early in acquisition and has suggested 
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Fic. 1. Mean start, run, and goal speed in S; and S; runways for groups 
50-100, 25-100, and 0-100 during acquisition and extinction. 


a tentative rationale for this finding in 
terms of the development of anticipa- 
tory frustration. 

Analyses of variance of speeds from 
Block 6 until the end of acquisition 
were conducted separately for each 
group and measure. For the 50-100 
group, performance in the two runways 
differed only in the run measure, F (1, 
8) = 21.08, p < .01, whereas for Group 
25-100 the only difference occurred in 
the goal, F (1, 8) = 24.39, p< 01. 
Reliable differences between S, and S, 
performance were found in all mea- 
sures for Group 0-100, Fs (1, 8) = 
37.01, 11.35, and 90.83 for the start, 
run, and goal, respectively, p’s < .01. 
It is to be noted that the difference 
Obtained in the run measure for the 
50-100 group was due to faster S, than 
S, speeds, whereas reliable differences 
within the other groups were in the 
Opposite direction. Day X Reinforce- 
ment interactions, which were obtained 
in the start- and run-measure analyses 
for Group 25-100, Fs (31, 248) — 1.67 
and 1.63, respectively, p’s < .05, re- 


flected unstable relationships between 
S, and S, speeds during middle and 
late acquisition (see Fig. 1). A Day 
X Reinforcement interaction in the 
analysis of goal-measure data for the 
0-100 group, F (31, 248) = 6.19, p < 
-01, indicated the divergence of S, and 
S, speeds. Aside from a few significant 
Day effects, which are essentially un- 
interesting, the only other reliable effect 
obtained in these analyses was one 
Subgroup X Reinforcement interaction 
in the goal measure for Group 25-100, 
F (1, 8) = 5.66, p < .05. This inter- 
action was produced by earlier separa- 
tion of S, and S, speeds for the sub- 
group with the BW runway as S,. 
Both S, and S, terminal acquisition 
speeds of the three groups were com- 
pared over the last two acquisition 
blocks. Speed in the S, runway was di- 
rectly related to percentage of reward: 
the only significant effect in these anal- 
yses was percentage, Fs (2, 24) = 
3673, p< 01; 487 p<.05; and 
19.19, p < .01 for the start, run, and 
goal measures, respectively. Similar 
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analyses of asymptotic S, speeds failed 
to show any relation between per- 
centage of reward in S, and speed in S,. 
Although such a relation appeared 
especially obvious in the start measure, 
orthogonal comparisons of pairs of 
means in that measure revealed only 
that Group 50-100 ran faster than 
Group 0-100 in S, t(24) — 175, p 
« .05. 

For the most part, the present ac- 
quisition data replicated those reported 
by Henderson (1966) for groups 
trained in simple black and white run- 
ways. However, Henderson's finding 
that speed in the 100% runway was a 
direct function of percentage of reward 
in the other runway was not obtained in 
the present experiment. 

Extinction data for each group and 
measure were statistically assessed by 
analyses of variance. Every analysis 
yielded day effects. Nothing else was 
significant for the 50-100 group; for 
the 25-100 group, reinforcement was 
significant in the run measure, F (1, 
8) = 5.88, p < .05; and for the 0-100 
group, Reinforcement and Day X Re- 
inforcement were significant in all mea- 
sures. The only other reliable effect 
was a Subgroup X Day interaction ob- 
tained in the start-measure analysis for 
the 0-100 group. F (11. 88) = 2.13, 
p < .05. 

The lack of difference between per- 
formance in the S, and S, runways 
during extinction for the 50-100 group 
is consistent with earlier data obtained 
in simple black and white runways 
(e.g., Amsel et al, 1966). The 25- 
100 group's extinction data were also 
consistent with the earlier findings (the 
significant reinforcement effect yielded 
in the analysis of run-measure data for 
this group indicated maintenance of 
terminal acauisition differences rather 
than less resistance to extinction in the 
S, runway). The within-group statis- 
tical analyses of extinction data for the 
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0-100 group are diffcult to interpret 
due to extreme differences in terminal 
acquisition levels which account, at 
least in part, for the Reinforcement 
and Day X Reinforcement effects found 
in all measures. Such statistical effects, 
found when S, and S, terminal acquisi- 
tion levels are equal, would normally 
indicate a PRE. 

Comparison among the S, extinction 
curves for all groups is free of differ- 
ences in terminal acquisition levels and 
allows the generalized PRE to be as- 
sessed. Only one reliable effect was 
obtained in these analyses, a Day X 
Group interaction in the start measure, 
F (22, 264) — 2.66, p « .05, which re- 
flects flattening of the 0-100 curve, 
relative to the two other curves, after 
the third block of trials. In the run 
and goal measures, then, these analyses 
suggest equal resistance to extinction 
in the S, runway whether 50, 25, or 
096 reward was received in the S, run- 
way. In the start measure the 0-100 
group would appear to be most re- 
sistant to extinction. If 168 acquisition 
trials can be regarded as sufficient 
training, these present findings do not 
support Brown and Logan's (1965) 
conclusion that after extended training 
resistance to extinction of the contin- 
uously rewarded response in a 0-100 
discrimination decreases in comparison 
to a 50-100 discrimination. 

The similarity of the present data 
to those previously collected in simple 
black and white runways suggests that 
the split alleys are no less effective 
stimuli. Systematic statistical evidence 
that these alleys have any properties 
which differentially influence speed or 
interact with other experimental manip- 
ulations was lacking, although there 
were isolated instances of interactions 
involving subgroups. On the basis of 
the present evidence, split alleys would 
appear to have fewer disadvantages 
than simple black and white runways 
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for the study of partial reinforcement 
effects within single organisms. 
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SOLVING WORDS AS ANAGRAMS: 
II. A CLARIFICATION: 


BRUCE R. EKSTRAND 
University of Colorado 
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ROGER L. DOMINOWSKI 
University of Illinois, Chicago Circle 


Beilin and Horn (1962) and Ekstrand and Dominowski (1965) have re- 
ported that an anagram is more difficult to solve if it is a word (W prob- 
lem) than if it is a nonsense anagram (N problem.) These results were 
questioned by Mayzner and Tresselt (1965). On the basis of further ex- 
perimentation using the design of Mayzner and Tresselt it was concluded 
that while their criticisms were possibly correct, their experiment did not 
clearly show that W problems were equal in difficulty to N problems. The 
present results showed that W problems were more difficult than N prob- 
lems, supporting the findings of Beilin and Horn (1962) and Ekstrand 


and Dominowski (1965). 


In 1962, Beilin and Horn reported 
that an anagram is more difficult to 
solve if it already constitutes an 
English word (and S must find another 
solution word) than if the anagram is 
nota word. The former type of ana- 
gram will be referred to as a word 
problem (W) and the latter type as a 
nonsense problem (N). The question 
of whether W or N anagrams are more 
difficult to solve dates back to at 
least 1935. Several studies have 
been carried out since that time 
(Devnich, 1937; Hollingworth, 1935, 
1938; Nissenson & Sargent, 1941; 
Sargent, 1940). These experiments 
left the issue unresolved until neces- 
sary controls were employed in the 
Beilin and Horn (1962) experiment. 
In 1965, Ekstrand and Dominowski 
replicated and extended the findings 
of Beilin and Horn (1962). 

Subsequently, Mayzner and Tres- 
selt (1965), employing a different ex- 
perimental design, failed to find any 
difference between W and N problems. 
Since Mayzner and Tresselt viewed 
their design as having the better con- 


1The authors wish to express their ap- 
preciation to Carl P. Duncan for his encour- 
agement and many helpful suggestions 
throughout this research. 
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trols, they concluded that the findings 
of Beilin and Horn (1962) and of Ek- 
strand and Dominowski (1965) were 
wrong. 

The experiments of Beilin and Horn 
(1962) and Ekstrand and Dominowski 
(1965) employed essentially the same 
design. The basic problem (accord- 
ing to Mayzner & Tresselt, 1965) was 
that solution words of W problems 
were different from solution words of 
N problems. Mayzner and Tresselt 
(1965) correctly pointed out that the 
increased difficulty of W problems 
may have been due to either of two 
factors: (a) W problems in general are 
more difficult, or (b) the solution words 
of the W problems are just more diffi- 
cult answers to obtain than those of 
the N problems. Although Ekstrand 
and Dominowski (1965) equated the 
solution words for the W and N prob- 
lems on known effective variables, it 
is conceivable that the two sets of so- 
lution words differed on some un- 
known effective variable such that the 
W solutions were more difficult than 
N solutions. Mayzner and Tresselt 
(1965) used a design in which the W 
and N problems had the same solu- 
tion words. For example, an N prob- 
lem was threa and a corresponding W 
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problem was earth with both solutions 
being heart. 

There are certain problems with the 
Mayzner and Tresselt (1965) experi- 
ment. First, their N and W condi- 
tions were run at different times so the 
groups may not have been comparable. 
Secondly, if the W and N groups have 
the same solution words, it is impossi- 
ble to have exactly the same anagram 
letter order for the W and N groups. 
Questions arise when the procedure of 
Mayzner and Tresselt's (1965) experi- 
mentisexamined. An Swas presented 
with the N problem threa and was to 
solve for heart, However, S could 
give the answer earth, a correct solu- 
tion of no interest since the W-con- 
dition Ss were solving for heart given 
the anagram earth. Since there were 
two "answers" for each N problem, 
Mayzner and Tresselt had N-condition 
Ss solve for both answers, but em- 
ployed only the response measure for 
the “critical” solution, heart. A prob- 
lem arises when S gives the "'noncriti- 
cal" answer first. In the Mayzner 
and Tresselt procedure, S merely con- 
tinued working for the other solution. 
It is not clear what time score was 
used when the critical solution was 
the second one given. Furthermore, 
the most appropriate time score is not 
logically clear (see Beilin, 1966). If 
the total-time score (time from pres- 
entation of the anagram to obtaining 
the solution heart, even though the 
solution earth intervened) was used, 
there is clearly a bias which would 
serve to increase the apparent diffi- 
culty of N anagrams. Use of the net 
time score (time from, S solving earth 
to the time for heart) would perhaps 
produce a bias in the other direction. 

Perhaps even the net-time score is 
biased in the direction of finding the N 
problems to be more difficult. As- 
sume; that W problems are, in fact, 
more difficult than N problems, and 
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that S in the N condition first solves 
the noncritical answer earth. At this 
point, perhaps S is no longer in the N 
condition, for he now is faced with a 
type of W problem; e.g., he has solved 
earth and is trying to get to heart. Al- 
though a nonsense anagram is in front 
of S, the first solution obtained may 
continue to be elicited by the anagram 
thereby producing interference, or S 
may "work from" the W anagram, 
earth, rather than from the N anagram 
threa. 

In the present experiment an at- 
tempt was made to arrange the condi- 
tions such that Ss would almost al- 
ways solve the critical answer first. 
The basic design of Mayzner and Tres- 
selt (1965) was used, with two changes 
designed to increase the number of 
critical solutions given as first solu- 
tions. First, the problems were 
chosen so that the two words which 
could be made from the five letters in 
any given problem were discrepant 
in frequency of occurrence in the lan- 
guage. The W-condition Ss were 
given the lower frequency word as the 
anagram. The N-condition Ss were 
given a nonword anagram and asked 
to solve for one or both solutions. 
Mayzner and Tresselt (1966) have 
shown that Ss strongly tend to give 
the higher frequency solution first. 
Therefore, the W and N conditions 
should be comparable on the time 
score for the first solution. As a 
second way of increasing the proba- 
bility that the N-condition Ss would 
solve critical words first, these Ss were 
primed prior to solving any anagrams 
(Dominowski & Ekstrand, 1967). This 
priming consisted of reading the list of 
critical solution words only. 


METHOD. 
Design.—The design was 3 X 2 factorial 


with three conditions and either prior priming 
(P) or no prior priming (NP). The three 
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conditions were Nonsense-1 solution (N-1), 
Nonsense-2 solutions (N-2), and Word (W). 
In N-1 conditions, Ss were given nonsense 
anagrams and asked to find one solution for 
each problem. In N-2 conditions, Ss were 
also given nonsense anagrams, but were asked 
to find both solutions. The N-2 condition 
corresponds to the nonsense condition of the 
Mayzner and Tresselt (1965) experiment. In 
W conditions, Ss were given the solution word 
of lower frequency as their anagram and were 
asked to find one solution. 

Materials—Thirteen five-letter problems 
were attempted by each S in each condition. 
Two of the problems had more than two 
solutions and were dropped from the analysis 
since these “extra” solutions were given quite 
frequently by Ss. Of the 11 remaining prob- 
lems, 2 problems also had more than two 
different solutions given by Ss, but the fre- 
quency of extra solutions for these problems 
was extremely low. These problems were 
retained. The 11 problems are presented in 
Table 1. 

Since the design demanded that the ana- 
grams in the N and W conditions be solved 
for the same solutions, the anagrams for the 
N and W conditions must differ. Control 
must therefore be exercised to insure that 
relevant anagram variables are equated for 
the W and N problems. The anagrams for 
the N and W conditions were matched as 
closely as possible on the summed anagram 
bigram frequency (ABF) according to the 
Underwood and Schulz (1960) count. The 
value of ABF for each problem is presented in 
Table 1. Secondly, the number of letter 
moves necessary to reach the critical solution 
(Dominowski, 1966) was identical for the 
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N and W anagrams. The equation of letter 
moves for the N and W problems was done to 
afford some control over letter-order difficulty. 
The number of letter moves necessary to reach 
the critical solution word for each anagram is 
also presented. 

Procedure.—Each of the 13 original ana- 
grams was typed in lower case letters on a 
3 X 5-in. card. The Ss were tested indi- 
vidually, gave their solutions verbally, and 
time scores were recorded from a stopwatch. 
In the W conditions, Ss were allowed 120 sec. 
to attain the correct solution. If S failed to 
solve a particular problem, he was informed of 
the correct solution, and a score of 120 sec. 
was entered. In the N-1 conditions, the 
procedure was exactly the same as in the W 
condition; the N-1 Ss were not told that there 
were two solutions to each problem. As soon 
as one solution was verbalized (whether or not 
it was the critical solution), E presented the 
next problem. If after 120 sec. neither solu- 
tion had been given, E told S the critical 
solution and entered a score of 120 sec. for the 
critical solution. The Ss in the N-2 condi- 
tions were told to try and find both solutions 
for each problem. They were told to con- 
tinue working after finding the first solution 
until they found the second solution. If 
neither solution was given after 120 sec., a 
score of 120 sec. was entered for both solu- 
tions, and S was informed of both answers. 
If S produced the first solution word in less 
than 120 sec., he was given an additional 120 
sec. (from the time of the first solution) to 
solve for the second. In the actual analysis 
of the time scores, however, no score over 120 
sec. was ever allowed. For example, suppose 
S solved teach after 80 sec. and cheat after an 


TABLE 1 
MATERIALS 
W Condition N Condition 
Number of 
Moves 
Anagram Solution ABF Anagram Solution ABF 
Froth Forth 5641 1 Frtho Forth 4998 
Cheat Teach 9073 3 Athce Teach 6838 
Thorn North 6550 3 Hontr North 8116 
Shrub Brush 1626 3 Bhsur Brush 1914 
Canoe Ocean 5058 2 Eanoc Ocean 7004 
Beard Bread 6020 1 Berad Bread 7030 
Diver Drive 7374 1 Deriv Drive 7616 
Bleat Table 8420 2 Bleta Table 5722 
Shelf Flesh 5754 3 Heslf Flesh 5543 
Sauce Cause 2729 2 Ceusa Cause 3273 
Sleet Steel 5457 2 Etesl Steel 6738 
Total 63702 64792 
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additional 45 sec. In terms of the total-time 
score, cheat took 125 sec. to solve; however, if 
total time were the dependent variable under 
analysis, cheat would be scored as a non- 
solution (120 sec.) even though S in fact 
solved it. 

The problems were presented to 13 Ss in 
each group according to a 13 X 13 Latin 
square to insure that each problem occurred 
once at each ordinal position. There were 
thus 13 different orders of problem presenta- 
tion, and 1 S in each group received each 
presentation order. 

In the nonprimed conditions (NP) Ss were 
instructed about the task and then began 
solving problems. In the primed conditions 
(P) Ss were also informed that they would be 
read a list of the solution words. The Ss in 
the W:P and N-1:P conditions were told that 
the list contained all the solutions to the 
problems. The Ss in the N-2:P condition 
were told that the list contained only one of 
the two solutions to each problem, but that 
they were to solve for both solutions. The 
higher frequency solution (the critical solu- 
tion) to each problem was contained in the 
list of priming words. The 13 words in the 
priming list were read out loud by E at a 2-sec. 
rate. The order of reading corresponded to 
a 13 X 13 Latin square; each problem thus 
occurred once in each ordinal position of 
the priming list. Immediately after the read- 
ing of the priming list, Ss started solving 
anagrams. x 

Subjects.—There were 13 Ss in each of the 
six conditions of the 3 X 2 design. They 
were all undergraduate students from North- 
western University whose service in the 
experiment was a course requirement. A 
schedule was made consisting of 13 blocks of 

.the six conditions, the order of conditions 
within a block being random. The Ss were 
assigned to the schedule according to their 
order of appearance at the laboratory. 


RESULTS AND DISCUSSION 


Number solved.—Within the four N 
conditions (P or NP by one or two so- 
lutions) a determination was made of 
the number of critical words (out of 11) 
that were solved as first solutions in a 
total time of less than 120 sec. The 
means were 6.54 for the N-1: NP con- 
dition, 8.62 for the N-1:P condition, 
5.38 for the N-2:NP condition, and 
7.85 for the N-2:P condition. A 
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2 X 2 analysis of variance indicated 
that priming increased the number of 
critical words solved first, F (1, 48) = 
27.92, p < .01, and that more critical 
words were solved first in the N-1 con- 
ditions than in the N-2 conditions, F 
(1, 48) = 5.01, p < .05. The F for 
the interaction was less than one. It 
is apparent from the above means 
that the procedures designed to in- 
crease the probability that Ss would 
solve the critical solution first were 
not particularly successful, although 
priming did have a highly significant 
effect in this respect. In any case, 
the intended situation did not obtain; 
clearly, Ss did not almost always give 
the critical solution first. 

Mean time scores.—For each S in the 
six conditions, a mean time score was 
determined for critical problems that 
were solved in a total time of less than 
120 sec. Unsolved problems were ex- 
cluded. The six means are presented 
in Table 2. A log transformation was 
applied to remove heterogeneity of 
variance. A one-way analysis of var- 
iance showed that the six groups dif- 
fered significantly, F (5, 72) = 6.15, 
p < 01. Itcan be seen from Table 2 
that priming facilitated solution in all 
three conditions, F (1, 72) = 10.57, 
p <.01. Itis also apparent that the 
conclusion reached about the diffi- 
culty of W. vs. N problems will depend 
upon whether the N-1 conditions or 
the N-2 conditions are used for the 
nonsense problems. Two nonortho- 
gonal comparisons were made, one 
being the main effect of N-1 against 
the W conditions, the other the main 


TABLE 2 


MEAN TIME SCORE (sEC.) ON CRITICAL 
PROBLEMS THAT WERE SOLVED 


N-2 NA w 
Primed 12.46 7.85 14.15 
Not Primed | 20.92 13.00 22.54 
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effect of N-2 against the W conditions. 
The former comparison was highly 
significant, F (1, 78) = 16.01 p< 01, 
while the latter comparison was not. 
This latter comparison is similar to the 
comparison made by Mayzner and 
Tresselt (1965) and identical to their 
results. 

One interpretation of this discrep- 
ancy is that the time scores of many 
of the critical solutions in the N-2 con- 
ditions represent scores for second so- 
lutions, and as such they are long time 
scores for reasons already mentioned. 
This would not apply to the total- 
time score for critical problems in the 
N-1 conditions because all critical 
problems that were solved were first 
solutions. However, another inter- 
pretation of the superiority of the N-1 
groups over the W groups is possible. 
The solution of a critical problem in 
the N-1 groups represents one of two 
possible solutions that S could have 
given. When S solves the noncritical 
problem first, this must mean that the 
noncritical problem was easier than 
the critical problem (despite frequency 
favoring the critical problem). In 
other words, there probably was a 
problem-selection bias Operating in 
the N-1 conditions such that the more 
difficult critical problems never en- 
tered into an Ss’ mean because he 
would solve the noncritical problem 
instead. 

Time scores: N —1 vs. W.— In an 
attempt to control for problem-selec- 
tion difficulties, a median time score 
was computed for each S in the N—1 
conditions for the critical problems. 
If neither solution was given, a score 
of 120 was assigned to the critical so- 
lution for that problem and this was 
ised in determining the overall me- 
lian. The mean of the medians was 
7.54 in the N—1:NP condition and 
1.46 in the N—1:P condition. An 
ittempt to derive a comparable mea- 
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TABLE 3 


MEANS OF THE MEDIAN TIME SCORES 
(SEc.) For CRITICAL PROBLEMS IN 
Conv. N-1 AND COMPUTED BY 
THREE DIFFERENT METHODS 
IN Cond. W 


Primed | Not Primed 


Method Cond. N-1 
4.46 7.54 
W Matched by Problems | 8.38 26.23 
W Matched by Difficulty | 7.81 23.50 
W Matched on the Most | 5.88 11.69 


Rapid Time Scores 


sure in the W groups was then made. 
Three such measures were computed. 
In the first measure, the W and 
N —1 Ss were compared on exactly the 
same problems. Each S in an N—1 
condition was matched with an S in the 
corresponding W condition. For each 
S in the N —1 condition there was an S 
in the W group who had received the 
problems in the same order; these two 
Ss became a matched pair. Based 
upon which of the 11 critical problems 
were solved by the N—1 Sina pair, a 
median time score was determined for 
the matched W S on exactly the same 
problems. The means of the medians 
of the W conditions are presented in 
Row 2 of Table 3, Row 1 containing 
the means from the N —1 conditions. 
In order to remove heterogeneity of 
variance, a log transformation was ap- 
plied to median scores. An analysis 
of variance (2 X 2 on the four means 
in Rows 1 and 2 of Table 3) showed 
thatthe main effects of W vs. N—1 and 
P-vs. NP were both highly significant, 
F (1, 48) = 22.14 and 13.83, respec- 
tively, p < .01. The F for the inter- 
action was 1.33. The implication is 
that W anagrams are more difficult to 
solve than N —1 anagrams. ; 
There is still a question concerning 
the matching procedure employed 
above. Although the analysis com- 
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pared N and W problems and exactly 
the same problems in both groups, 
the fact remains that the problems 
were "selected" by the N —1 Ss; how- 
ever, the W Ss could not select prob- 
lems. The above procedure may not 
take into account possible idiosyn- 
cratic differences in problem difficulty. 
New means were therefore computed 
for the W groups using the following 
procedure (these means constitute 
Row 3 of Table 3) : Each S in the N —1 
condition attempted all 11 problems 
but may have only solved 7 (e.g.) 
critical ones, leaving, four time scores 
representing noncritical solutions. 
The 7 critical problems may in some 
way have been selected for idiosyn- 
cratic reasons and therefore might be 
expected to be among the fastest prob- 
lems of the 7 that S solved. There- 
fore, the 11 problems were ranked ac- 
cording to time score. The rank po- 
sition of each critical solution was 
noted (in an extreme example, if S 
solved 7 critical problems, one would 
expect that the seven critical answers 
would hold the first seven ranks, al- 
though this, in actuality, seldom oc- 
curred). For each matched SS in the 
W conditions, the 11 problems were 
also ranked according to time score, 
and a median time score was deter- 
mined for the problems in the same 
rank positions as the critical prob- 
lems solved by the N—1 matched S 
(in the extreme example, the matched 
S's time score in the W condition 
would have been a median of his seven 
fastest time scores). The previous 
matching was done by problems 
whereas the present matching is pre- 
sumed to have been accomplished by 
matching on relative difficulty. After 
a log transformation to remove hetero- 
geneity of variance, a 2 X 2 analysis 
of variance was done on the four 
means in Rows 1 and 3 of Table 3. 
The main effect of W vs. N—1 con- 
ditions was significant, F (1, 48) — 
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14.17, p < .01, as was the main effect 
of priming, F (1, 48) = 9.84, p « .01. 
The F for the interaction was less than 
one. 

Row 4 of Table 3 contains still a 
third type of mean computed for the 
W conditions in an attempt to remove 
all possible questions concerning prob- 
lem selection. For each Sin the N —1 
condition, the number of critical prob- 
lems solved had been noted. For the 
matched S in the W condition, a 
median time score was comptued on 
an equivalent number of problems, 
but the specific problems chosen were 
those solved most rapidly by the W S. 
For example, if the N—1 S had solved 
seven critical problems, the score for 
the matched W S would be the median 
of the seven problems he solved most 
rapidly, Thus, it is the W S who is 
being given the maximum potential 
for problem selection (only his lowest 
time scores enter into the analysis). 
If the W condition is still more diffi- 
cult using this extreme selection con- 
trol, it would seem fair to conclude 
that W problems are more difficult 
than N problems. Following a log 
transformation, a 2 X 2 analysis of 
variance was performed on the means 
in Rows 1 and 4 of Table 3. The 
main effect of W vs. N—1 was still 
significant, F (1, 48) = 4.47, p < .05. 
The main effect of priming was also 
significant, F (1, 48) = 7.62, p < .01. 
The F for the interaction was again 
less than one. It is concluded that 
under both primed and nonprimed 
conditions, W problems are more diffi- 
cult to solve than N —1 problems. 
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STIMULUS INTENSITY, SITE OF STIMULATION, AND 
INDIVIDUAL REACTIVITY AS DETERMINANTS 


OF THE ENERGY THRESHOLD FOR 
PRICKING PAIN 


DONALD J. DILLON 
New York State Psychiatric Institute and Manhattan College 


Employing the D'Amour-Smith technique, threshold energies for the 
perception of pricking pain were obtained at 3 stimulus intensities in 
2 experimental sessions from 3 spots along the volar surface of the 
nonpreferred forearm of each of 33 Ss. Thresholds varied significantly 
according to intensity of stimulation, session, and spot stimulated. In- 
dividual differences were largely accounted for by a grouping effect: 
3 groups in terms of reactivity or sensitivity. Because of a significant 
Intensity X Spot interaction effect, spot differences were interpreted as 
differences in heat absorption and circulatory cooling and not as dif- 
ferences in pain sensitivity per se. A significant Intensity X Subjec- 
tive Reactivity interaction effect revealed that very insensitive or 
hyporeactive individuals were better able to discriminate between con- 
secutively paired intensities. The effect of criterion upon thresholds 
at different rates of heat input was discussed. 


The introduction of the dolorimetric 
technique into the investigation of the 
perception of pain has made it possible 
to utilize carefully controlled labora- 
tory procedures. Basically, the tech- 
nique consists in stimulating some skin 
area with radiant heat. Under stan- 
dardized experimental conditions in- 
volving area and duration of stimulus, 
psychophysical procedure, and statisti- 
cal analysis, the responses of “painful” 
or “not painful” may be correlated 
with known rates of heat input and the 
pain threshold is then defined as that 
point on the stimulus scale which cor- 
responds to the transition from “not 
painful” to “painful.” The experi- 
mental procedure and apparatus had 
been developed by and were fully de- 
scribed by Hardy, Wolff, and Goodell 
(1952). 

An alternative procedure is available, 
viz., the modification sometimes called 
the latency technique, devised by 
D’Amour and Smith (1941). In this 
case, the pain-no pain dichotomy is 


avoided and S indicates (by terminat- 
ing the stimulus) that point in time at 
which the stimulus becomes painful, 
and pain thresholds are measured in 
units of time (latency). Wertheimer 
(1952), Miller (1949), and Zegers and 
Cahill (1965) have utilized this modi- 
fication. The advantages of this 
method are that there is an “end 
point” which can be carefully explained 
to S; each stimulus is a painful one; 
and stimulation is under S’s control. 
The mere introduction of this rather 
clearly defined end point would seem 
to make this procedure superior to the 
original procedure of Hardy, Wolff, 
and Goodell (1952). 

Beecher (1966) has made the point 
that, since it lacks a strong reactive or 
emotional component, pain produced 
dolorimetrically is in no way com- 
parable to pain of pathologic origin. 
Because of this, Beecher feels that 
dolorimetry is not the technique of 
choice in evaluating analgesics. The 
investigations of Zegers and Cahill 
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(1965), Sweeney (1962), and Fang- 
man (1962), in which the latency tech- 
nique failed to demonstrate elevated 
thresholds as a result of administration 
of various analgesics, tend to confirm 
Beecher on this point. 

In the course of a literature survey 
conducted while attempting to increase 
the painfulness of the sensation evoked 
by the latency procedure, the author 
was surprised by the relative scarcity 
of latency data as compared with data 
obtained when the standard Hardy- 
Wolff-Goodel technique had been used. 
Direct comparisons of studies utilizing 
the standard technique are readily 
available; direct comparisons of stud- 
ies utilizing the latency technique are 
not. 

"There is also very little information 
concerning the relationship between in- 
tensity and duration of the stimulus. 
Hardy et al. (1952, p. 105) have pre- 
sented a curve demonstrating this re- 
lationship: stimulus duration decreases 
with increased intensity in something 
like an hyperbolic function, but with- 
out further specification of the function. 
Sievers (1962) obtained mean laten- 
cies for three different levels of inten- 
sity, but the levels were not sufficiently 
separated for even an approximation 
of the mathematical function. The 
means did differ significantly in the 
expected direction. 

Hardy et al. (1952, p. 91) present 
data thresholds for pricking pain for 
various parts of the body, and feel that 
“sensitivity to pricking pain is roughly 
uniform over the body surface (p. 
92)."  Zegers and Cahill (1965) 
tested 12 spots on the volar surface of 
each forearm and found no significant 
spot. differences across Ss. They did, 
however, control for possible spot dif- 
ferences. In preliminary investigations 
concerned with latency differences as a 
function of variations in intensity, this 
author detected regular but slight dif- 


ferences in spot sensitivity at certain 
low levels of intensity. 

Hardy et al. (1952, p. 89) present 
means and standard deviations from 
various investigations. These stan- 
dard deviations tend to display a rather 
small degree of individual differences, 
even with untrained Ss.  Fangman 
(1962), on the other hand, reports 
large individual differences and a large 
degree of internal consistency. 

This investigation was undertaken 
in order to describe the variability of 
latency thresholds for pricking pain 
and to demonstrate: (a) that intensity 
levels of considerable difference would 
result in large differences in threshold 
latency, with low intensities evoking 
long latencies, (b) that there is a pain 
sensitivity gradient along the volar 
surface of the forearm, (c) that thres- 
hold latencies vary from one session to 
another—in the direction of increased 
thresholds, and (d) that there are large 
individual differences in threshold lat- 
ency and that the extent of individual 
differences varies inversely with stimu- 
lus intensity. 


METHOD 


Subjects—The Ss were 33 undergradu- 
ate students who had never participated in 
experiments dealing with the perception of 
pain. There were 28 males and 5 females, 
all 20 or 21 yr. of age. 

Apparatus —The Hardy-Wolff-Goodell Do- 
lorimeter (Williamson Development Com- 
pany) was employed. The heat source is a 
100-w. projection lamp which is controlled 
by a regulated power supply and. timer. 
The lamp housing is a gun-like device with 
the lamp in the "handle" and an optical 
system (reflector and lens) in the "barrel." 
Light from the lamp passes’ through an 
opening at the front end of thé-barrel, which; 
is placed against the skin surface, and fo- 
cuses at a point 4 in. beyond, By depress- 
ing a button switch in the handle, S stimu- 
lates the skin, and, by releasing the switch, 
extinguishes the stimulus, An electric timer 
(Standard) connected between the power 
supply and lamp permits the measurement 
of stimulus duration. 
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Calibration of the lamp was accomplished 
by means of a thermopile (Williamson Co., 
Model RT2) and voltmeter galvanometer, 
the thermopile being placed against the 
opening in the “gun barrel" The apparatus 
was calibrated prior to each experimental 
session with dial settings corresponding to 
desired caloric values noted. Such dial set- 
tings did not vary by more than +3% dur- 
ing the course of the experiment. 

Procedure—All Ss were tested in two 
experimental sessions, with an intersession 
interval of 1 wk.’s duration. The experi- 
mental procedure including the instructions 
were the same in both sessions. In the 
first session, Ss had been instructed to re- 
member the sensation at the exact moment 
of terminating the stimulus and to try to 
hold that criterion in mind for the second 
testing 1 wk. later. 

'The Ss washed the nonpreferred forearm 
immediately before coming to the laboratory. 
The length of the volar surface of the fore- 
arm was then carefully measured and noted, 
and three spots (4, 4, and 1 the distance 
from the wrist to the elbow) were selected 
and painted thoroughly with black India ink. 
Each spot was approximately 3.5-4 cm. in 
area, somewhat larger than the 3.1 cm.* open- 
ing of the "gun barrel" of the apparatus. 

While the ink was drying, the criterion 
for pricking pain and the details of the 
stimulating procedure were outlined to S. 
A preliminary testing procedure was then 
carried out utilizing the technique as out- 
lined by Hardy et al. (1952, pp. 82-83). 
The spots were stimulated successively 
(wrist—mid—elbow, etc.). This preliminary 
testing was complete when one pain re- 
sponse had been given at each of the three 
spots. Then three sample latency thres- 
holds were obtained after the criterion de- 
scribed by Zegers and Cahill (1965, p. 103) 
had been explained to S. 

The author had determined, in a series 
of pilot investigations, that these two phases 
enabled E to warm up S’s skin to a level 
of 33-34° C., and that only below 33° C. 
did skin temperature variation contribute 
to latency threshold variance. It is pos- 
sible to maintain a skin temperature of 34^ C. 
at the start of each latency stimulation if an 
interstimulus interval of 13-2 min. is main- 
tained for a given spot. 

The third phase of each session constituted 
the experiment proper. Three intensity 
levels were selected, viz., .100, .200, and .300 
gm-cal/cm'/sec. In practice, these units 
are multiplied by 1,000, and reported in terms 
of millicalories/cm*/sec. Each of the three 
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spots was stimulated three times at each 
of these intensity levels, thus yielding 9 
latency thresholds per spot, or 27 thresholds 
per session per S. The stimulus intensities 
were presented in random order. The inter- 
stimulus interval was maintained at 30 sec. 
so that 90 sec. elapsed before a particular 
spot was again stimulated. For each stimu- 
lus, S placed the opening of the gun barrel 
against the spot, depressed the button-switch 
and held it until he first felt pain, at which 
time he released the switch. The stimulus 
duration was then recorded, the clock reset, 
and the appartus set for the next intensity. 
Thirty seconds after releasing the switch, 
S started the next stimulus. 


RESULTS 


The units of intensity are actually 
units of power, i.e., they represent the 
rate at which heat (light) is applied to 
the skin. The term intensity will be 
used, however, in order to conform 
with standard terminology. The units 
of intensity (mcal/cm?/sec) will be 
abbreviated as mcal/sec. 

The threshold latencies varied in- 
versely with intensity. The values are 
higher than the durations found by 
Bigelow, Harrison, Goodell, and Wolff 
(1945) but the relationship between 
duration and intensity is much the 
same. These same authors also estab- 
lished an end point for pricking pain 
at 44.3 + .5° C. A measurement anal- 
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ogous to this end point was available 
in this experiment ; viz., the energy or 
total heat input required to elicit the 
response of pricking pain. Mathe- 
matically, this threshold energy is the 
product of intensity and time, and is 
in units of mcal/cm?. It will be abbre- 
viated as mcal. 

Single (overall mean) energy thres- 
hold values were computed for each S, 
and plotted in a frequency distribution. 
The Ss were ordered as to these means, 
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and the means were grouped and plot- 
ted in the usual frequency distribution. 
This frequency distribution, together 
with a plot of individual average thres- 
holds is depicted in Fig. 1. With the 
exception of one S, there is a good case 
for separating Ss into three groups. 
Since S’s task was to extinguish the 
lamp at the first perception of pricking 
pain, one might term these groups: 
“hyporeactive” or relatively insensitive, 
“normally reactive” or normally sensi- 


TABLE 1 


Means, SDs, AND COEFFICIENTS OF VARIATION OF THE ENERGY THRESHOLDS 
FOR THE PERCEPTION OF PRICKING PAIN 


Intensity = mcal/sec 
100 200 300 
Spots 
Reactive Groups 
A B c A B c A B c 
Session I 
WwW 
M 1340 1760 1900 883 1120 1240 774 965 1050 
SD 245 339 248 196 160 96.4 146 101 143 
V 18.3 19.3 13.1 22.2 14.3 7.8 18.9 10.5 13.6 
M 
M 1190 1450 1670 801 1020 1190 757 906 1060 
SD 147 269 113 122 86.6 125 109 114 83.8 
V 12.4 18.6 6.8 15.2 8.5 10.5 14.4 12.6 7.9 
E 
M 1150 1460 1520 842 1030 1110 768 929 946 
SD 274 174 242 171 116 .139 152 890 99.2 
V 23.8 11.9 15.9 20.3 11.3 12.5 19.8 9.6 10.5 
Session IT 
W 
M 1460 1790 2160 982 1220 1390 862 1020 1150 
SD 314 187 323 174 103 164 159 142 115 
V 21.5 10.4 15.0 17.7 84 11.8 18.4 13.9 10.0 
M 
M 1240 1560 1850 955 1110 1310 857 967 1090 
SD 295 196 307 145 85.5 116 116 84.9 84.9 
V 23.8 12.6 16.6 15.2 7.1 8.85 13.5 8.8 7.8 
E 
M 1330 1600 1620 1010 1170 1290 891 992 1080 
SD 220 255 152 157 151 116 93.6 161 112 
V 16.5 15.9 9.4 15.5 12.9 9.0 10.5 16.2 10.4 


Note.—Threshold units: meal. 
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tive, and "hyperreactive" or highly sen- 
sitive. In Fig. 1, these groups are 
labelled C (n= 11), B (n= 10), and 
A (n= 12), respectively. 

The condition-by-condition compari- 
sons for these reactive groups are sum- 
marized in Table 1; A, B, and C are 
the same as for Fig. 1. The spots are 
the spots on the volar surface of the 
forearm: W—nearest the wrist; M— 
the mid forearm level; E—the spot 
nearest the elbow. The coefficients of 
variation were computed as: 100 SD/ 
M. The threshold energy values tabu- 
lated are means for three determina- 
tions. For purposes of variance analy- 
sis, these three were considered in their 
order of presentation. Order of pre- 
sentation, together with the parameters 
indicated in Table 1 (intensity, reactive 
group, session, and spot stimulated) 
were the main experimental variables. 
The error term was the within-cell 
variance—an index of individual dif- 
ferences (Ss within a reactive group). 

Except for order which did not pro- 
duce a significant effect (p < .01), the 
main effects were significant well be- 
yond the .001 level. Three interaction 
effects (Intensity X Subjective Re- 
activity, Intensity X Spot, and Spot X 
Subjective Reactivity) were likewise 
significant well beyond the .001 level. 

The main effects are as follows: (a) 
threshold energy decreases monotoni- 
cally with increased stimulus intensity ; 
(b) threshold energy is greater in the 
second than in the first session; (c) 
threshold energy is least for Group A, 
more for Group B and greatest for 
Group C; and (d) the threshold is 
higher (p < .005) near the wrist (W) 
than at the other two spots (M and E) 
which do not differ significantly ($ < 
.05) between themselves in the amount 
of energy required for threshold. 
These effects are depicted in Fig. 2 
(A) together with the significant inter- 
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Fic. 2. Graphic representation of those 
factors significantly influencing energy thres- 
holds for pricking pain. 


action effects (B) which are sum- 
marized in Table 2. 

The Intensity X Group interaction 
effect is: As the intensity of stimula- 
tion increases, the decrease in thres- 
hold energy is greatest among the hy- 
poreactive Ss, "moderate" among the 
normally reactives, and least among the 
hyperreactives. For all groups, the 
change in threshold energy is greater 
when intensity is increased from 100 
to 200 mcal/sec, than when it is in- 
creased from 200 to 300 mcal/sec. 

Spot differences are pronounced 
when the stimulus is 100 mcal/sec, and 
diminish at higher intensity levels. 
The difference is attributable largely 
to the greater insensitivity of the wrist 
spot, which is also more responsive to 
changes in intensity than are the other 
spots. 

With intensity partialled out, it is 
the hyporeactive group which displays 
the sensitivity gradient along the fore- 
arm, with lowest sensitivity at the 
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TABLE 2 


TABULAR SUMMARY or THOSE INTERACTIONS HAVING SIGNIFICANT EFFECTS 
Upon ENERGY THRESHOLDS FOR PRICKING PAIN 


Intensity Intensity Group 
Spot Group Spot 
100 200 300 100 200 300 A B c 

Ww A W 
M 1720 | 1130 | 964 1290 | 912 818 1050 |1310 | 1480 
SD 352 192 | 157 166 96.1 87.3 161 72.6| 113 

M B M 
M 1480 | 1060 | 935 1600 |1110 964 968 |1170 |1370 
SD i 278 172 | 135 c 104 32.9 54.0 112 61.4 65.7 
M 1440 | 1060 | 931 1790 |1250 | 1060 994 |1200 | 1260 
SD 226 162 | 114 118 554 60.8 130 86.9 84.6 


Note.—Intensity = mcal/sec. 


wrist. Spot differences in sensitivity 
for the hyperreactive group are nil. 

Correlation coefficients between ses- 
sions for each intensity are of the order 
of +.60. Coefficients among intensities 
are considerably higher (100:200 = 
90, 100:300 —.83, and 200:300 = 
94). Various interspot comparisons 
at the three levels of intensity pro- 
duced coefficients ranging from +.55 
to +.85 with highest values being ob- 
tained when W and M spots were 
paired. 


Discussion 


Before discussing certain aspects of 
these experimental findings, a description 
of the qualitative aspects of the pain “sen- 
sations" is in order. 

When the latency technique is employed 
to produce the response of pricking pain, 
the sensations are: (a) a sensation of 
warmth building to a feeling of heat that 
increases, (b) suddenly contracts, and 
(c) changes to a feeling of a short sharp 
stab of pain. The rate at which the feeling 
of heat intensifies, and the certainty of the 
point of transition of heat to pain depends 
upon the intensity of stimulation, i.e., upon 
the rate at which the heat is applied to the 
skin. The Ss in this experiment reported 
that they were quite certain of the transi- 
tion point when the stimulus was 300 
mcal/sec, and quite uncertain when it was 


100 mcal/sec. The reaction to the 200 
mcal/sec stimulus was qualitatively simi- 
lar to that at 300, but differed in the sub- 
jective intensity of the pain response, the 
stronger response being elicited by a stim- 
ulus of 300 mcal/sec. 

The fact that the standard deviation of 
the thresholds is larger at the 100 level 
than at either 200 or 300 mcal/sec would 
seem to be a reflection of this qualitative 
difference. The significant interaction ef- 
fects are also reflections of the differences 
in reaction to the rates at which the skin 
is heated, 

The hyporeactives as a group shortened 
their reactions most when stimulus inten- 
sity was increased, with the hyperreac- 
tives changing the least. The difference in 
rates at which threshold energy de- 
creased as intensity increased (slope of 
energy-intensity curve) was such that the 
groups became more alike at an intensity 
of 300 mcal/sec. The less sensitive Ss, 
by lengthening their reactions more as 
intensity decreased, indicated that they 
were better able to discriminate among the 
three intensities presented. With an in- 
tensity level of 100 mcal/sec, the hypo- 
reactive group required 3995 more energy 
before terminating the stimulus than the 
hyperreactive group, and 1295 more than 
the normally reactive group. At 200 
meal/sec, these figures were 37%, and 
12%, and, at 300 mcal/sec, 29% and 
1096, respectively. The normally reactive 
group required 2495 more energy at 100 
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mcal/sec than the hyperreactive group, 
22% more at 200, and 20% more at 300 
mcal/sec. Clearly, the group differences 
in threshold energy were diminishing as 
the intensity of stimulation increased. 

The spot effect was significant only at 
the intensity levels of 100 and 200 mcal/ 
sec, with the wrist spot requiring more 
energy than spots near the middle of the 
forearm or elbow. Neither differences in 
the thickness of the ink, nor expectations 
on the part of the Ss (three were none) 
would account for this interaction effect. 
When skin temperatures are elevated to a 
point higher than blood temperature, the 
blood stream serves to cool the skin, a 
condition that certainly was present in 
this experiment, Since the blood vessels of 
the arm lie closest to the skin surface at a 
point near the wrist, then the rate at 
which the skin is heated at the spot near- 
est the wrist is retarded. If the rate at 
which the skin is heated (e.g., 100 mcal/ 
sec) is low, this retardation should be 
noticeable; it was. The significance of 
the Spot X Intensity interaction effect, 
then, indicates a difference in thermal 
reaction, rather than a difference in pain 
sensitivity per se, and that spot differences 
tend to disappear as intensity increases. 

The spot differences in sensitivity were 
also contingent upon the sensitivity or 
reactivity level of a particular group. Spot 
differences were pronounced in the hypo- 
reactive group, less so in the group of 
normally reactive Ss, and virtually non- 
existent in the group of hyperreactive Ss. 
It seems as if Ss must be able to tolerate 
the heat for longer periods of time in 
order to demonstrate a differential spot 
sensitivity. 

When one investigates the perception 
of pain, one encounters the perplexing, 
and perhaps frustrating interaction of 
pain as a sensation, and pain as a reaction 
to aversive stimulation. The dolorimetric 
technique does not eliminate this inter- 
action, particularly when the latency tech- 
nique is employed to measure the pain 
threshold, since there is a strong emphasis 
upon a clearly defined end point for prick- 
ing pain. Individuals differ not only in 
basic physiological sensitivity to thermally 


induced pain, but in their interpretation 
of the carefully explained criterion for 
pricking pain. The results of this experi- 
ment, ie, the reduction in the threshold 
differences among the three reactive 
groups as intensity of stimulation was 
increased, would certainly suggest that the 
way to minimize criterion differences 
among Ss is to employ high intensity 
stimulation perhaps at a level much higher 
than administered in this experiment. The 
results also suggest that locus of stimula- 
tion would also be eliminated as a source 
of threshold variance at higher stimulus 
levels. The verbal reports of Ss, and the 
fact that they were less willing to tolerate 
longer stimulus durations at the 300 mcal/ 
sec level of stimulation, indicates more 
inter-S agreement as to criterion, and, 
possibly, a lesser degree of individual dif- 
ferences in "pain" sensation. At this 
level, Ss not only reported higher degrees 
of pain, but also behaved as if they were 
more set to terminate the stimulus quickly. 
Conversely, when the stimulus was 100 
mcal/sec, Ss were willing to wait for the 
“pain stab,” but the less sensitive Ss were 
more willing to tolerate longer periods of 
heat before admitting that heat had indeed 
changed to pain. 

One can only speculate as to the possi- 
ble outcome had a stimulus intensity of 
500 mcal/sec or higher been employed. 
Certainly, latency would be shortened 
(threshold energy reduced). The stimu- 
lation would be more unpleasant, more 
aversive, more painful. In all likelihood, 
locus of stimulation would be eliminated 
as a source of threshold variance, and that 
portion of the total variance attributable 
to individual differences would be re- 
duced. The threshold itself would still be 
of the detection type, a detection of that 
moment in time when heat changes to 
pain. The pain would be close to a level 
of aversion rather than at a level just 
noticeably present. 

With increasing intensity there is a 
decrease in the contribution of criterion 
to the latency (energy) threshold for 
pricking pain. It appears likely that a 
comparison of thresholds with very low 
stimulation to those with very high stimu- 
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lation would be of some value in assessing 
this criterion (possibly reaction) compo- 
nent in the perception of pricking pain. 
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EFFECTS OF UCS INTENSITY UPON CONDITIONING 


AND EXTINCTION OF THE GSR? 


F. W. BORING? ax» M. C. MORROW 
Georgia State College 


3 groups of Ss (m=20 each) received 11 CS-UCS (light/tone- 
shock) classical conditioning trials with a .5-sec. ISI, and then 10 
extinction trials. The experimental groups differed from each other 
according to the intensity of the UCS paired with the CS during 
conditioning. Ss in the low UCS group received either a 1.0-ma. or a 
1.5-ma. shock, those in the medium UCS group a 3.0-ma. shock, and 
those in the high UCS group a 5.0-ma. shock. Each group received the 
other groups' paired shock interpolated among the conditioning trials 
an equal number of times. A 4th group of Ss (n=20) received, 
throughout the experimental session, the same number of CSs and 
UCSs of each intensity as the conditioning groups, but the stimuli 
were never paired. The data showed that there was a significant posi- 
tive relationship between GSR magnitude and UCS intensity in both 
acquisition and extinction. There was also a significant positive cor- 


relation between CR magnitude and UCR magnitude after UCS inten- 


sity effects were partialed out. 


Wickens, Allen, and Hill (1963) 
failed to obtain UCS intensity effects 
upon GSR conditioning when condi- 
tioning was measured in extinction, 
but Wickens and Harding (1965) ob- 
tained such effects in both acquisition 
and extinction. The former study em- 
ployed a between-Ss design in which 
overall amount of shock was equated 
between groups by the use of interpo- 
lated shocks. The latter study was a 
within-Ss (differential conditioning) 
paradigm in which one CS was rein- 
forced with a strong shock and the 
other CS with a weak shock for each 
RY; 

The present between-Ss design dif- 
fers from the Wickens et al. ( 1963) 
study mainly by employing a wider 
range of shock intensities, determining 
their effects during acquisition and ex- 


1 This research is based upon a thesis sub- 
mitted to Georgia State College in partial 
fulfillment of the requirements for the MA 
degree. It was supported by Public Health 
Service Grant MH 12065 to the junior au- 


thor. TRA 
2 Now at Northwestern University. 


tinction, and presenting a greater num- 
ber of extinction trials. 


METHOD 


Subjects—The Ss were 80 Georgia State 
College undergraduates, 56 females and 24 
males. They were randomly assigned to four 
groups (n =20 each) except that the first 5 
in each set of four was assigned to the low 
UCS intensity conditioning group for rea- 
sons outlined below, Further restrictions 
on random assignment resulted from sex 
balancing. The S received $1.50 for par- 
ticipating. 

Apparatus——The S was seated in a dimly 
lighted IAC Audiometric Chamber, Model 
No. 1201. The E and S were able to com- 
municate at all times through a two-way 
intercom system. 

The CS was a white light and a tone. 
The 1-in. diameter light was placed at eye 
level approximately 30 in. in front of S’s 
face. It was produced by a Grason-Stadler 
Co. Multiple Stimulus Projector; Type 
E4580. The intensity of the 1,000-cps tone 
was 65-db. re .0002 dyne/cm*. It was pro- 
duced by a Hewlett Packard audio oscillator, 
Model No. 200AB, and attenuated by a 
Hewlett Packard attenuator, Model No. 
350D. The tone was delivered to S via a 
Permoflux headset, Model DHS 17. The 
UCS was an electric shock delivered through 
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electrodes taped to the middle and index 
fingertips of S's right hand. It was pro- 
duced by a two tube electronic regulator in 
conjunction with a 300-v. dc power supply. 

"Temporal relations of the stimuli and their 
durations were controlled electronically by 
the counting of clock pulses in four tran- 
sistorized decade counters. Output of these 
counters was gated through digital logic 
to effect the onset or offset of stimuli. 

The GSR was amplified by a Fels derm- 
ohmeter after being picked up from #-in. 
zinc electrodes attached to the palm and 
back of S's left hand. The electrodes were 
placed in lucite cups and covered with the 
standard Fels electrode paste. The GSR 
was recorded on an Esterline Angus Series 
"S" Labgraph milliameter operating at a 
paper speed of 12 in/min. 

Design and procedure.—Three experi- 
mental groups of Ss received 11 classical 
conditioning trials and then 10 extinction 
trials. The ISI during conditioning was 
.5 sec. and the CS terminated with the .1-sec. 
UCS. Following the second, fifth, eighth, 
and eleventh conditioning trials, Ss received 
CS-alone test trials. 

The experimental groups received either 
a 1.0-ma. or 1.5-ma. shock (Group L) 
paired with the CS during conditioning, a 
3.0-ma. shock (Group M) or a 5.0-ma. shock 
(Group H). The UCS intensity presented 
to an S in Group L was set at .5 ma. above 
the lowest shock level which produced a 
UCR during a series of shock adaptation 
trials preceding the conditioning trials (see 
below). The shock level was 1.0 ma. for 
14 Ss in Group L and 1.5 ma. for the re- 
maining 6 Ss. 

The S's hands were cleaned with alcohol, 
and the GSR electrodes were attached to 
the palm and back of his left hand. After 
being seated in the chamber, the shock elec- 
trodes were taped to the fingers of his right 
hand. The S was then informed that he 
could speak to E at any time, and that it 
was possible to open the chamber door from 
the inside. The light projector was pointed 
out, and the headset was placed comfort- 
ably on him. The E then left the chamber 
and closed the door. Approximately 3 min. 
later, routine instructions requesting S to 
remain motionless and pay attention to the 
stimuli were read to him over the intercom. 

The S was first given two presentations 
of the light-tone CS. These were followed 
by six presentations of the UCS in gradually 
increasing intensities from .5 ma. to 40 ma., 
and then two additional CS presentations. 

During the conditioning trials, which were 
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presented next, S received a rest period of 
about 3-min. duration after the third CS- 
alone test trial. During the rest period the 
door to the chamber was opened, the ear- 
phones were removed, and S was engaged 
in casual conversation by E. After the rest 
period, conditioning was resumed, and then 
the extinction trials completed the experi- 
mental session. Each conditioning group 
received, interpolated among the conditioning 
trials, the other groups’ paired UCS an 
equal number of times. For each S in 
Group L who received a 1.0-ma. paired 
shock, a corresponding member of each of 
the other groups received an interpolated 
shock of 1.0 ma., and the same procedure 
was followed for those who received the 1.5- 
ma. UCS. To achieve this matching, it was 
necessary for E to run a low shock S first 
in a particular replication. The CS-alone 
test trials during conditioning were preceded 
by the lower of the interpolated shocks. 
The interval separating paired conditioning 
trials, CS-alone test trials, and interpolated 
shocks was varied unsystematically among 
35, 40, and 45 sec. with an average interval 
of 40 sec. 

The extinction procedure was identical 
for the three conditioning groups. There 
were 10 presentations of the CS alone. 
Interpolated randomly among the CSs were 
10 presentations each of the low, medium, 
and high intensity USCs. 

The fourth group of Ss served as a con- 
trol for the three conditioning groups. This 
group (Group C) received throughout the 
"conditioning" period, the same number of 
CSs and UCSs of each intensity as the con- 
ditioning groups, but the stimuli were never 
paired, ie, a UCS never preceded CS onset 
by less than 10 sec., nor followed its offset 
by less than 15 sec. The extinction pro- 
cedure for Group C was identical to that of 
the experimental groups. 


RrsurTS 


The dependent variable was a drop 
in skin resistance of at least 100 ohms 
which began between 1.5 and 4.5 sec. 
after the CS-alone trials. The first 
extinction trial was treated as the fifth 
acquisition test trial in the analysis of 
the results instead of including it in the 
analysis of the extinction data. The 
data were transformed to change in log 
conductance. 
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TABLE 1 


Mean GSR MAGNITUDE AND SDs ron EACH GROUP on THE Last Two CS ADAPTATION 
TRIALS, DURING CONDITIONING AND EXTINCTION 


Conditioning Extinction 
Adaptation Test Trials Blocks of Three Trials 
1 2 3 4 5 1 2 3 
Group L 
M 0737 -0578 | .0631 | .0507 | .0513 | .0373 | .0526 | .0469 | .0411 
SD 0471 +0486 | .0494 | .0414 | .0436 | .0516 | .0572 | .0446 | .0366 
Group M 
M 0773 -0779 | .0986 | .0754 | .0801 | .0637 | .0635 | .0698 | .0597 
SD .0474 .0510 | .0675 | .0540 | .0589 | .0474 | .0436 | .0522 | .0374 
Group H 
M .0722 0884 | .0870 | .0819 | .0885 | .0808 | .0905 | .0807 | .0756 
SD .0343 0545 | .0513 | .0529 | .0576 | .0643 | .0630 | .0532 | .0504 
Group C 
M .0736 .0674 | .0555 | .0408 | .0322 | .0184 | .0327 | .0312 | .0354 
SD .0385 -0442 | .0330 | .0305 | .0323 | .0203 | .0242 | .0229 | .0304 


Acquisition—Table 1 shows the 
mean GSR magnitude and SD on the 
final two CS adaptation trials com- 
bined, and on the five conditioning test 
trials for each group. The combined 
conditioning groups produced signif- 
icantly larger average responses overall 
than did Group C, t (78) = 3.66, p < 
01. Analysis of covariance of the con- 
ditioning groups employing the mean 
magnitude on the final two CS adapta- 
tion trials as the covariate, and the 
mean magnitude on the five test trials 
as the dependent variable yielded a sig- 
nificant UCS intensity effect, F (2, 
56) = 3.82, p < .05. 

Magnitude of the UCR during con- 
ditioning was also determined by mea- 
suring the GSR to the UCS paired 
with the CS. Figure 1 shows the 
mean UCR magnitude overall of each 
of the conditioning groups to the paired 
UCS and of Group C to the 3.0-ma. 
UCS. The figure also shows, for 
comparative purposes, the mean GSR 
magnitude overall of the same groups 
to the five CS-alone test trials during 
conditioning. Somewhat surprisingly, 


mean UCR magnitude was smaller in 
Group L than was the CR in Group H 
and Group M. An evaluation of the 
mean UCR magnitude in the three 
conditioning groups showed that the 
differences were highly significant, 
F (2, 57) = 825, p < .001. 

When the mean magnitude of each 
S to the CS-alone test trials overall 
was correlated with his mean GSR 
magnitude to the paired UCS, a co- 


ee Punto UCS 


Q «1400: 
rH 
E 
ENT 
E 
= .os00 
= 
8 
z 0600: 
$ 
z 
0400 © GROUP ec 
nom 30mo. 5,0ma 
GROUP GROUPM GROUP M 
Unean intensity) 
UCS INTENSITY 
Fic. 1. Mean GSR magnitude overall 
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efficient of .750, p< 005, was ob- 
tained. With UCS intensity effects 
partialed out, the coefficient was re- 
duced to .721, p < .005. These find- 
ings are interpreted to mean that the 
level of conditioning achieved by an S 
was primarily a function of the size of 
his UCR. 

Extinction.—Table 1 also shows the 
mean GSR magnitude of the four 
groups over the last nine CS-alone ex- 
tinction trials in blocks of three trials. 
In the three conditioning groups the 
response declined slightly across the 
extinction trials while Group C ap- 
peared to remain at about the same 
level throughout. Analysis of mean 
magnitude overall during the extinc- 
tion trials showed that the com- 
bined conditioning groups produced 
significantly larger mean GSRs than 
did Group C, t (78) = 3.81, p < .01. 
Analysis of covariance of the extinction 
trials, again using the same covariate 
as in acquisition, showed that there 
was a significant difference in mean 
magnitude among the conditioning 
groups, F (2, 56) 2341, p< .05. 
Since each group—both the condition- 
ing groups and the control group—re- 
ceived the same experimental treatment 
during extinction, these differences re- 
flect residual effects of the differential 
conditioning procedures. 

To determine whether any extinc- 
tion occurred, mean magnitude of each 
conditioning group on the second and 
third extinction trials was compared 
to that on the final two extinction 
trials, i.e., a 3 X2 (UCS Intensity x 
Early-Late) factorial analysis was per- 
formed. The analysis showed that 
mean magnitude was significantly lar- 
ger on the early extinction trials than 
on the late ones, F (1, 57) — 5.52, 
p < .05, but neither the UCS intensity 
nor the interaction effect was signif- 
icant. 
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Discussion 


The results of this experiment have 
shown clearly that the amount of GSR 
conditioning is a positive function of UCS 
intensity. The significant results in the 
present between-Ss design were most prob- 
ably due to the use of a wider range of 
intensities than that employed by Wickens 
et al. (1963) in a similar study. 

Under the assumption that in classical 
conditioning drive (D) is produced 
mainly by the UCS, the four groups were 
equal during conditioning and extinction 
with regard to D, since each received the 
same amount of shock. The differences 
obtained may be taken therefore to sup- 
port the notion that UCS strength exer- 
cises its influence on habit (H). Alterna- 
tively, if it is assumed that the different 
UCS intensities paired with the CS re- 
sulted in the differential (between groups) 
conditioning of emotional responses (r,) 
either specifically to the CS or to environ- 
mental stimuli, then there may have been 
both H and D differences between groups, 
since r, allegedly produces increments in 
D (Spence, 1956). 

The relatively gradual extinction shown 
by the data was probably due to the com- 
bined effects of the partial reinforcement 
Schedule during conditioning and to the 
presentation of interpolated shocks during 
the extinction phase. The latter proce- 
dure most likely served to reduce the 
amount of “generalization decrement” 
(Kimble, 1961), and to maintain D during 
extinction, 

The striking positive relationship be- 
tween CR magnitude and UCR magnitude 
was in agreement with Razran’s (1957) 
view that CR size and UCR size are posi- 
tively related. He held that UCS intensity 
was an effective determiner of CR 
strength only insofar as it affected the 
really critical variable—UCR strength. 
The highly significant correlation remain- 
ing between these variables after UCS in- 
tensity effects were partialed out was 
taken to mean that the mechanism of the 
significant UCS intensity effect on CR 
strength was its ability to produce large 
UCRs at the high intensity values. 
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EFFECTS OF INTRASERIAL REPETITION ON 
SHORT-TERM RECOGNITION AND RECALL: 


THOMAS M. WOLF? ax» JOHN C. JAHNKE 


Miami University 


The retention of 7-digit strings in which the digit in Serial Position 
(SP) 2 was respeated in either SP 4, 5, 6, or 7 was measured by 
recognition (Exp. I) and recall (Exp. II). Performance on re- 
peated digits was facilitated relative to corresponding control digits 
in all conditions of Exp. I. Performance on repeated digits was in- 
hibited (the Ranschburg phenomenon) in all experimental conditions 
of Exp. II but that one in which the repeated items were most closely 
adjacent. An attempt was made to explain the apparently contradic- 
tory findings of the present experiments in terms of differences be- 
tween the processes of recognition and recall. 


Recent studies of immediate free re- 
call (Waugh, 1962, 1963, 1967) have 
Shown that retention of an item is a 
direct function of the number of times 
an item is repeated and of total pre- 
sentation time for that item. Such a 
result is, of course, consistent with the 
commonly held view that habit strength 
increases with number of “repetitions” 

r "reinforced responses.” 

It is of some interest, therefore, to 
note a special case in which the effect 
of repetition is apparently to interfere 
with performance on the repeated items 
or, less often, on items subsequent to 
the repeated items. This special case, 
known as the Ranschburg phenomenon, 
has been observed under conditions in 
which one of a short series of discon- 
nected items is repeated. Crowder and 
Melton (1965), Obonai and Tatsuno 
(1954), and Wickelgren (1965, 1966) 
have confirmed and extended research 


1 Based on a thesis presented by the first 
author to the Faculty of Miami University 
in partial fulfillment of the requirements for 
the M.A. degree. Grateful acknowledgment 
is made to A. W. Melton, who first brought 
the problem of the Ranschburg phenomenon 
to the authors’ attention, and to J. H. Davis 
and W. A. Wickelgren, who read critically 
an earlier draft of this report. 

2Now at the University of Waterloo. 


on the Ranschburg phenomenon. The 
magnitude of the Ranschburg effect has 
been found to increase directly with 
the distance between repetitions, except 
when the repeated items occupy favor- 
able serial positions; when repeated 
items occupy favorable positions or 
when repeated items are closely ad- 
jacent to one another, recall may be 
facilitated for the repeated items. It is 
important to note that studies con- 
firming the Ranschburg phenomenon 
used only short lists of items (6-10 
digits or letters), that only one item 
of the list was repeated only once, and 
that immediate serial recall of the 
items (items recalled in order of pre- 
sentation) was required. The question 
arises, therefore, as to whether or not 
memory decrements for repeated items 
might be obtained when different ex- 
perimental procedures are used. 
Recognition procedures presumably 
require S to judge only the familiarity 
of the test stimulus ; the target stimulus 
itself need not be retrieved as it ap- 
parently must in recall. Further, 
recognition procedures can allow the 
testing of individual items of a series 
and thus avoid in large measure the 
interference associated with the act 
of recall (cf. Tulving & Arbuckle, 
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1963, 1966). Recognition procedures 
force the emission of a response when 
otherwise an omission might have 
occurred; and, when adapted as in the 
present study, such procedures make 
possible analysis in terms of the Theory 
of Signal Detection (e.g., Egan, 1958), 
an analysis which effects a separation 
of the detection and decision compo- 
nents of memory. 

For such reasons, the first experi- 
ment reported here used the procedures 
typical of studies of the Ranschburg 
phenomenon, except that a probe- 
stimulus recognition procedure was 
used as the measure of retention. It 
was considered that under such condi- 
tions, each repetition of an item would 
increment trace-strength for that item 
and that recognition of repeated items 
might, therefore, be better than recog- 
nition of corresponding nonrepeated 
items. 


EXPERIMENT I 


Method 


Subjects.—The Ss were eight female and 
seven male psychology students attending 
Miami University. They participated in the 
experiment to fulfill a course requirement. 

Apparatus and materials—Stimuli were 
presented visually and sequentially at a 4-sec. 
rate on an Industrial Electronic Engineers, 
Inc., Model 089 bina-view decoding display, 
which was operated by a Friden Model S-2 
paper tape reader, Each S saw 182, seven- 
digit strings in each of two experimental 
sessions. The first 14 of these on each day 
were practice trials, and data from these 
trials were discarded. The sessions were 
one day apart. There were five basic ways 
in which stimulus strings were structured. 
For the control condition (Cond. 0), which 
occurred twice as often as the other four 
conditions, all seven digits of a string were 
different. For the four experimental con- 
ditions, all digit-series included a single repe- 
tition. The first occurrence of a repeated 
element was always in Serial Position (SP) 
2, and the second occurrence was in either 
SP 4, 5, 6, or 7 (Cond. 4, 5, 6, and 7), 
respectively. 

‘sA basic set of seven-digit strings was de- 
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rived from a table of random numbers with 
the restriction that no digit was repeated in 
any string. Experimental strings were de- 
rived from these by substituting the appropri- 
ate (repeated) digit in the appropriate serial 
position. The strings met the following 
constraints: (a) there was no ascending or 
descending sequence of length greater than 
two, (b) two consecutive strings could not 
have the same digit in the same serial posi- 
tion, (c) the last digit of one string could 
not be the same as the first digit of the 
next string, and (d) Cond. 4, 5, 6, or 7 could 
not occur twice in a row, while Cond. 0 
could not occur more than four times in a 
rOW. 

Procedure.—The 336 experimental strings 
were structured into four basic sets of 84 
strings each. The four sets were presented 
according to a balanced Latin-square design. 
One of each set of four successive Ss was 
assigned to a different one of the four orders 
of presentation. The S read out loud the 
digits as they appeared on the bina-view. 
After a string had been presented, S was 
required to read out loud two consecutive 
sets of the Letters A, B, C, and D in various 
random arrangements as they were presented 
on the bina-view at a 4-sec. rate. Imme- 
diately following the interpolated letter-read- 
ing, a probe-digit was presented and S was 
asked to indicate whether that digit had 
(an “old” stimulus, So) or had not (a “new” 
stimulus, Sa) occurred within the list and 
to indicate his confidence in his judgment 
on a five-point scale. The following con- 
straints were imposed on the probe digit: 
(a) no more than four successive probe- 
stimuli were items which actually were 
old (or new) and (b) the digit chosen as 
the probe could not be the same on more 
than two consecutive trials. For half of 
the strings the probe digits were items which 
were old and for the remainder, new. Each 
serial position was tested with equal fre- 
quency. 

The S was instructed to respond to the 
probe digit as rapidly as he could, consistent 
with what he thought was a good degree 
of accuracy, by pressing one of two appropri- 
ately labeled ("Yes" and “No”) telegraph 
keys. Left-right position of the keys was 
counterbalanced. The S was informed after 
each response of the correctness of his de- 
cision. The E recorded Ss' responses, as 
well as his response latencies, which were 
recorded to the nearest .01 sec. from a 
Standard Electric clock. Intertrial interval 
was approximately 8 sec, 
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Results 


For each of Cond. 4, 5, 6, and 7 there 
were 420 correct recognitions possible 
(15 Ss X 4 tests at each of seven serial 
positions) when the probe digit was 
old and 420 when it was new. For the 
control condition, of course, there were 
twice as many tests, for which half of 
the probe digits were old and half new. 
The proportions of correct recognitions 
for Cond. 4, 5, 6, 7, and the control 
when the probe digit was old (correct 
acceptance of an old stimulus) were 
826, 817, .812, .750, and .735, re- 
spectively. Corresponding data when 
the probe digits were either old or new 
(correct rejection of a new stimulus) 
were .850, .826, .848, .799, and .786. 

Repeated-measures analyses of vari- 
ance were performed separately on the 
data representing the proportion of 
correct recognitions when the probe 
digit was old and for the data repre- 
senting the total of all correct recogni- 
tions (both old and new stimuli). 
Both analyses showed that the main 
effects of conditions and Ss were highly 
significant; for correct recognitions 
when the probe digit was old, F (4, 
56) = 6.66, p < .01 and F (14, 56) = 
6.80, p < .01, respectively. For total 
correct recognitions, corresponding 
values were F (4, 56) = 748, p < .01 
and F (14, 56) = 6.09, p < .01, respec- 
tively. A Newman-Keuls analysis for 
the data based on correct recognitions 
when the probe digit was old indicated 
that performance for Cond. 0 was 
poorer, p < .05, than that for Cond. 4, 
5, and 6, and that performance for 
Cond. 7 was poorer, p < .05, than that 
for Cond. 4 and 6. No other com- 
parisons were significant. A corre- 
sponding analysis for the data based on 
total correct recognitions indicated that 
Cond. 0 and 7 were significantly dif- 
ferent, p < .05, from Cond. 4 and 6, 


THOMAS M. WOLF AND JOHN C. JAHNKE 


but not from each other. No other 
comparisons were significant. 

Serial-position analyses of the ex- 
perimental data showed markedly su- 
perior performance on the repeated 
items in each of the four repetition 
conditions. Within each condition, 
performance was better for the re- 
peated than for any of the nonrepeated 
items in every case but one; in Cond, 
5, performance on the item at SP 7 
was very slightly superior to that on 
the repeated item. Performance on 
items adjacent to the repeated items 
was sometimes better and sometimes 
poorer than on corresponding control 
items. No obvious pattern of differ- 
ences was observed. 

Memory-operating characteristics 
(MOC).—A response placed in con- 
fidence category “yes-5” of the present 
study represents S's application of the 
strictest decision criterion, "'yes-4" the 
next strictest, etc. With the 10 ratings 
of the present study, there are 9 de- 
cision criteria ranging from strict to 
lax. Plots which show the cumulative 
probability of assignment of S's re- 
sponse to Category j or “stricter” 
when the probe stimulus was old or 
when the stimulus was new have been 
called “memory-operating characteris- 
tics” (Norman & Wickelgren, 1965) ; 
such plots reflect how well 5 partitions 
his ratings of confidence between old 
and new stimuli. MOC curves are 
shown separately in Fig. 1 for the criti- 
cal items (repeated items of experi- 
mental conditions and items occupying 
corresponding serial positions in the 
control condition) and the noncritical 
items (nonrepeated items of experi- 
mental strings and items occupying 
corresponding serial positions in the 
control condition). 

Each panel of Fig. 1 shows that the 
experimental and control curves for 
critical items differ greatly, but that 
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Fic, 1. Memory-operating characteristics for the experimental conditions of Exp. I. 
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this is not the case for the noncritical 
items. Plots of these data were also 
made on normal-normal probability 
paper. These plots were fitted rela- 
tively well by straight lines of near 
unit slope; curve fitting was done by 
inspection. Following Wickelgren and 
Norman (1966), values of d, a mea- 
sure of signal detectability, were ob- 
tained for the experimental conditions 
by estimating the coordinates of the 
intersection of the straight lines with 
the negative diagonal These values 
are shown in Table 1. Both Fig. 1 
and Table 1 clearly show that recogni- 
tion performance is better for repeated 
items as compared with correspond- 
ing (nonrepeated) items, but that this 
improvement does not extend to the 
aggregate of remaining items in those 
strings in which repetitions occur. 
Latencies.—Latencies can be used as 
an indicator of S's confidence in the 
correctness of his responses. If S is 
confident a probe digit was old, there 
should be little delay in his response; 
consequently, latencies would be ex- 
pected to decrease with increasingly 
correct performance. Means of me- 
dians of the reciprocals of such latencies 
are presented in Table 2 separately for 
each serial position in each experi- 
mental condition. The conversion of 
the latencies to reciprocals normalized 
the distribution, and the use of medians 
made it possible for each S' to contrib- 
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TABLE 1 


VALUES OF d For CRITICAL AND NoN- | 
CRITICAL ITEMS For EACH 
EXPERIMENTAL COMPARISON 


Critical Noncritical . 
Cond. 
Exp. Cont. Exp. Cont. 
4 2.68 | 111 | 1.90 | L54 
5 2.08 | 122 | 141 | 1554 
6 246 | 141 | 183 | 141 
7 2.68 | 148 | 122 | 141 


ute equally to the data in most instances — 
(all Ss contributed a value except one - 
S in Cond. 7, SP 5). It can be seen ` 
that, on the whole, latencies were ' 
shorter for the repeated items than for — 
the corresponding control items. It 
will be remembered that recognition 
performance was also better for the — 
repeated than for corresponding con- - 
trol items. E 
Spearman rank-order correlation co- _ 
efficients were calculated between re- _ 
sponse speed and recognition perform- 
ance for each condition separately. — 
For Cond. 4, 5, 6, and 7, respectively, - 
p = 429, 271, .643, and .700, p > .05. — 
For Cond. 0, p = 929, p < .05. Only 
for the control condition, in which — 
there were no repeated items, was re- 
sponse speed closely correlated with | 
accuracy of recognition. If latencies 
were to be considered a measure of the 
order in which items in memory are 


Serial Position 


Cond. 
1 2 3 4 5 6 7 
A 940 923 .864 923 .825 .925 918 
5 934 850 -862 (882 “850 1899 :930 
6 1802 :933 (827 1825 851 :933 942 
7 937 985 820 (857 890 :866 :985 
Control 925 838 879 816 1854 1882 1.004 
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scanned, data for Cond. 0 might be 
taken to suggest a scanning in order of 
strength (cf. Corballis, 1967) ; how- 
ever, such a suggestion lacks convic- 
tion, since response speed and strength 
were not significantly correlated in the 
remaining conditions. Inspection of 
Table 2 would further indicate that 
scanning was probably not occurring 
in serial order, since response speed 
and serial position appear to be either 
inversely or only weakly correlated. 

If latencies are to be taken as a 
measure of order of scanning, present 
data suggest no simple scanning struc- 
ture. Such data as the present would 
then appear to pose difficulties for both 
a "nonassociative" model of short-term 
memory (STM) which specifies a fixed 
order of read-out and an "associative" 
model which relies heavily on the 
strengths of interitem associations to 
determine order of read-out (cf. 
Wickelgren, 1966). 


EXPERIMENT II 


The recognition procedure of Exp. I 
showed that memory for repeated items 
was facilitated, relative to that for cor- 
responding control items. Inhibition 
of memory for repeated items, on the 
other hand, has been associated solely 
with recall procedures ; it was the pur- 
pose, therefore, of Exp. II to confirm 
the Ranschburg phenomenon with the 
stimulus materials of Exp. I. 


Method 


Subjects—The Ss were five female and 
five male students enrolled in psychology 
classes at Miami University. They partici- 
pated in the experiment to fulfill a course 
requirement. 

Materials and procedure.—The stimulus 
materials were identical to those used in 
Exp. I, except that during the 1-hr. experi- 
mental session no practice trials were given 
and S was presented with just one of the 
four sets of 84 items. The Ss were randomly 
assigned to one of the four sets with the 
constraint that one of each four successive 
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Ss was assigned to a different one of the 
four sets. 

The Ss were instructed to read the digits 
aloud and then to repeat them immediately 
from memory in serial order, They were 
told to say “blank” if they could not remem- 
ber an item or its position, so that their re- 
sponses exactly duplicated the original string. 
Tape recordings were made of S’s responses 
and were used to check E's records. La- 
tencies were recorded by a voice key. 

The remaining apparatus and other details 
of procedure were the same as in Exp, I. 


Results 


The proportions of items correctly 
recalled for Cond. 4, 5, 6, 7, and the 
control, respectively, were .955, .901, 
901, .929, and .948. For this and all 
subsequent analyses, unless otherwise 
specified, an item was scored correct 
even if it was not given in its correct 
serial position (free-recall scoring). 

Serial-position analyses of the ex- 
perimental data are presented in Fig. 2. 
For this figure a repeated item mis- 
placed by S was scored to the closest 
position in which a repetition should 
have been. For example, if repetitions 
should have been in SPs 2 and 5 but 
actually were given in recall in SPs 
3 and 6, the first repetition was 
assigned to SP 2 and the second to 
SP 5. If only a single repeated item 
was given in recall and was halfway be- 
tween the two positions in which repe- 
titions were to have been, half were 
scored to the earlier positions and half, 
the later. It can be seen in Fig. 2 that 
inhibition of recall occurred for the 
critical items, relative to corresponding 
control items, in all experimental con- 
ditions except Cond. 4, where facilita- 
tion occurred. Inspection of Fig. 2 
indicates that inhibition in recall ap- 
pears to be most clearly associated with 
the second occurrence of the repeated 
digit. 

Sign tests were used to compare per- 
formance on the critical and noncritical 
items in the experimental and control 
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Fic. 2. Serial-position analyses for the experimental conditions of Exp. II. 


conditions, because of restriction in 
range in scores and skewness in their 
distribution. Both serial-recall (item 
correct only if given in correct SP) 
and free-recall scoring analyses indi- 
cated that performance was better for 
the repeated items than for the cor- 
responding control items for Cond. 4; 
however, these differences were not 
statistically significant, p= .09 and 
.145, respectively. Performance was 
poorer (serial-recall scoring) for the 
repeated items than for corresponding 
control items for Cond. 5 and 6, p < 
-011 and .011, respectively, as was that 
with free-recall scoring, p < .02 and 
011, respectively. Performance was 
poorer on critical than on correspond- 
ing control items for Cond. 7 ; although 
this difference was not significant with 
free-recall scoring, p = .50, it was with 
serial-recall scoring, p = .035. Analy- 
ses for noncritical items made sepa- 


rately for each experimental condition 
with its appropriate control indicated 
that there were no significant differ- 
ences in performance, p > .254 in all 
cases, except for Cond. 4, p< .055 
(serial-recall scoring). Perhaps be- 
cause performance was so good overall, 
no evidence was found for associative 
intrusions of the type described by 
Wickelgren (1966). 

The procedure for computing la- 
tencies was the same as in Exp. I; 
however, each value constituted a re- 
sponse to the first item of the string 
instead of a single item, as was the case 
in Exp. I. The mean of medians ob- 
tained for Cond. 4, 5, 6, 7, and the con- 
trol, respectively, were .782, .770, .742, 
./51, and 767. Again, latencies would 
be expected to decrease with increas- 
ingly correct performance. Facilitation 
was observed for Cond. 4, and the la- 
tency was shorter than for the control. 
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Inhibition was observed for Cond. 5, 
6, and 7, and the latencies were longer 
than for the control, except for Cond. 
5, which closely approximated the con- 
trol. 


Discussion 


The problem posed by the present data 
is how memory for repeated items is 
facilitated when retention is measured by 
recognition but inhibited when retention 
is measured by recall. One explanation 
runs as follows. Facilitation in the rec- 
ognition of repeated items occurs because 
recognition performance depends upon 
S’s remembering correctly the presence 
of the repeated item in the original series 
but not at all upon his remembering that 
that particular item was a repeated item 
(e.g, S could remember that the digit 
“7” was in the series but not that “7” had 
appeared twice, yet still correctly accept 
a probe-digit “7”). Since each repetition 
ought to increment trace-strength, re- 
peated items on average ought to be 
stronger, and, thus, more easily recog- 
nized than nonrepeated items, 

Performance in recall, however, de- 
pends both upon S’s retrieval of the re- 
peated item and upon his remembering 
that that particular item was a repeated 
item (e.g, S must remember both that 
"7" was in the series and that “7” oc- 
curred twice). Inhibition in the recall of 
repeated items would be expected if occa- 
sionally S fails to detect or to remember 
that an item was repeated, or if S remem- 
bers that an item was repeated, but not 
which one; in such instances, S is likely 
to emit the critical item only once. In 
this way performance on the critical item 
will be reduced at one of the two serial 
positions at which that item had appeared. 
Other considerations suggest a reason 
for the apparent differential bias against 
recall of the second occurrences of the re- 
peated item. Since recall is serial, sec- 
ond occurrences of a repeated item will 
be longer in store and suffer greater out- 
put interference (Tulving & Arbuckle, 
1963; 1966) on average than first occur- 
rences. Tt only need be further supposed 
that memory of a particular item which 
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occurred twice is more subject to such 
disruptive effects than is memory for a 
corresponding control item. From these 
considerations it follows that the Ran- 
schburg phenomenon ought to be one 
in which repeated items are not merely 
misplaced but, rather, not given at all 
if not given in their correct serial posi- 
tion. Further, inhibition ought to be 
greater at the serial position of the 
second occurrence of the repeated item. 
Experiment II showed that repeated items 
were omitted more frequently than control 
items as the present argument suggests, 
except in Cond. 7, where the critical item 
occupied a distinctive serial position and 
where the critical item was apparently 
simply misplaced. The data of Exp. II 
and of Crowder and Melton (1965) sug- 
gest that inhibition was greater for the 
second occurrences of the repeated items, 
as the present argument also suggests. In 
part, the present statement holds that 
the Ranschburg phenomenon involves 
not the forgetting of an item, but that the 
item was a repeated item. As such, the 
present statement appears to depend upon 
the notion that an item may be repre- 
sented in memory in more than one way, 
a notion for which there is precedent 
(e.g, Yntema & Trask, 1963). 

In conclusion, the present experiments 
suggest that the Ranschburg phenomenon 
may be primarily a function of the re- 
trieval procedure. Repetition of an item 
appears only to strengthen the memory 
trace for that item; failures to recall 
repeated items seem to occur only when 
it is necessary to remember both that an 
item occurred and that the item was re- 


peated. 
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IN HUMAN-PROBABILITY LEARNING? 
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2 experiments investigating concurrent simultaneous and successive 
contrast effects with human Ss in a probability learning task are re- 


ported. 


Successive and simultaneous negative contrast effects were 
noted with 2 nonzero levels of incentives. 


These effects were not 


obtained with a paired zero and nonzero incentive level. An absolute 
magnitude hypothesis for the latter condition was proposed. 


There is a good deal of evidence to 
suggest that incentive effects are con- 
textual phenomena.  Pubols' (1960) 
review of incentive studies using animal 
S's showed that no differences in per- 
formance are found between groups 
experiencing different, nonzero incen- 
tives (absolute method or between-Ss 
design). However, performance dif- 
ferences that are a function of incentive 
magnitude have been reliably obtained 
from studies in which animal Ss have 
simultaneous experience with more 
than one level of incentive (differential 
method or within-Ss design). 

A series of recent studies using hu- 
man Ss have demonstrated a striking 
parallel with the data reviewed by 
Pubols. Harley (1965a) using a 
paired associates (PA) task and My- 
ers, Fort, Katz, and Suydam (1963) 
using a probability learning task found 
no significant differences in . per- 
formance, with a between-Ss incentive 
design.. Significant differences in PA 
anticipation (Harley, 1965b) and in 
the probability of choosing the more 
frequent event P(A,) (Lipkin, 
Schnorr, Suydam, & Myers, 1965) 
have been obtained from a within-Ss 
design. Similar effects with within-Ss 
designs have been demonstrated by 
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Schnorr, Lipkin, and Myers ( 1966) 
and by Schnorr and ‘Myers (1967) in 
probability learning tasks, 

A consistent feature of research em- 
ploying both human and animal Ss is 
that the within-5s incentive effect is one 
of depression or negative contrast, i.e., 
a decrement in performance to the low 
incentive member of the stimulus pair 
in a differential design as compared to 
performance for the same incentive 
magnitude in an absolute design. Cor- 
responding elation effects are rarely 
found. 

The cited research suggests that 
contrast effects (CEs) are obtained 
when the same Ss have experience 
with more than one level of incentive. 
Consequently a successive contrast (in- 
centive shift) paradigm should result 
in similar effects. Successive negative 
CEs have not been reliably obtained 
with this paradigm and Vogel, Mi- 
kulka, and Spear (1966) have attrib- 
uted the failure to find negative CEs 
with the successive paradigm to tem- 
poral factors stich as the spacing of 
trials. Studies using the successive 
paradigm with human Ss are rare and 
inconclusive, but it does appear that 
only negative CEs are found (Swens- 
son, 1965). 

Recently, Spear and his associates 
combined the successive and simultane- 
ous: paradigms in a- single design. 
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Spear and Hill (1965) reported an ex- 
periment in which rats, initially trained 
to choose between two rewards of dif- 
ferent magnitudes in a position dis- 
crimination, experienced a shift in the 
quantity or quality of reward to the 
more favorable alternative (MFA) 
while reward to the less favorable 
alternative (LFA) remained constant. 
They obtained a simultaneous but no 
successive CE. Spear and Spitzner 
(1966) reported a series of four ex- 
periments, the first three of which 
basically support the Spear and Hill 
finding of no successive CEs. The 
fourth study was important in that a 
successive CE was demonstrated. This 
is especially pertinent in that it was 
designed to determine the nature of 
contrast in the reward, nonreward 
Situation. An interesting finding was 
that zero incentive was experienced as 
"less reward" rather than zero reward 
when S had simultaneous experience 
with a nonzero incentive. 

The present paper reports two ex- 
periments combining the simultaneous 
and successive paradigms in a single 
design with human Ss. The general 
purpose of the two experiments was to 
determine the nature of successive and 
simultaneous incentive contrast phe- 
nomena in human Ss for two nonzero 
incentives (Exp. I) and for a paired 
zero and nonzero incentive level (Exp. 
II). 


EXPERIMENT I 


Method 


Subjects.—The Ss were 35 male and 35 
female University of Denver undergraduate 
volunteers randomly distributed into seven 
groups of 10 Ss each with the restriction 
of an equal number of males and females 
in each group. 

Procedure.—The basic procedure was 
adapted from Schnorr et al. (1966). The 
stimuli were white cards similar to playing 
cards. The face of each card was rubber 
stamped with either a line or a dot to repre- 


J. HALPERN, J. A. SCHWARTZ, AND R. CHAPMAN 


sent the stimulus event, and the back of the 
card was stamped either 1¢ or 5¢ referring 
to the incentive level for that card (trial), 
The cards were assembled in decks of 250 
cards each and shuffled prior to the experi- 
ment and in full view of each S. In all 
decks of 250 cards, the probability of the 
more frequent event P(E;) was .60. E; 
was the dot for half of the Ss in each 
group and the line for the remaining half, 

All Ss were given 500 trials in two series 
of 250 trials each. Groups were delineated 
on the basis of their first series and second 
series incentive conditions, with some groups 
experiencing an incentive shift after the 
first 250 trials. The seven groups were: 
55 (indicating 5¢ trials for both series of 
250 trials), 11, 15, 51, 5&1—5&l 
(both series consisting of randomly mixed 
1¢ and 5¢ trials of equal frequency), 5&1—5, 
and $&1—1. 

Each S was run individually and instructed 
to predict whether the nonvisible side of the 
stimulus card would reveal a line or a dot 
when turned over. The level of incentive 
visible on each card at the beginning of the 
trial informed S of the amount of money 
that he would win with the correct prediction 
of the stimulus event or lose for an incor- 
rect prediction. An initial stake of chips 
was given to S and at the end of the experi- 
ment, all chips were exchanged for money 
with the initial stake subtracted from the 
winnings. No Ss received any instructions 
concerning an incentive shift; after the first 
250 cards were depleted, E left the room and 
returned with a new deck of cards. 


Results and. Discussion 


The mean preshift P(A,)s over 
blocks of 50 trials for all groups are 
given in Table 1. The last trial block 
Shows that groups receiving identical 
treatments are roughly equivalent. 
Orthogonal comparisons between these 
like-treated groups for the last trial 
block substantiated a hypothesis of no 
difference. Further, the last trial block 
shows that differences between groups 
receiving only a single level of incen- 
tiveare minimal. Inspection of the data 
for groups run under the within-Ss 
design shows some differentiation for 
the different incentive magnitudes. The 
lack of any clear asymptotic data and 
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the large amounts of error variability, 
especially for those groups experienc- 
ing two incentive levels, preclude any 
further discussion of the preshift data. 

Figure 1 gives the mean postshift 
P(A,)s in blocks of 50 trials for 
Groups 555, 1-1, and 5&15&l. 
The figure shows extensive probability 
overshooting which is typical for prob- 
ability learning tasks with monetary 
incentives. Further, there is a clear 
depression in P(A,) for the 1¢ data of 
Group 5&1—5&1 as compared with the 
5€ data of this and the remaining 
groups. Asymptotic performance (last 
100 trials), which was characterized 
by substantially less variance than pre- 
shift performance, was roughly identi- 
cal for the between-Ss groups and the 
5€ portion of the within-Ss group. A 
planned comparison of the asymptotic 
performance of the between-Ss groups 
with the 1é portion of the within-Ss 
group proved significant, t (27) = 3.6, 
p < 01, showing a negative CE as a 
function of experimental context in 
agreement with previously cited stud- 
les. 

Figure 2 shows the mean postshift 
P(A,)s for Groups 55, 11, 135, 


TABLE 1 


Proportion oF A; RESPONSES FOR EACH 
Group Over THE First Frye BLocks 
or Trrats (Exp. I) 


Blocks of 50 Trials 

Group 
1 2 3 4 5 
55 553 | .604 | .688 | .684 | .716 
121 "576 | .611 | .694 | 662 | .688 
1—5 1553 | .647 | .664 | -656 | .671 
541 "562 | -540 | .591 | .662 | .679 
1&5—1&S (1) |.472 | .519 | .546 | .530 | .614 
(5) | 502 | 587 | 641 | .650 | .641 
1&55 (1) | .460| .552 | .611 | 552 | .508 
(5) | 1551 | .590 | .672 | .640 | 631 
1&5—1 (1) | .526 | 510 | .596 | .591 | .649 
(5) | 1581 | .662 | .677 | .640 | .687 
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Fic. 1. Proportion of A: responses in 
blocks of 50 trials over the last 5 trial 
blocks for Groups 5—5, 1—1, and 5&1— 
5&l. (Exp. I) 


5—1, 5&1—5, and 5&1—1. The 
P(A,)s corresponding to the asymp- 
totic portions of the curves (last 100 
trials) were combined to form a two 
factor analysis of variance with three 
levels of preshift incentive (1€, 56, and 
1¢&5¢) and two levels of postshift in- 
centive (1€ and 546) with postshift 
P(A,) serving as the dependent vari- 
able. The analysis showed that pre- 
shift treatment had a significant effect 
on postshift performance, F (2, 54) — 
10.48, p « .01. Post hoc omparisons 
(Scheffe, 1959) were performed to 
determine the nature of the effect. 
Significant differences were obtained 
for each of the orthogonal compari- 
sons of Groups 5—5 and 1—1 with 
Groups 51, 1&5—1, and 1&5—5. 
The first two comparisons demonstrate 
an incentive shift negative CE. Ela- 
tion effects were not obtained, ie., 
P(A,) for Group 15 was roughly 
equal to the performance of Group 
5—5 and the shift from 1&5 to 5 re- 
sulted in a significant depression effect 
when compared to Groups 55 and 
1—1. This finding is surprising and 
cannot be accounted for by any of the 
theories usually invoked to explain 
CEs, ie, a frustration or adaptation 
level theory. As a working hypothesis 
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blocks of 50 trials over the last 5 trial 
blocks for Groups 5—1, 1—1, 5—1, 15, 
5&1—5, and 5&1—1. (Exp. I) 


for further research, this finding sug- 
gests that any shift from a within-Ss 
incentive condition to a more simple 
condition results in a depression. One 
possible explanation would be to attrib- 
ute this depression effect to a loss of 
interest after the shift to a simpler con- 
dition. This hypothesis would also 
imply that a depression effect such as 
that shown for Group 1&5—1, which 
would ordinarily be considered as a 
negative CE, might also be due to 
this “loss of interest.” 

In general, the results of Exp. I are 
in agreement with the results of other 
incentive studies using animal and hu- 
man Ss in that a within-Ss negative CE 
was obtained with no concomitant be- 
tween-Ss effect. The study also demon- 
strates an incentive shift, negative CE 
with human Ss. This successive CE 
is often difficult to find with animal Ss 
presumably because spacing of trials 
permits the possibility of extinction of 
the low incentive response habit. The 
massing of trials in the present experi- 
ment was obviously sufficient for main- 
tenance of the habit. 


EXPERIMENT II 


This experiment was essentially a 
replication of Exp. I with different in- 
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centive magnitudes. Specifically, the 
second experiment was designed to de- 
termine whether the pairing of a zero 
and small nonzero incentive magnitude 
would be comparable to the pairing of 
two nonzero rewards. Incentives of 
0€ and 16 were selected so that the ab- 
solute value of the difference between 
incentives in the two experiments were 
clearly dissimilar. According to a con- 
textual hypothesis frequently used to 
explain CEs, the absolute difference 
between incentives should have little 
or no effect on performance. 


Method 

The seven groups of 10 Ss each were: 
11, 0-0, 1-0, 0—1, 1&0—1&0, 1&0—0, 
and 1&0—41. The 0¢ cards for all groups 
receiving 1é trials either pre- or postshift 
were stamped "0c." The cards for Group 
0—0 were left blank. This was an attempt 
to enhance the secondary reinforcing prop- 
erties of the 0¢ reward and thereby deter- 
mine whether Spear and Spitzner's conclu- 
sion (no reward is seen as less reward for 
Ss experiencing a concomitant nonzero in- 
centive) could be applicable to humans. 


Results and. Discussion 


Table 2 gives the mean preshift 
P(A,)s in blocks of 50 trials for each 
group. Analyses of the last trial block 
for groups receiving identical treat- 
ments showed no significant differences 
between the P(A,)s. Differences be- 
tween groups receiving only a single 
incentive level and differences in per- 
formance to the two incentive magni- 
tudes of the between-Ss groups were 
also insignificant, As in Exp. I, the 
preshift data were characterized by a. 
lack of clear asymptote and large 
amounts of error variability. 

Figure 3 gives the postshift P(A,)s 
in blocks of 50 trials for Groups 1—1, 
00, and 1&0—1&0. The figure 
shows a clear between-Ss incentive 
effect for the last 100 trials, ¢ (18) = 
3.91, p< 01. This is not surprising | 
in that it demonstrates probability 
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TABLE 2 


PROPORTION or A; RESPONSES FOR EACH 
Group OVER THE First Five BLocks 
or Trrats (Exp. II) 


Blocks of 50 Trials 

Group 
1 2 3 4 5 
11 -556 | .640 | .651 | .632 | .659 
0-0 .552 | 536 | .591 | .572 | .618 
0-1 .502 | .561 | .602 | .591 | .658 
1-0 .582 | .632 | .601 | .722 | .710 
1&0—1&0 (0) |.562 | 559 | 548 | .540 | .591 
(1) |.496 | .471 | .502 | .566 | .621 
1&0—1 (0) | 491 | .540 | .576 | .588 | .591 
(1) | 555 | .612 | .608 | .641 | .646 
1&0—0 (0) | .498| .561 | 562 | 563 | .599 
(1) |.592 | .644 | .622 | .694 | .698 


matching for Group 0-0 and over- 
shooting for Group 11. Such prob- 
ability matching with no incentive and 
overshooting for a monetary incentive 
is a reliable finding of probability learn- 
ing research. What is surprising is 
the lack of any within-Ss incentive 
effect, seemingly due to the significant 
depression of P(A,) to the 1€ incentive 
of Group 1&0—1&0 as compared to 
P(A,) for Group 11, t (18) = 3.21, 
p<.01. It might be compelling to 
conclude that the lack of a within-Ss 
incentive effect shows that both 06 and 
1 are equivalent and perhaps negligible 
rewards. Such an interpretation, how- 
ever, is obviated by the differences be- 
tween Groups 0-0 and 11. It is 
interesting to note here that Harley 
(1965a, 1965b) obtained opposite re- 
sults with paired 0¢ and 25€ incentives 
ina PA task (a within-Ss negative CE 
with no between-Ss CE). An obvious 
reason for the discrepancy between the 
present results and Harley's results is 
the different nature of the tasks and 
response measures. A second possi- 
bility concerns the absolute magnitude 
of the difference between a 25¢ reward 
and a 1¢ reward vs. no reward. It was 
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mentioned previously that zero incen- ' 
tive when paired with "some" reward 
seems to take on features of less re- 
ward rather than zero reward. Some 
support for such a contention comes 
from Spear and Spitzner who found 
that rats experiencing a simultaneous 
12 food pellet (FP) and no reward 
did not differ in percentage of choice 
of the MFA from rats receiving 12&1 
FP. However, both of these groups 
differed significantly from a group re- 
ceiving 1&0 FP. The authors’ results, 
together with those of Spear and Spitz- 
ner, would suggest an absolute magni- 
tude hypothesis for CEs for the pairing 
of a very low and no incentive, i.e., the 
absolute magnitude of the difference 
between 1¢ and O¢ is small, especially 
if O¢ is not preceived as “no reward." 
Generalization of an absolute magni- 
tude hypothesis to other incentive com- 
binations would be difficult since there 
is some evidence to support a con- 
textual hypothesis when two nonzero 
levels of incentive are paired (Schnorr 
& Myers, 1967). 

Figure 4 presents the mean postshift 
P(A,)s in blocks of 50 trials for 
Groups 1—1, 0-0, 10, 021, 1&0 
1, and 1&0—0; and shows for all of 
these groups that postshift 0¢ perform- 
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blocks of 50 trials over the last 5 trial blocks 
for Groups 11, 00, 01, 1—0, 1&0—1, 
and 1&0—>0. (Exp. II) 


ance was lower than postshift 1¢ per- 
formance. An analysis of variance of 
the same type as that used in Exp. I 
for the last 100 trials showed only a 
significant postshift treatment effect, 
F (1, 54) = 25.3, p < .01. This find- 
ing together with the previous one, 
shows the lack of any CEs with human 
Ss for a paired zero and small nonzero 
incentive magnitude. 

It may be noted from Fig. 4 that 
the asymptotic P(A,)s for Groups 
10 and 1&0—0 were greater, 
although not significantly so, than for 
Group 0-0, Similar results were ob- 
tained by Swensson (1965) in an in- 
centive shift study with human Ss, 
ie. the shift from 5¢ to 0€ resulted 
in higher terminal P(A,) than a group 
receiving 0€ all through training. This 
suggests that the hypothesis presented 
earlier, that no reward is perceived as 
less reward when it is simultaneously 
paired with a nonzero reward, might 
also be true for human Ss when the 
two are paired successively. 
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SEMANTIC STRUCTURE IN SHORT-TERM MEMORY * 
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This paper reports an experiment on semantic structure in short-term 
memory. A selected portion of the English kinship system provided 
the semantic structure. Kin terms were drawn írom a model having 
3 dimensions or features—sex, generation, and lineality-collaterality. 
8 terms were used in every possible combination of triads. These 
triads were presented to Ss who were instructed to recall them on 
signal. Counting backwards by 3s occupied Ss between presentation 
and recall. The recall intervals were 4, 8, and 16 sec, Results showed 
that recall was poorer for triads in which the terms differed by all 
3 features rather than only by 2. An analysis of errors showed that 


more intrusions occurred with 3 features than with 2. 


Short-term memory, like most cog- 
nitive activity, is sensitive to linguistic 
structure. For example, recall varies 
with meaningfulness as measured by 
association value (Peterson, Peterson, 
& Miller, 1961). Further, Murdock 
and vom Saal (1967) showed that 
there is better recall for trigrams taken 
from single categories than for those 
taken from different categories. In 
a different experimental paradigm, 
Wickelgren (1965, 1966) found that 
confusions between letters in short- 
term memory are determined by the 
phonemic features possessed in com- 
mon by the letters. Finally, the obser- 
vations of Brown and McNeill (1966) 
on the “tip-of-the-tongue” phenomenon 
suggest that retrieval from long-term 
memory consists, in part, of combining 
various semantic and phonemic fea- 
tures in different ways until the item 
to be retrieved is recognized. 

Given these experimental results, 
there is amiple reason to suppose that 
short-term memory will be sensitive 
to variations in semantic structure. 
This paper examines the effect of the 
structure of a particular semantic field, 


1 The research reported in this paper was 
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kinship, upon short-term retention. 
Kinship is representative of well-struc- 
tured semantic fields. It has been ex- 
tensively studied by anthropologists and 
subjected to linguistic componential 
analysis (Romney &  D'Andrade, 
1964). The experiment reported here 
was performed on the expectation that 
errors in recall would reflect the inter- 
nal structure of a portion of the En- 
glish kinship system, as that system is 
specified by componential analysis. 
Specifically, the authors expected that 
the fewer the semantic features repre- 
sented in an item (an item consisting 
of a triad of kinship terms), the better 
the retention of that item. 

Figure 1 shows the kinship terms 
used in the present study, and illus- 
trates the structure of the semantic 
system relating these terms. Three 
features of dimensions are represented 
in the model. These are generation, 
sex, and lineality-collaterality. Any 
given triad may exhibit variation in 
either two or three features. For ex- 
ample, the triad mother-aunt-father 
uses terms that differ in only two fea- 
tures (sex and collaterality), whereas 
the triad aunt-father-nephew contains 
terms differing in all three features. 
All things equal, fewer errors with the 
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Aunt Uncle 


Motec LA S. 


Nephew 


Daughter Son 


Fic.. 1. The kinship terms used in this ex- 
periment shown as the surface of a cube. 


triad mother-aunt-father than with the 
triad aunt-father-nephew should be ex- 
pected. 


METHOD 


The general method is a familiar one 
in short-term memory studies. The au- 
thors exposed an item consisting of three 
kinship terms to Ss, and, after a variable 
delay interval, S' was signalled to recall the 
item. A distracting task filled the delay 
interval. 

Four lists of 14 items each were used. 
Each list contained six triads having a two- 
feature structure and eight triads having 
a three-feature structure. In any set of 
eight elements, there were 56 unordered 
combinations of three elements taken at a 
time. The 56 different triadic combinations 
from the set of eight kinship terms were used 
in such a way that each term occupied the 
different positions in the triad approximately 
equally. Four different delay intervals were 
used, 0, 4, 8, and 16 sec. For each S, each 
interval was used with one of the four lists, 
Each S was tested on each list, the different 
Ss being presented different combinations 
of list order and delay interval order, 

The Ss, 24 Johns Hopkins University male 
undergraduate students, were paid for par- 
ticipation, and they were tested individually. 
Each S was told that the experiment con- 
cerned memory. He was practiced in 
counting backwards by threes (the distract- 
ing task) with a set of terms not used in 
the experiment itself. Finally, 5 was re- 
quired to write out all eight of the kinship 
terms actually used in the experiment to 
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make sure that Ss were well aware of the 
limited portion of the kinship system they 
were to use. 

The following is the complete proced- 
ure for a single trial: (a) three kin 
terms appeared simultaneously in the win- 
dow of a memory drum for 1 sec; (b) a 
three-digit number appeared in the window 
for 1 sec, S named the number, and began 
to count backwards by three; (c) the num- 
ber was replaced by a series of asterisks 
which remained during the retention interval 
and during which S continued to count; and 
(d) a buzzer sounded at the end of the 
retention interval, and S' recalled the three 
terms. The S was required to.name three 
terms, even if he had forgotten one or more 
of those presented. The whole sequence was 
recorded both by a tape recorder and by E 
actually present. Latencies were timed from 
onset of buzzer to onset of S's response. 
Postexperimental inquiries probed for cod- 
ing and mediating devices by Ss. 


RrsuLTS 


Items were scored as correctly re- 
called if they were identical both in 
content and order to the items actually 
presented. Figure 2 shows the reten- 
tion for all items in the experiment 
compared with the results from Mur- 
dock's (1961) experiment, which stud- 
ied retention for triads of common 
English words. The curves for the 
two experiments are quite close. The 
fact that retention in the present ex- 
periment is slightly higher than. in 
Murdock's may itself reflect the highly 
Structured nature of the kinship sys- 
tem. In any event, despite the dif- 
ferences (the kinship terms are mixed 
one- and two-syllable words), the re- 
sults of the present experiment are 
similar to those from the earlier ex- 
periments, 

Figure 3 shows that at all retention 
intervals, recall is slightly better for 
two-featured items than for three-fea- 
tured items. Excluding the zero delay 
test (for which no difference can be 
expected to occur), the difference be- 
tween the recall of two-featured and 
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Fic. 2. The retention curve for the data in the 
present experiment compared with earlier results. 


three-featured items is statistically sig- two kinds of errors, transpositions and 
nificant by a sign test (p < .05). intrusions. A transposition occurs 

Since Ss were required to give three when a term presented is recalled but 
terms for each recall, there were only in the wrong position. An intrusion is a 
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Fic. 3. The retention curve for two-featured items 
compared with three-featured items. 
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kinship term not actually given on that 
particular test. Overwhelmingly, such 
intrusions were from the portion of 
the kinship structure used in the ex- 
periment. Other terms (such as 
grandfather or cousin) occurred only 
very rarely. Table 1 shows an analysis 
of transpositions and intrusions, The 
numbers in this table are the rates of 
occurrence of each type of error (fre- 
quency divided by number of items 
scored). There are 144 observations 
per cell for two-featured items and 192 
observations per cell for three-featured 
items. Note that, on any given item, 
both an intrusion and a transposition 
may be possible. In general, the table 
shows that both transpositions and in- 
trusions increase with the length of 
the retention interval and that intru- 
sion errors occur at a higher rate when 
the items differ in all three features. 
A more detailed analysis of the pat- 
tern of errors reveals even more di- 
rectly the influence of semantic struc- 
ture. The authors compared each re- 
call triad with the triad originally pre- 
sented, and tabulated the paired rela- 
tions. For example, aunt-father-daugh- 


TABLE 1 


RATE oF OCCURRENCE OF TRANSPOSITION AND 
INTRUSION ERRORS PER TRIAD 


Transpositions 
Retention 
Interval Two. Three- 
Feature Feature All Items 
Items Items 
0 .00 .00 00 
4 93 60 50 
8 .69 52 59 
16 54 67 62 
Intrusions a 
0 .00 01 01 
4 399 67 61 
8 56 .69 -63 
16 EI 81 -70 


Note.—N = 144 per cell for two-featured items and 
192 per cell for three-featured items. 
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ter might have produced the recall aunt- 
daughter-mother. In that case, the 
three pairs would be aunt-aunt, father- 
daughter, and daughter-mother. A 
total of 28 different unordered pairs is 
possible. Of these, 12 differ by two 
features, and 4 by three features. One-, 
featured variation would be represented 
by any paired values on one of.the 
faces of Fig. 1, two-featured variation 
by any paired values around a corner, 
and the three-featured.variation by the 
paired diagonal vertexes. By chance, 
the one-featured errors should account 
for 43% of the total, as should the 
two-featured errors, while the three- 
featured errors should account for 
14%. In actual fact, the frequen- 
cies of the one-featured errors and the 
two-featured errors differed signif- 
icantly. One-featured pairs accounted 
for 49% and two-featured pairs 38%. 
The frequency distributions for the 
one-featured and two-featured pairs 
were compared by the Kolmogoroy- 
Smirnov two-sample test, and were 
found to be different at the .05 level. 
The three-featured errors accounted 
for 13% of the total. 

The latencies did not differ signif- 
icantly, either for the correct responses 
or the total of all responses, though - 
they showed the same pattern as per- 
centage correct responses—shorter la- 
tencies for small retention intervals 
than for large ones, and for two-fea- 
tured items than for three-featured 
items. 


Discussion 


The results of the experiment are con- ` 
sistent with earlier experiments on short- 
term memory. There is the familiar de- 
crease in items correctly recalled as a 
function of recall interval. In addition, 
transposition errors increase with reten- 
tion interval length (see Murdock & 
vom Saal, 1967). The new results show 
that recall is better when fewer features 
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distinguish the terms of an item and that 
there are more substitutions among items 
close together in the kinship structure 
than among items far apart. These re- 
sults, though not large in magnitude, 
show that the form of a semantic struc- 
ture, such as the kinship model, effects 
short-term memory. 

Stronger and more detailed influence 
of semantic structure is detectable in the 
pattern of transposition and intrusion 
errors. Since transpositions take place 
within a triad—without interaction with 
outside terms—no difference between the 
two-featuréd and three-featured items for 
transpositions should be expected. Ex- 
amination of the upper portion of Table 1 
shows that the difference between these 
is very small indeed. On the other hand, 
intrusions reveal interaction with terms 
outside of the triad actually presented, 
and the storage and retrieval of three- 
featured items should run more risk of 
intrusion than storage and retrieval of 
two-featured items. Furthermore, the 
fewer the features represented in an 
item, the less it should be affected by the 
length of the delay interval Both of 
these possibilities are supported by the 
results presented in the bottom part of 
Table 1. + 

One of the purposes served by the ex- 
periment is to provide experimental—as 
opposed to intuitive and linguistic—evi- 
dence for the reality of a particular 
semantic structure. The results of the 
comparison of the two- and three-fea- 
tured sets of trigrams provide that evi- 
dence. Further evidence that Ss pos- 
sessed some version of a semantic com- 
ponent model for kinship is found in 
the postexperimental inquiries. The Ss, 
of course, when asked, did not explicitly 
state that they encoded the trigrams in 
the form of some semantic model, but 
their statements did reveal an awareness 
of the importance of semantic kinship 
features for encoding. 

Here is an unselected (serial) sample 
of comments transcribed from the experi- 
mental tapes: (S1) “or like uncle, 
nephew and daughter—I'd remember the 


uncle and nephew as one because, you 
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know, they sort of go together"; (52) 
"What I was trying to do was to see a 
relationship between the words"; (S3) 
(no strategy reported) ; ($4) “There 
were some like, see, like father, uncle, 
son was pretty easy combination and 
maybe niece, mother, aunt or something 
like that"; ($5) “One method I used 
was relationship. I related uncle and 
aunt . . . I don't remember any tactics 
for [the third member of the triad)”; 
(S6) “I just used the first letter of each 
word"; (S7) “Well, like, if something 
came up like son-uncle-father, then I'd 
think they’re all men and use that.” 

It is worth noting that one of these 
protocols (as well as some others not 
presented here) reports some nonseman- 
tic encoding devices. Unfortunately 
there are only four Ss among those rec- 
ords which could be transcirbed and who 
report coding who fail to mention any 
kinship coding, therefore no reasonable 
comparison can be made between those 
Ss who report kinship coding and those 
who do not. 

The results of the present experiment, 
of course, can be described by a number 
of semantic “similarity” models, but the 
major findings suggest that any model 
must have the essential properties re- 
vealed by this experiment. These prop- 
erties, cast in the form of familiar simi- 
larity models, would require either a 
city-block metric (specifically a hollow 
cube in the portion of the kinship system 
used in this experiment) or a hexahedron 
with large distances between the diagonal 
vertexes. Perhaps more refined results 
could distinguish between these possibil- 
ities, though there is no compelling rea- 
son from current semantic theory why 
there should be a distinction between 
them. The point is that “stimulus simi- 
larity” or other such variables in experi- 
ments of this sort require some multi- 
dimensional or multifeatured treatment. 
In addition, the student of psycholinguis- 
tics is gratified to note that this partic- 
vlar psychological function—short-term 
memory—reveals the influence of seman- 
tic features. 
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EFFECTIVENESS OF RETRIEVAL CUES IN 
MEMORY FOR WORDS* 


ENDEL TULVING Aw» SHIRLEY OSLER 


University of Toronto 


Ss had to memorize lists of 24 to-be-remembered (TBR) words. 
The TBR words were exposed for study on a single input trial, in 
presence or absence of cue words—weak associates of the TBR words. 
Recall of TBR words was tested in presence or absence of these cue 
words. The findings showed that (a) cue words (retrieval cues) 
facilitated recall of TBR words when they were present both at input 
and output, (b) retrieval cues did not enhance recall of TBR words 
when they were present only at output, and (c) 2 retrieval cues pre- 
sented simultaneously with each TBR word were no more effective in 
facilitating recall than single cues. The main conclusion was that 
specific retrieval cues facilitate recall if and only if the information 
about them and about their relation to the TBR words is stored at 
the same time as the information about the membership of the TBR 


words in a given list. 


When a person studies a list of to- 
be-remembered (TBR) words with the 
intention of recalling them at a later 
time, appropriate mnemonic informa- 
tion is stored in his memory. This 
stored information is used at the time 
of attempted recall to reproduce the 
original input. The success of recall, 
broadly speaking, depends on two fac- 
tors: the amount and organization of 
the relevant information about the 
TBR words in the store at the time 
of attempted recall (availability of in- 
formation), and the nature and num- 
ber of retrieval cues which provide 
access to the stored information (acces- 
sibility of information ; Mandler, 1967 ; 
Tulving & Pearlstone, 1966). The 
distinction between these two factors 
can be demonstrated under conditions 
where different groups of Ss are 
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Education, as well as to the principals, 
teachers, and pupils in 10 Scarborough 
schools for their cooperation. 


treated identically (given identical in- 
structions, presented with identical ma- 
terial, asked to engage in identical ac- 
tivity interpolated between input and 
output, etc.) up to the beginning of 
the recall period, and then provided 
with different kinds of retrieval cues. 
The availability of relevant information 
is equal for different groups under 
these conditions and consequently any 
variation in recall must be attributed to 
differences in accessibility of this in- 
formation. 

An earlier experiment in the present 
series (Tulving & Pearlstone, 1966) 
provided an experimental demonstra- 
tion of the distinction between avail- 
ability and accessibility. The Ss were 
presented with lists of TBR words 
which they had to memorize. At input 
the TBR words were accompanied by 
the names of conceptual categories of 
which the words were members. 
When these category names were given 
to Ss at output as retrieval cues, Ss 
recalled more words than when no 
experimentally manipulated retrieval 
cues were present at output. This 
finding demonstrates that retrieval de- 
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pends upon the completeness of rein- 
statement, at the time of output, of the 
stimulating conditions present at the 
time of input (Melton, 1963), but it 
does not provide much insight into 
the underlying mechanisms. 

The experiment reported in this pa- 
per was designed to provide some em- 
pirical evidence relevant to specula- 
tions about the nature of these mecha- 
nisms. This evidence takes the form 
of answers to four specific questions. 
First, is it possible for cue words that 
are only weakly associated with the 
TBR words to facilitate recall of TBR 
words? In the Tulving and Pearl- 
stone (1966) experiment, the average 
frequency of occurrence of TBR words, 
as responses, to category names as 
stimuli, according to the Connecticut 
Restricted Word Association norms 
(Cohen, Bousfield, & Whitmarsh, 
1957) was 6.5%, and in every case 
the connection between the category 
name and the TBR word was quite 
obvious. What would happen if the 
strength of the associative connection 
is weaker and less obvious? 

Second, given that a retrieval cue 
is effective if it is present both at input 
and at output, is it equally effective 
if it is provided to S only at the time 
of attempted recall of the TBR word? 
It is conceivable that a preexperimental 
associative bond between the cue and 
the TBR word is sufficient to make the 
retrieval cue effective. On the other 
hand, it may be necessary that in- 
formation about the relation of the re- 
trieval cue to the TBR word be spe- 
cifically stored at the time of the input 
of the TBR word. 

The third question is related to the 
second: If a TBR word is paired with 
a certain cue at input and its recall 
then tested in presence of a different 
but preexperimentally equivalent cue, 
would such a changed cue also be as 
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effective as the original cue? To the . 
extent that the effectiveness of a re- 
trieval cue depends on the existence 
of a preexperimental associative bond 
between the cue and the TBR word, 
the changed cue would still be expected 
to facilitate recall. If, on the other 
hand, information about the relation 
of the cue to the TBR word has to be 
stored at the same time as information 
about the TBR word, the changed cue 
would not be expected to be effective. 

The fourth and final question has 
to do with the effectiveness of double 
cues. Given A and B as two dif- 
ferent but associatively equivalent cues 
for a given TBR word, would the pre- 
sentation of both A and B at input as 
well as at output produce greater fa- 
cilitation of retrieval of the TBR word 
than each of them separately? Some 
data from an experiment on tachisto- 
scopic identification of words—selec- 
tion of words from the long-term store 
—would lead one to expect that double 
cues are more effective than single 
cues (Tulving, Mandler, & Baumal, 
1964). It is also conceivable, however, 
that even though E presents two words 
as both input cues and retrieval cues 
for a given TBR word, S may treat 
the two words as a single unit, compar- 
able to a single-word cue, and no 
greater facilitation of recall would 
Occur. 


METHOD 


Design.—Lists of 24 TBR words were 
presented to Ss for study and subsequent 
recall on a single trial. Four input con- 
ditions were combined factorially with five 
output conditions, except that one input con- 
dition was associated with only four output 
conditions. The input conditions were: 
(a) the TBR words were presented alone 
(Input Cond. 0), (b) each TBR word was 
accompanied by Cue A (Input Cond. A), 
(c) each TBR word was accompanied by 
Cue B (Input Cond. B), and (d) each TBR 
word was accompanied by both Cues A and 
B (Input Cond. AB). The output condi- 
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tions were: (a) noncued recall of TBR 
words (Output Cond. 0), (b) recall of 
TBR words in presence of Cues A (Out- 
put Cond. A), (c) recall of TBR words in 
presence of Cues B (Output Cond. B), 
(d) recall of TBR words in presence of 
both Cues A and B (Output Cond. AB), 
and (e) free recall of TBR words and of 
all cues shown at the time of input (Out- 
put Cond. WC). The Output Cond. WC 
vas not used in conjunction with Input 
Cond. 0, since in the latter condition no 
cues had been presented. Thus there was 
a total of 19 experimental treatment com- 
binations. Each combination can be desig- 
nated in terms of its input and output condi- 
tions. Thus, for instance, Cond. 0—0 was 
a standard free recall condition—TBR words 
shown alone and tested in absence of any 
cues; in Cond. B-AB each TBR word 
was presented in presence of Cue B and 
tested for recall in presence of both Cues A 
and B; in Cond. AB-WC each TBR 
word was presented in presence of both 
Cues A and B and S had to recall TBR 
words and both Cues A and B in absence 
of any experimentally provided aids to re- 
trieval, etc. 

Independent groups of Ss were used in 
each of the 19 treatment combinations. The 
design is reflected in the organization of the 
data in Table 1. 

Materials—Two equivalent lists of TBR 
words (I and II) were used to provide for 
sampling of materials. Each list consisted 
of 24 words selected from among the stimu- 
lus words in the Minnesota Free Associa- 
tion Norms (Russell & Jenkins, 1954). For 
each TBR word two other words were se- 
lected to serve as cues, These were low- 
frequency responses from the associative 
hierarchy of each TBR word. Each cue 
had been given as a response to the stimu- 
lus words in the Minnesota norms by fewer 
than 1% of Ss. Of the two cues thus 
selected for each TBR word, one was arbi- 
trarily labelled as Cue A and the other as 
Cue B. The two sets of cues, A and B, 
for each list were thus approximately 
equally related to the TBR words. Some 
examples of A and B cues and TBR words 
are: fat, LEc-MUTTON; village, pIRTY— 
CITY; dark, crrx—SHORT ; body, viGoR— 
HEALTH; empty HURT—STOMACH; 
emblem, soAR—EAGLE. 

Independently of the present experiment, 
free association data were collected for cue 
words of this experiment as stimuli. The 
Ss were 278 high school students in the 
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same school system in Metropolitan Toronto 
where the present experiment was carried 
out. One-third of these Ss were presented 
with 48 A cues, another third with 48 B 
cues, and the remaining one-third with each 
pair of corresponding A and B cues pre- 
sented simultaneously (double cues). The 
Ss were instructed to write down for each 
single or double stimulus word some other 
word that the stimulus word or words made 
them think of. The observed proportions 
of responses corresponding to the TBR 
words used in this experiment (the total 
number of TBR words given to their re- 
spective cues, divided by the product of 
number of words and number of Ss) were 
as follows: A cues—.011, B cues —.015, and 
double cues—.025. 

The Ss’ assignment to experimental con- 
ditions—The Ss were 674 boys and girls 
from 23 eighth grade classes in 10 different 
elementary schools in the Metropolitan 
Toronto area. (Original design called for 
testing of Ss in 24 classes, but Ss were lost 
in 1 class because of apparatus failure) 
They were tested in intact groups in their 
own classrooms, classes varying in size from 
22 to 36. Each of the four input conditions 
was administered to 6 classes, each in a 
different school. Three classes in each 
input condition were tested with List I, 
and 3 with List II. (Because of the 
apparatus failure, Input Cond, 0 was given 
to only 2 classes with List I). In each 
class, Ss were assigned to the five output 
conditions (four in the case of Input Cond. 
0) on the basis of a random distribution of 
five (four) kinds of recall booklets to the 
seats in the classroom. Thus, each of the 
19 experimental groups, corresponding to 
the 19 treatment combinations, consisted of 
Ss from 6 different classes, each in a dif- 
ferent school. The sizes of the 19 experi- 
mental groups varied 32-39, with a median 
size of 36. 

Procedure.—Upon entering each classroom 
recall booklets were first distributed. These 
booklets contained pages for Ss' recording 
of their recall, different sections of booklets 
being of different colors and separated by 
blank pages. The E then gave some general 
information to Ss about the study of memory 
conducted by the psychology department of 
the University of Toronto and about the 
nature of the task, explaining how the ma- 
terial to be "learned" was to be presented, 
how Ss were to write their recall at the 
proper time in their recall booklets, and how 
they were to work independently. A prac- 


596 


tice list, consisting of 24 adjectives as used 
by Tulving and Pearlstone (1966) was then 
administered. Words were projected on the 
Screen by means of a film-strip projector, at 
the rate of 2 sec/word, and Ss were allowed 
3 min, to record their recall in the booklets. 
The Ss were then asked to turn to the next 
(blank) page in the recall booklet and were 
given study instructions with respect to the 
“second part of the experiment,” memoriza- 
tion of the experimental list proper. 

Study instructions varied according to 
input conditions. The Ss were told that they 
had to study and try to remember words 
projected in capital letters on the screen. 
In Input Cond. A, B, and AB, they were 
told that each of the capitalized words was 
to be accompanied by another word which 
was “somehow related to the capitalized 
word. . . .” (two other words in Cond. AB) 
typed in smaller case letters above the capi- 
talized word. They were told that although 
their memory was going to be tested only for 
words typed in capital letters (TBR words) 
they should also pay attention to the words 
typéd in lower case letters (cue words), 
because these “may help you to remember 
the capitalized words later on.” They were 
also told to try “to see how each word and 
its accompanying cue (cues) are related.” 

The list was projected on the screen in 
front of the classroom, at the rate of 5 sec/ 
frame in all input conditions. Each frame 
contained one TBR word in capital letters, 
and, in Input Cond. A, B, and AB, one or 
two cue words in lower case letters above 
the TBR word. After all 24 frames had 
been shown, Ss were asked to turn to the 
next page in their recall booklet, read the 
recall instructions on the top of the page, 
and then to record their recall. Recall in- 
structions in the booklets varied according 
to the output condition to which S$ had been 
assigned. The first sentence in all instruc- 
tions was, "Now write down all the capi- 
talized WORDS you remember, and the 
final sentence was, "The important thing is 
to get as many WORDS correct as pos- 
sible.” Some examples of the rest of the 
instructions follow: 


Condition 0—0: “write them down in 
any order you like. . . ." 

Conditions 0—A and 0—B: “the 
words you see written on this and on 
the following sheet may help you to 
remember the WORDS since each of 
them is related to one of the capitalized 
WORDS. If you can, put each of the 
WORDS you remember opposite the 
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word to which it is related. If you find 
this too difficult, however, put down the 
WORDS you remember anyhow, any- 
where on this sheet or on the follow- 
ing sheet... .” 

Conditions A-A and B-B: “the cue 
words you saw are given on this 
sheet and on the following sheet. If 
you can, write down each WORD on 
the line opposite the cue word. If you 
remember a WORD but you do not re- 
member which cue word it went with, 
put it down anyhow, anywhere on this 
sheet or the following sheet. . . ." 

Conditions A-B and B-A: "each of 
the words you see on this sheet and 
on the following sheet is related to one 
of the capitalized WORDS in more or 
less the same way as the cue words 
which appeared on the slides together 
with the WORDS you had to remem- 
ber. If you can, put each of the 
WORDS you remember opposite the 
word to which it is related. If you find 
this too difficult, however, put down the 
WORDS you remember anyhow, any- 
where on this sheet or the following 
sheet; IEAn 

Conditions A-WC and B-WC: "write 
them down in any order you like 
in the blank spaces on the right hand 
side of this sheet and the following sheet. 
Tf you can, you should also write down 
as many of the cue words as you remem- 
ber. Write the cue words in the blank 
spaces on the left side of this sheet and 
the following sheet. If you can, write 
each capitalized WORD on the line op- 
posite the cue word that went with it. 
But if you do not remember the con- 
nection between a WORD and its cue 
word, write both of them anyhow, any- 
where on this sheet or on the following 
sheet... aai 


The 24 words in each of two lists were 
shown in the same constant order to all 
Ss. In Output Cond. 0, the recall sheets in 
the booklet consisted of 24 consecutively 
numbered lines on two successive pages. 
In Output Cond. WC, there were two col- 
umns of 24 numbered lines, the left one for 
cue (cues) and the right one for TBR 
words. In Output Cond. A, B, and AB, the 
cues were shown in the same order in 
which their corresponding words had ap- 
peared in the input list. 

In each group, 6 min. were allowed for 
recall. Recall booklets were then collected 
and Ss permitted to ask questions they had 
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about the experiment, problems of memory, 
or psychology. Finally, Ss were asked not 
to discuss the experiment with other students 
at the school until the next day and reasons 
for this request were given. 


RESULTS 


The mean number of words correctly 
recalled from the practice list of 24 ad- 
jectives was 6.19 for all 674 Ss. The 
same list administered under compar- 
able conditions to 948 high school Ss 
in the Tulving and Pearlstone (1966) 
study had yielded a mean recall score 
of 9.48. This difference indicates that 
the authors are here dealing with a dif- 
ferent population of Ss and that there- 
fore no direct comparisons of the main 
data between the two experiments 
would be very useful. The mean 
scores on the practice list for different 
experimental groups varied from 5.74 
to 6.69. 

The primary data are provided by 
the mean number of TBR words cor- 
rectly recalled, “Lenient” scoring was 
used throughout. The S was given 
credit for recall of a TBR word regard- 
less of whether or not the word was 
paired with its appropriate cue. 

The data on correct recall of TBR 
words were pooled over Lists I and IT 
and are summarized in Table 1. Table 
1 shows the number of Ss in each ex- 
perimental group (2), the mean num- 
ber of TBR words recalled (M), and 
the standard deviation of the distribu- 
tion of the correct recall scores (SD). 
The last column in Table 1, Output 
Cond. WC, also shows the mean num- 
ber of cue words correctly recalled. 
Thus, for instance, the entry 845 + 
5.88 in the cell corresponding to Cond. 
A-WC means that the mean number 
of TBR words recalled was 8.45 and 
the mean number of cues recalled was 
5.88. The figure for the standard 
deviation, however, applies only to the 
distribution of TBR word scores. 
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Table 1 contains all the data needed 
to answer the four questions posed in 
the introduction. First, is it possible 
for cue words that are only weakly 
associated with the TBR words to fa- 
cilitate recall of TBR words? The 
answer to the question is affirmative. 
When single cues were present at 
input, cued recall (Cond, A-A and 
B-B) was approximately 70% higher 
than noncued recall (Cond. A-0 and 
B-0). With the data pooled over 
Input Cond. A and B, the two means 
are 14.93 and 8.73, respectively. 

Second, does an otherwise effective 
retrieval cue still facilitate recall if it 
is provided to S only at the time of 
output? This question receives a 
negative answer. When single or 
double cues were present at output, 
but not at input (Cond. 0-A, 0-B, and 
0-AB), recall was lower than in the 
absence of cues at both input and out- 
put (Cond. 0—0). With the data 
pooled over relevant conditions, the 
two means are 849 and 10.62, re- 
spectively. 

Third, would a changed cue, pre- 


TABLE 1 


NuMBER or Ss-(n) MEAN NuMBER OF 
Worps RECALLED-(M), AND STANDARD 
DeviaTion-(SD) ror Eacn EXPERI- 
MENTAL CONDITION 


Output Condition 


Input 
Condi- 
n 0 ^ B AB wc 
0 
n 3 38 39 37 
M 10.62 8.39 8.64 8.43 
SD (2.79) | (2.97) | (2.43) | (2.68) 
A 
37 36 36 36 33 
M 9.00 | 14.94 6.94 | 14,81 | 8.45+5.88 
h SD (2.96) | (3.50) | (2.68) | (3.72) (2.87) 
n 36 37 35 37 35 
M 8. 7.95 | 14,91 | 14.84 | 8.86 -6.63 
ACA (2.94) | (3.23) | (4.14) | (4.76) (2.72) 
" 32 34 34 33 32 
M 9.06 | 11.24 | 11.79 | 14.33 | 8.31-+5.91 
SD (4.08) | (3.69) | (3.86) | (4.05) (3.15) 


Note.—For explanation of entries in column WC, 
see the text. 
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experimentally equivalent to the cue 
paired with the TBR word at input, 
be as effective as the original cue? 
This question is also answered nega- 
tively. Recall of TBR words in Cond. 
A-B and B-A was not only consider- 
ably lower than in Cond. A-A and 
B-B (means of 7.45 and 14.93, re- 
spectively), but also lower than in 
Cond. A-0, A-WC, B-0, and B-WC 
(pooled mean of 8.70) in which the 
cue present at input was simply omitted 
at output. 

Fourth, does the presentation of two 
cues at input as well as at output fa- 
cilitate recall of TBR words to a 
greater extent than do the single cues? 
Again the answer is negative. The 
mean number of TBR words recalled 
in Cond. AB-AB was 14.33, while in 
Cond. AB-0 and AB-WC it was 8.68. 
Thus, with input conditions held con- 
stant, recall in presence of two retrieval 
cues per TBR word was approximately 
65% higher than in absence of these 
cues. The facilitation of recall by 
double cues thus is of the same order 
of magnitude as that observed in the 
case of single-cue input conditions 
(70%). 

Statistical analyses, in the form of 
one-way analyses of variance within 
each input condition and subsequent 
contrasts of individual means using 
Scheffé’s method, were found to sup- 
port all of the above statements at least 
at the .05 level of significance and will 
not be reported in detail. Some addi- 
tional observations, however, may be 
of interest and will. be mentioned 
briefly. 

In Cond. A-A and B-B, e.g., over 
96% of all TBR words correctly re- 
called were paired with their cues on 
recall sheets, while in Cond. 0-A and 
0-B only 39% of recalled words, and 
in Cond. A-B and B-A 34% were so 


paired. The two latter figures are 


ENDEL TULVING AND SHIRLEY OSLER 


probably inflated because of the cor- 
respondence between input positions 
of words and the ordering of cues on 
the recall sheets. These data again 
demonstrate that otherwise potent re- 
trieval cues are quite ineffective in facili- 
tating recall if they are not presented 
together with the TBR words at input. 

When two cues were presented with 
each TBR word at input, but recall was — 
tested in presence of only one of those 
cues (Cond. AB-A and AB-B), re- 
call was lower than in presence of both 
cues (Cond. AB-AB). This finding 
is reminiscent of stimulus selection in 
paired associate learning (Underwood, 
1963), although alternative interpreta- 
tions are possible. 

When TBR words were accom- 
panied by cues at input but had to be 
recalled in absence of any cues, recall 
was approximately equal in Output 
Cond. 0 and WC. The mean recall of 
TBR words in Cond. A-0, B-0, and 
AB-0 was 8.83, and in Cond. A-WC, 
B-WC, and AB-WC it was 8.55. 
Thus, the requirement that S retrieve 
cue words in addition to TBR words 
produced the same level of recall as the 
requirement that S select for recall 
only the TBR words from the total 
input. The lower recall of cues than 
of TBR words in Output Cond. WC, 
however, does suggest that Ss paid less 
attention to cues than to TBR words 
at input, i.e., that they believed E's 
instructions. Of all the cues recalled, 
93% were paired with their corre- 
sponding TBR words. 

Finally, the number of repetitions 
of TBR words was negligible (674 Ss 
gave a grand total of 106 repetitions), 
but the frequency of extralist intrusions 
was relatively high. In the seven non- 
cued recall conditions the mean num- 
ber of extralist intrusions per S was 
.58, while in the 12 cued recall condi- 
tions this average was 1.77. 
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DISCUSSION ` 


An earlier experiment in the present 
series (Tulving & Pearlstone, 1966) 
demonstrated that category names of 
TBR words can serve as effective re- 
trieval cues. In that experiment cate- 
gory names accompanied the TBR words 
at input, and recall of TBR words was 
tested either in presence of category 
names (cued recall) or in absence of 
these cues (noncued recall). Cued re- 
call was higher—in some cases consider- 
ably higher—than noncued recall. A part 
of the design of the present experiment 
replicated the paradigm used by Tulving 
and Pearlstone, namely Cond. A-A and 
B-B (cued recall) and A-0 and B-0 
(noncued recall). The data from this 
part of the experiment fully corroborated 
the earlier findings. 

Some other evidence available in the 
literature (Earhard, 1967a, 1967b; Tul- 
ving, 1962) indicates that initial letters 
of TBR words can also function as po- 
tent retrieval cues. In these experiments, 
the multitrial free-recall paradigm was 
used, and Ss were instructed to think 
about the intial letters of TBR words 
at input and to generate the letters of 
the alphabet as retrieval cues on their 
own at output. Such "alphabetic recall" 
was found to be higher than free recall. 
In addition, several smaller experiments 
the authors have done at Toronto have 
shown that synonymic cues of TBR 
words —(sENT—hwisted, BRIDGE—bond, 
sort—pliable, etc.), as well as descrip- 
tions of graphemic features of TBR 
words (a long word—understanding, a 
word ending in ly—intimately, a word 
with a double consonant in the middle— 
summer, etc.), also facilitate recall if 
the cues are present both at input and at 
output. It thus appears that a wide va- 
riety of experimentally manipulable spe- 
cific retrieval cues that are meaningfully 
related to the TBR words can provide 
access to stored information about the 
TBR words that is available but not 
accessible under the noncued recall con- 
ditions. 

While the meaningfulness of the con- 
nection between the cue and the TBR 


word—the meaningfulness obviously 
being determined by Ss' preexperimental 
knowledge of the language—may be a 
necessary condition for the effectiveness 
of retrieval cues, it does not seem to be 
a sufficient condition. It may be neces- 
sary in that a random pairing of cues 
and TBR words will probably not en- 
hance cued recall when compared with 
noncued recall, and it is not sufficient 
in that the presence of cues only at out- 
put (Cond. 0-A, 0-B, and 0-AB of 
the present experiment) does not facili- 
tate recall of TBR words. The overall 
pattern of the data reported in this paper 
was completely consistent in showing that 
whenever the cues accompanied the TBR 
words at input, their presence at output 
facilitated recall, and whenever they were 
absent at input, their presence at output 
did not serve any useful purpose. In 
fact, the presence of cues only at output, 
or changing of cues from input to output, 
appeared to interfere with recall of the 
TBR words. This phenomenon may 
merit further study, but for the present 
purposes the important finding is the lack 
of recall facilitation by cues presented 
to Ss for the first time at the time of 
recall. This finding, in conjunction with 
the finding that the same cues were quite 
effective when presented at both input 
and output, suggests that specific re- 
trieval cues facilitate recall if and only 
if the information about them and about 
their relation to the TBR words is stored 
at the same time as the information about 
the membership of the TBR words in a 
given list. The authors would like to 
offer this suggestion as the main con- 
clusion of the present experiment. 

At first blush, this conclusion may ap- 
pear to be inconsistent with the results 
of experiments (e.g, Bilodeau & Blick, 
1965; Fox, Blick, & Bilodeau, 1964; 
Lloyd, 1964) in which retrieval cues have 
been provided to Ss only at the time of 
recall and which have showed such re- 
trieval cues to facilitate recall. The in- 
consistency disappears, however, if it is 
remembered that appropriate coding of 
input words may take place even if E 
does not explicitly suggest to S how he 
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is to code the TBR items, that is, what 
additional information he has to store 
with the TBR item at the time of input. 
If the TBR word is bulb—to use an ex- 
ample given by Bilodeau and Blick 
(1965)—at least some Ss are quite likely 
to think of it as something to do with 
light. If "light" is then presented by E 
as a retrieval cue, it is effective for those 
Ss in the same way as it would have been 
if it had been presented together with 
bulb at input. 

Thus, if E leaves S free to code the 
input subjectively, or lets S make his 
own differential responses to stimuli 
(Postman, Adams, & Phillips, 1955), the 
effectiveness of specific retrieval cues 
provided by E at output presumably de- 
pends on the extent of the overlap be- 
tween the cues and such subjective cod- 
ing responses that have occurred at in- 
put. Experimental manipulation of cues 
at the time of the presentation of TBR 
items simply restricts the ways in which 
various Ss code the input and thus pro- 
vides E with greater control over what is 
stored, but the underlying mechanisms 
are probably the same in both cases. Re- 
gardless of whether § codes the TBR 
items subjectively or follows the sug- 
gestions for coding given by E in the 
form of input cues, a retrieval cue is 
effective only if the information about it 
and its relation to the TBR ‘item is 
stored at the same time with the TBR 
items. This conclusion is quite consist- 
ent with the principle that retrieval de- 
pends upon the completeness of reinstate- 
ment of original stimuli at the time of 
recall (Melton, 1963). 

Finally, a few words about the effec- 
tiveness of double cues. To the extent 
that specific retrieval cues provide access 
to the information about TBR items not 
accessible in absence of such cues, one 
might expect that the use of multiple 
retrieval cues would lead to more effec- 
tive recall performance than the use of 
single cues. The attempt to demonstrate 
this relation in the present experiment, 
however, ended in failure. Double-word 
cues were found to be no more effective 


than single-word cues, even though 
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each member of the double-word cue 
was shown to be quite effective in 
other parts of the experiment, and even 3 
though the double-word cue produced a — 
somewhat higher frequency of "correct 
guesses" in the free-association task. Tt 
looks as if Ss in the present experiment 
treated the double-word cue as a single 
unit of information which was as potent 
in effecting retrieval of the TBR word 
as was a single-word unit. It is still 
conceivable, however, that double cues 
are more effective than single cues if the 
two members of the double cue are pre- 
sented separately, rather than simultane- 
ously, at input. Bevan, Dukes, and 
Avant (1966) have shown, in the termi- 
nology of the present paper, that non- 
cued recall of TBR words repeated within 
a list is higher for words accompanied 
by multiple cues than for words accom- 
panied by single cues. The same phe- 
nomenon may well hold for cued recall 
as well, 
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For certain stimulus sequences in 2-choice discrimination learning, 
hypothesis theory yields 2 counterintuitive predictions: During both 
presolution performance and insoluble problems (a) the probability 
of a correct response will be suppressed below .5 and (b) the suppres- 
sion will be greater for better information processors. In 2 experi- 
ments, 1 with 4-dimensional and 1 with 8-dimensional stimuli, both 


predictions were confirmed. 


Levine (1966), in an analysis of 
hypothesis behavior during discrimina- 
tion learning, employed stimuli varying 
in form, color, size, and location (cf. 
Fig. 1). The sequence of stimuli were 
internally orthogonal, ie., during four 
consecutive trials each letter was black 
twice, large stimuli were on the right- 
hand side twice, etc. The four stimuli 
of Fig. 1 are counterbalanced in this 
way. Levine, Miller, and Steinmeyer 
(1967), using these stimuli, discovered 
an unexpected property in the choice 
behavior of adult humans: prior to the 
trial of the last error ((TLE) the prob- 
ability of a correct choice was .38, sig- 
nificantly less than .5. 

It is possible to show that the com- 
monly held assumptions of hypothesis 
(H) theory predict this performance 
suppression. These assumptions are :* 


1. The S, in trying to solve a prob- 
lem, selects an H from some set and 
responds on the basis of that H. For 
example, S may predict that “large” is 


1This research was supported by United 
States Public Health Service Grant MH 
11857-01. 

2 Now at the University of Connecticut. 

3 Although these are listed here as assump- 
tions they all have direct empirical support. 
See Levine (1966) and Levine et al. (1967) 
for the experimental results. 


the correct basis for choosing. He 
would, then, choose the left side of 
the top stimulus of Fig. 1, or the right 
side of the second stimulus, etc. 

2. Given demonstration problems 
and appropriate instructions, S samples 
only from the Hs corresponding to the 
stimulus levels. Thus, S predicts that 
large, or x, etc., is correct. He does 
not employ Hs about conjunctions of 
cues or about complex sequences. 

3. During presolution responding S 
samples neither the correct H nor the 
other H on the same dimension, Thus, 
if “right side” is the solution, before 
the TLE S tries neither the H right 
side nor left side. Before the last 
error S may be regarded as sampling 
from, at most, the remaining six Hs. 

4. If S's response is followed by a 
positive outcome (e.g, E says “cor- 
rect") he keeps his H for the next trial. 

5. If S's response is followed by a 
negative outcome (eg, E says 
"wrong") he abandons his last H and 
resamples. He might do this in several 
ways, all of them falling along a con- 
tinuum of amount remembered: (a) 
the Reject-Ü assumption: at one ex- 
treme S may remember nothing, not 
even the H he just rejected. He 
would, in effect, return this H to the 
set and, before the TLE, resample from 
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the full set of six Hs. This is the 
assumption made by Restle (1962) and 
by Bower and Trabasso (1964); (b) 
the Reject-1 assumption: he may re- 
member that this one H is wrong and 
resample without replacement, i.e., 
from among five Hs; (c) the Reject-3 
assumption: he might notice that the 
negative outcome following his re- 
sponse logically disqualified as many 
as three Hs. Following any error he 
resamples from a reduced, but fixed, 
number of Hs. This is similar to the 
assumption by Trabasso and Bower 
(1966); and (d) the Reject-all 
assumption: in addition to the Hs dis- 
qualified by a “wrong,” he might re- 
member additional Hs disqualified on 
earlier trials and could, therefore, re- 
duce the set still further. At the ex- 
treme his memory would be perfect 
and he would sample only from the set 
of logically correct Hs. This would 
be the performance of focusers as de- 
scribed by Bruner, Goodnow, and Aus- 
tin (1956, pp. 129-134). 

Applying this theory to problems 
with internally orthogonal stimuli 
yields the prediction that S will be cor- 
rect on less than 50% of the presolu- 
tion trials (see the Appendix). This 
prediction holds for all variants of 
Assumption 5. A second prediction is 
that the better S’s memory, as char- 
acterized by the scale of Assumption SB 
the worse he will do. These predic- 
tions are represented by the family of 
theoretical curves in Fig. 2. The pres- 
ent experiments were designed to test 
these two predictions. 

An inadequacy with presolution data 
is that they heavily represent slow 
learning. For example, protocols with 
no or only one error cannot be used. 
A better technique, which does not ex- 
clude data and for which the five 
assumptions, above, are applicable, is 
to rely on insoluble problems. During 
such problems the solution depends 
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. [x T] 
- [r£ X] 


STIMULI 


Fic. 1. 
stimuli in which each level of every dimen- 
sion appears exactly twice with each level 
of every other dimension (e.g, T is on the 
left twice, large figures are white twice, 
etc.). 


A sequence of four-dimensional 


upon a hidden dimension, ie. upon 
one which is not represented in S's 
H set. All Ss are in the presolution 
stage throughout the whole problem. 


EXPERIMENT I 


During this experiment the hidden 
dimension was established by instruc- 
tions. The S was told only about the 
form, color, and size dimensions. 
Nothing was said about the positions 
as possible solutions. Also, to en- 
hance the instructions, several prob- 
lems were first presented in which one 
of the six stipulated Hs was the solu- 
tion. It was felt that after these in- 
structions and preliminary problems S 
would have an H set consisting of the 
six instructed Hs (cf. Assumption 3). 

A methodological problem exists 
with this technique. During an insol- 
uble problem an S could conceivably 
learn and remember that all six Hs 
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were disconfirmed. In other words, he 
could, after a few trials, have an empty 
set. Two problems with this situation 
are that H theory has no assumptions 
to cover it and, less formally, that S' 
would recognize that the rules given 
in the instructions were not holding. 
To avoid such perfect information 
processing by S the trials, rather than 
being S-paced, were presented at a 
2-sec. rate. This turned out to be a 
satisfactory solution to the problem. 


Method 


Subjects.—Forty-four undergraduates ser- 
ved as S's. Twenty of these were volunteers 
Írom the beginning psychology course; 24 
were paid Ss attending the summer session. 

Materials—A Kodak Carousel slide pro- 
jector presented the stimuli. An Atcotrol 
timer controlled the projector so that slides 
could be presented at a 2-sec. rate. Slides 
were constructed of four-dimensional stimuli 
similar to those shown in Fig. 1. The 
slides were arranged into sets of eight, each 
such set comprising the materials for a 
problem. The eight slides were grouped 
into two sets of four internally orthogonal 
stimuli so that they had these characteristics 
throughout: no dimension could be corre- 
lated with any other dimension for more 
than two trials and only one pair of dimen- 
sions was so correlated for two trials (cf. 
Fig. 1). 

Opaque slides were placed after each set 
of eight to mark the end of one problem 
and the start of the next. 


PQ) .30 
20 SS 
H^ \ Perfect processing, (REJECT-ALL) 
WdNECNENEDEUOUTDOT 8 
TRIAL. 
Fic. 2. Theoretical (dashed lines) and 


obtained (solid line) curves of performance 
during insoluble problems. 
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Procedure.—A sample slide was shown 
and three dimensions, the two letters, the 
two sizes, and the two colors, were pointed 
out (the two positions were never men- 
tioned). The S was told that he would 
receive some short problems in which a 
series of such slides would be shown, that 
he always had to choose one of the two 
alternatives by saying one of the two letters, 
and that he would always be informed 
whether he had made the correct response. 
The correct response, he was told, would 
always depend upon one of the six attrib- 
utes. He was further instructed that the 
slides would appear at a fast rate and that 
he was to respond rapidly. Three sample 
problems were then presented, the three 
described dimensions each contributing one 
of the solutions. After these three prob- 
lems S’s questions were answered and the 
six solutions reiterated. 

The experimental problems followed. The 
first five had legitimate solutions, the sixth 
had a position solution. Following this, 
five more series of six problems each were 
presented, five with a legitimate solution, 
one with a position solution. There were, 
then, 30 legitimate eight-trial problems and 
6 insoluble eight-trial problems (Four Ss 
solved one of these 6 insoluble problems. 
These were discarded. The next experiment 
eliminates the need for even this small de- 
gree of selection.) The 2-sec. rate posed 
no problems. The Ss virtually always re- 
sponded (the probability of no responses 
was less than .01) and E always said "right" 
or "wrong" within the 2-sec, interval. 


Results 


Figure 2 shows three theoretical cur- 
ves and the obtained curve. The theo- 
retical curves embody the two predic- 
tions: (a) all versions predict below 
chance responding starting at Trial 2, 
and (b) the better the processing the 
worse the predicted performance. 
Note, also, that the curve for the Re- 
ject-all assumption stops at Trial 3. 
At this trial the perfect processor, 
having eliminated all six Hs, will face 
an empty set. The H theory has no 
assumptions for this contingency. 

The first prediction is confirmed by 
the data in Fig. 2. The obtained curve, 
averaged for all 40 Ss on the six insol- 
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uble problems, clearly shows below 
chance responding and tends to fluctu- 
ate between the Reject-0 and Reject-3 
condition. 

The second prediction may be tested 
by comparing the function for the best 
and poorest information processors. 
The best and worst may be independ- 
ently specified on the basis of per- 
formance during the soluble problems. 
Those 10 Ss showing the most solu- 
tions and those 10 showing the fewest 
solutions during the soluble problems 
were separated out. Performance on 
the insoluble problems was then plotted 
separately for the two groups. As 
demonstrated in Fig. 3 the better Ss 
show greater performance suppression 
on Trials 2 and 3 than the poorer Ss. 
This difference, significant at the .01 
level for each of the two trials, verifies 
the second prediction. Note also that 
the better Ss are virtually perfect in- 
formation processors (cf. the Reject-all 
curve in Fig. 2). This means that 
after the first two trials they rejected 
all but one H, the one correlated with 
the correct position on Trials 1 and 2. 
This H, of necessity, leads to an error 
on the third trial and is also discon- 
firmed. These Ss, then, by Trial 3 
have eliminated all six Hs as possible 
solutions. If one assumes that S 
“recycles” and starts checking the six 
Hs again, it is apparent that he does 
so with reduced efficiency. From Trial 
4 on, his performance is like that of the 
poorer Ss. 


ExPERIMENT II 


The preceding experiment verified 
the two predictions of H theory but 
had certain imperfections. First, be- 
cause six-H problems are so simple 
for college students, a compensatory 
difficulty was added by presenting the 
stimuli at a peculiarly high (2 sec.) 
rate. Second, the assumption that one 
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PG). 


Trial 


Fic. 3. Performance by the 10 best and 
the 10 worst Ss (defined by the number 
of soluble problems solved) on the insoluble 
problems. 


could eliminate a dimension by instruc- 
tions alone was not entirely perfect ; 
four Ss, it will be recalled, solved an 
insoluble problem. Finally, the experi- 
ment was run with stimuli for which 
the prediction of performance-suppres- 
sion was already found to hold (Le- 
vine et al, 1967). Some demonstra- 
tion of the generality of the effect is 
clearly desirable. These three con- 
siderations suggested the use of prob- 
lems with more dimensions. These not 
only permit presentation of the stimuli 
at a more usual pace but change the 
stimuli and stimulus sequences with 
which the phenomenon was discovered. 
Finaly, by defining the correct re- 
sponse for each trial on the basis of a 
dimension which is subsequently re- 
moved, a genuine insoluble problem is 
created. 


Method 


Subjects—Forty-eight students in the 
beginning psychology course served as Ss. 

Materials.—Eight different eight-dimen- 
sional stimuli were constructed from the 
following dimensions: letter (x or z), letter 
size (large or small), letter texture (solid 
or dashed), border (circle or diamond), 
border number (one or two), border tex- 
ture (solid or dashed), star (above or be- 
low the border), and position (left or right). 
The eight stimuli were internally orthogonal 
in that each level of a dimension appeared 
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TRIAL 


Fic. 4. Theoretical (dashed lines) and ob- 
tained (solid line) curves of performance 
during insoluble problems of eight dimen- 
sions (seven visible, one hidden). 


with each other level exactly four times. 
Problems.—Twenty-seven sets of the 
eight stimuli, each set in a different order, 
were used for 27 experimental problems. 
Not only were the eight stimuli internally 
orthogonal but the sequence within a prob- 
lem was restricted so that no levels were 
correlated for more than three consecutive 
trials (e.g, circles would not be dashed, 
large letters would not be on the right 
side, etc., for more than three consecutive 
trials). After the creation of these se- 
quences the star was removed and a record 
kept of its location. An insoluble problem 
was one in which the location of the star 
(now a hidden dimension) was the solution. 
Problems 5, 9, 14, 18, 23,.and 27 were 
arbitrarily chosen to be insoluble problems. 
Each of the seven visible dimensions pro- 
vided solutions to 3 of the remaining 21 
problems. Four different counterbalanced 
solution sequences were constructed. 
Procedure.—Twelve Ss were run under 
each solution sequence in groups of two to 
four. Each group first received two four- 
dimensional then two  seven-dimensional 
practice problems. For both types of prob- 
lems a sample slide was shown and all values 
of all dimensions were pointed out. The 
Ss were told that they would receive some 
problems in which they would see a series 
of such slides, that for each slide they were 
to choose one of the pair of stimuli by mark- 
ing the left or right space on their answer 
sheets, and that they would then be told 
which response was correct. They were also 
told that one value of one dimension would 
be the solution and that it would hold 
throughout the problem. In addition to 
identifying the correct response (by say- 
ing left or right at the end of each trial), 
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E announced the solution at the end of 
each of the four practice problems. The 
27 experimental problems were then ad- 
ministered. An opaque slide signified the 
end of each problem, The E did not an- 
nounce the solutions, 


Results 


Despite the change in problem dif- 
ficulty and in the stimulus sequence, 
the results were essentially replicated, 
Figure 4 shows that the percentage 
correct is surpressed on the insoluble 
problems. Figure 5 shows that the 
best Ss (the third of the Ss who solved 
the most soluble problems) were worse 
on insoluble problems than the poorest 
Ss (the third of the Ss who solved the 
fewest soluble problems). The dif- 
ference between the group proportions 
was significant (p < .01) at Trials 3 
and 4. 

An interesting comparison concerns 
the minimum value for the best Ss. 
This is at Trial 3 for the three-dimen- 
sional problems and at Trial 4 for the 
seven-dimensional problems (cf. Fig. 3 
and 5). In a three-dimensional prob- 
lem the perfect processor eliminates 
three incorrect Hs on Trial 1, two on 
Trial 2, and the last one on Trial 3. 
Thus, after Trial 3 the set is empty. 
In a seven-dimensional problem the set 
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Trial 


Fic. 5. Performance of the best and worst 
third of Ss (defined by the number of soluble 
problems solved) on the insoluble problems. 
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becomes empty after Trial 4. The 
data, then, suggest that the set has 
become empty for many of the best Ss 
at the crucial trial. The speculation 
seems warranted that at this point these 
Ss start over with a large set but 
with about the efficiency of the worst 
Ss. 


DISCUSSION 


The present data confirm the two pre- 
dictions of the H theory previously out- 
lined. With internally orthogonal stimu- 
lus sequences presolution (or insoluble 
problem) performance is impaired. The 
better the S the more it is impaired. Since 
this impairment does not occur when ran- 
domized stimulus sequences are employed 
(Bower & Trabasso, 1964; Erickson, Zaj- 
kowski, & Ehmann, 1966) the effect would 
appear to depend upon the restrictions of 
the stimulus sequences, ie, the ortho- 
gonality. With the stimuli of the present 
experiments no dimension could be corre- 
lated with any other dimension for more 
than two (Exp. I) or three (Exp. I) 
trials. In particular, no dimension is 
correlated for very long with the correct 
(hidden) dimension and any H very 
quickly leads to an error. With random- 
ized stimuli irrelevant dimensions are 
correlated with the correct dimension for 
runs of varying length. Thus, one can 
hold the incorrect H and still make the 
correct response for several consecutive 
trials. The number of correct responses 
produced when dimensions are correlated 
for several trials would offset the decre- 
ment in performance when dimensions 
alternate frequently with the correct one. 
This suggests that the level of presolution 
responding found in these and the other 
experiments follows a general rule: the 
more correlated the dimensions with the 
correct (or hidden) dimension the higher 
will be the level of presolution (or in- 
soluble problem) responding. 
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APPENDIX 


The derivation of the presolution (or 
insoluble problem) learning curve depends 
upon the internally orthogonal stimuli, 
the assumptions stated in the introduction, 
and the particular variant of Assumption 
5. Because of the relatively simple transi- 
tion probabilities, the Reject-3 assumption 
and the four-dimensional stimuli will be 
used for the derivation here. The stimuli 
are represented in Fig Al, which shows 
the distributed translation (cf. Glanzer, 
Huttenlocher, & Clark, 1963; Shepard, 
Hovland, & Jenkins, 1961) of the stimuli 
in Fig. 1. The distributed form recodes 
the dimensions by location, the position 
(P) represented in the first column, fol- 
lowed by color (C), form (F), and 
size (S). 

Assume that the position dimension is 
“hidden” (cf. Exp. I) and that the left 
position is always correct. On the first 
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STIMULI 


Fic, Al. The distributed form of the stim- 
uli shown in Fig. 1. (The columns show 
the trial-to-trial changes in the position, P ; 
color, C; form, F; and size, S, dimen- 
sions; the subscripts denote the correspond- 
ing level of the dimension.) 


trial S may sample any of the six non- 
position Hs, three of which are correlated 
with the correct level of the position di- 
mension (ie. leads to a response which 
E calls "right"), and three are followed 
by E saying "wrong." Therefore, the 
probability of a correct response on Trial 
1, P(4), is .5. If S is correct on 
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Trial 1 he will be correct again on Trial 
2 only if (in the example given) he sam- 
pled F, on Trial 1; if on Trial 1 he 
sampled C, or S, he must, according to 
Assumption 1, choose incorrectly on Trial 
2. Because of the Reject-3 assumption 
similar reasoning applies when the re- 
sponse on Trial 1 is incorrect. The S, 
having started with either Co, Fo, or So, 
makes the wrong response, recognizes that 
all three of these Hs are incorrect and 
chooses, for Trial 2, among Cj, F;, and 
S,. Again, only F} will be correct on 
Trial 2. Regardless of the first response 
on Trial 1, therefore, P (+2) — 3. The 
transition probabilities from Trial 2 to 
Trial 3 are the same as those for any 
subsequent pair of adjacent trials. If the 
response is correct on Trial 2, it can only 
be because F, was selected (in general, 
because S must have held on Trial 2 the 
H correct on both Trials 1 and 2). Since 
stimulus levels cannot be correlated for 
more than two trials, P (+3|+.) — 0. 
If the response is wrong on Trial 2 then 
S chooses among C,, F, and S, for 
Trial 3. Again, only one of these (So) 
will be correct on Trial 3. Therefore, 
P (+3|—2) — 3. Similar reasoning shows 
that generally these transition probabili- 
ties hold, ie, P(+p|+n) =0 and 
P (+nyl—n) =}. Given that P (+2) 
= 4, the trial-by-trial probabilities may be 
generated. These lead to the Reject-3 
curve in Fig. 2. 


(Received July 24, 1967) 
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FLICKER THRESHOLD SHIFTS AS A FUNCTION OF 
FREQUENCY OF INTERPOSED. STIMULATION: 


THE LOCAL ADAPTATION PHENOMENON F 


ARTHUR VEGA, J. PAUL COSTILOE, ann OSCAR A. PARSONS 
University of Oklahoma Medical Center 


The impairment of critical flicker frequency (CFF) following stimu- 
lation by a flickering light has been termed “local adaptation,” the 
underlying mechanisms remaining as yet unclear. The present study 
investigates the relation of the adaptation effect to the absolute 
frequency of the adaptation stimulation. 10 normal Ss were presented 
10 frequencies of a flashing light, ranging 5-50 cps, Each presenta- 
tion was preceded and followed by a CFF determination. Degree of 
“local adaptation" or shift was determined by subtracting the post- 
stimulation threshold from the prestimulation measure. Results 
indicated a highly significant treatment effect, of “inverted U” shape, 


with a peak effect at 25 cps. 


The results appear to illustrate the 


retinal component of the local adaptation phenomenon. 


The critical flicker frequency (CFF) 
has been shown to be influenced by 
nearly any stimulus variable E is cap- 
able of manipulating. A phenomenon 
termed “local adaptation" was de- 
scribed by Granit and von Ammon in 
1930, but since then has received little 
systematic study until rather recently. 
This phenomenon refers to the obser- 
vation that OFF is significantly 
lowered (impaired) by exposing the 
eye to an obviously flickering light 
just prior to threshold determination 
(Alpern & Sugiyama, 1961; Ginsburg, 
1966; Simonson, 1959). The early 
techniques used to demonstrate the 
adaptation effect were relatively crude. 
Riddell (1935), eg, required his Ss 
to fixate on a light which was flicker- 
ing 2 or 3 cps below S’s CFF, and 
reported that the light began to appear 
fused or steady within 5 sec. He fur- 
ther illustrated that the phenomenon 
differed from light adaptation, since af- 


iThis study was supported by Public 
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ter fixation of a steady light his Ss were 
again able to detect flicker in the inter- 
mittent stimulus when this was sub- 
stituted for the steady light. Riddell 
was unable to demonstrate a parallel 
effect in recording nerve potentials 
from the eye of a frog, and suggested 
that the phenomenon might best be 
explained by cerebral rather than ret- 
inal mechanisms. 

Other investigators (Alpern & Sugi- 
yama, 1961; Ginsburg, 1966) have 
studied this threshold shift more 
quantitatively by means of pre- and 
poststimulation measures of CFF, and 
have illustrated decreases in CFF of 
2-4 cps. One investigator (Ginsburg, 
1966) has reported the magnitude of 
the adaptation effect to be a function 
of the frequency of the pulsating light 
used as the “adaptation” stimulus. He 
concluded that the adaptation effect in- 
creased as the adapting stimulus was 
lowered in frequency to 20 cps below 
the CFF. 

In most investigations of the local 
adaptation phenomenon, the frequency 
of the stimulation light, when specified, 
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has been determined relative to the 
CFF of the individual S. Alpern and 
Sugiyama (1961), e.g., conducted the 
major part of their study using a light 
flashing 6 cps below the CFF (as well 
as 6 cps above CFF) as the interposed 
or adapting stimulation. Ginsburg em- 
ployed frequencies of 10, 20, and 30 
cps below S’s CFF. 

The abundance of studies of intrinsic 
rhythms of the brain and neuro-retinal 
system has encouraged attempts to in- 
voke such concepts in the interpreta- 
tion of visual and perceptual phenom- 
ena (Bartley, 1958; Bartley & Nelson, 
1963; Callaway, 1960; Lindsley, 
1952). This study represents another 
attempt in this general area, namely to 
obtain data regarding the adaptation 
phenomenon as a function of the ab- 
solute frequency of the adapting 
stimulus. It was an implicit hypothesis 
that a systematic relationship might 
exist between frequency of adapting 
stimulation, magnitude of adaptation 
effect, and some value representing 
cyclic neural activity of the retina and 
brain. 


METHOD 


The equipment consisted of an electronic 
control unit which drove a Sylvania R1131C 
glow modulator tube illuminating a target 
patch subtending 1.96? visual angle at 33 cm. 
from where S viewed the target. The in- 
tensity of the stimulus light was 3.3 ftc., 
and the general level of illumination of the 
room was subdued, being 2.5 ftc. at the point 
where S viewed the light. The control unit 
was accurate to .1 cps throughout its fre- 
quency range of 1-99.9 cps. i 

The Ss were 10 healthy individuals, with 
mean age of 30 yr. (SD=5.1 yr.). All 
but two of the Ss were naive as to the pur- 
pose of the study, and the other two (the 
first two authors) were unaware of the 
results until after they were run as Ss. 

Each S was presented 10 frequencies of 
the adaptation stimulus, each one preceded 
and followed by a CFF threshold deter- 
mination, all involving central binocular 
vision. The degree of “adaptation” or 
shift was determined by subtracting the post- 
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stimulation from the prestimulation measure, 
The 10 frequencies used as the adaptation 
stimulation ranged 5-50 cps in increment 
of 5 cps. Each S was instructed as to the 
three-part nature of each trial, i.e., that he 
would first see a steady light and should 
press his response button as soon as he 
noticed it began to flicker. He was told 
that he then would be presented with a 
light which would usually be flickering (the 
adaptation stimulation) for a period of 15 
sec., following which there would be another 
trial on which he should press the button as 
soon as he noticed the light begin to flicker. Í 
A l-min. rest period was allowed between 
each of the 10 trials, which were admin- 
istered in a Latin-square design to control 
Íor sequence and particularly order effects, 
CFF thresholds were determined by the de- 
scending method of limits (from fusion to 
flicker), with 53 cps taken as the starting 
point and with the control unit automatically 
decelerating at the rate of 1 cps/sec. 


RESULTS 


Analysis of variance revealed a 
highly significant treatment effect (F 
= 849, p < 001) indicating that the 
magnitude of the adaptation effect (low- 
ering or impairment of CFF) is related 
to the frequency of the adaptation stim- 
ulation. Figure 1 illustrates the curvi- 
linear shape of the function peaking at 
25 cps and decreasing rather sym- 
metrically in either direction. Fur- 
ther tests revealed that the curvilinear 
component was highly significant. 


Discussion 


The results of the present study are 
consistent with those of other investi- . 
gators, but in addition illustrate a char- 
acteristic of the “local adaptation” phe- 
nomenon not previously described, The 
authors’ findings agree with those of 
Ginsburg (1966) who found that the 
flicker adaptation effect increases as the 
stimulation frequency is lowered below 
the CFF. This picture however, appears 
to be incomplete since frequencies below 
25 cps are found to be decreasingly ef- 
fective in producing the flicker adaptation 
effect. 

Decrements in CFF greater than 4 cps 
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Fic. 1. Mean adaptation effect as a function of stimulus frequency. 


following exposure to the 25 cps stimulus, 
in contrast to the negligible shifts follow- 
ing 45 cps or 50 cps stimulation, lend 
further support to the contentions of Al- 
pern and Sugiyama (1961) and Mahneke 
(1956) that the descending method of 
limits of CFF determination is to be 
preferred over the ascending method (in- 
creasing frequency from flicker to fusion). 
Use of the ascending method exposes S 
to a relatively long period of intermittent 
stimulation, thereby confounding effects 
of acceleration rate and adaptation effect 
with CFF threshold. 

No explanation is immediately apparent 
for the unexpected curvilinear nature of 
the adaptation function and the maximal 
effectiveness of the 25 cps stimulation. A 
related unexpected finding was that 10 
cps stimulation was one of the least ef- 
fective of the series in producing the 
adaptation effect. Following reasoning 
advanced by Bartley (1958), Pieron 
(1965), and others, it might be argued 
that the 10 cps stimulation, most effective 
in producing photic driving of the EEG 


and the brightness enhancement phenom- 
enon, would be maximally effective in 
producing the adaptation effect, if indeed 
this frequency is most likely to stimulate 
neuro-retinal elements at the optimal rate. 
Such was not the case. 

Although Alpern and Sugiyama sug- 
gest that the CFF can be "driven" up- 
ward as well as downward, by use of 
adaptation stimulation above the CFF, 
the results of the present study do not 
indicate a significant upward threshold 
shift following the 50 cps stimulation. 
The conditions of the present experiment, 
however, were probably not conducive to 
the production of this upward shift, since 
the shift reported by Alpern and Sugi- 
yama (1961) was noticeable only after 1 
to 3 min. of stimulation whereas in the 
present study the stimulation was only of 
15-sec. duration. 

The results of this study are consistent 
with an explanation offered by Ginsburg 
(1966), despite the fact that he did not 
discuss the curvilinear nature of the adap- 
tation function. Ginsburg cited studies 
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by Grüsser and Creutzfeldt of single on- 
off retinal and cortical fibers of the cat 
which indicated that, within a certain fre- 
quency range, there is an inverse relation- 
ship between frequency of photic stimula- 
tion and frequency of nerve impulses. At 
low rates of photic stimulation, each pulse 
of light elicits multiple neural impulses; 
as flash frequency increases, the bursts 
become shorter until each flash evokes a 
single impulse. At still higher rates of 
stimulation, these fibers may fire sporad- 
ically or not at all (Bartley & Nelson, 
1963; Ginsburg, 1966). Ginsburg there- 
fore reasoned that since the lower fre- 
quencies of adaptation stimuli most likely 
elicit more rapid rates of firing of visual 
fibers, cortical cells are more rapidly stim- 
ulated at the lower frequencies and hence 
would show greater degrees of adaptation 
following the lower rates of stimuli. Fol- 
lowing similar reasoning to account for 
the decreasing effectiveness of stimulation 
below 25 cps in producing the adaptation 
effect, it is only necessary to refer to 
further studies by Grüsser of rate of 
"on" neuron discharge as a function of 
rate of photic stimulation (Reidemeister 
& Grüsser, 1959). In a study of the effect 
of light-dark ratio on discharge frequency 
of "on" neurons in the cat retina, their 
data for the 1:1 light-dark ratio indicate 
that the peak discharge rate is elicited by 
photic stimulation of about 25 cps, with 
decreasing discharge rates produced by 
stimuli greater than 25 cps and less than 
25 cps. Although it is tempting to note 
similarities between the maximal retinal 
element discharge at 25 cps found by 
Reidemeister and Grüsser and the authors" 
data indicating maximal adaptation effect 
at this frequency, such generalizations 
must remain at a highly speculative level. 
The curvilinear relation between stimula- 
tion frequency and response frequency in 
both the present study and that of Reide- 
meister and Grüsser appears more than 
coincidental, however, and encourages 
further study of the phenomenon. Data 
of the latter authors indicate that the 
maximal neural discharge rate for photic 
stimulation with light-dark ratios of less 
than 1:1 occurs at frequencies of photic 
stimulation lower than 25 cps. Future 
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studies of the adaptation phenomenon as 
a function of light-dark ratio may clarify 
the relation of this effect to retinal dis- 
charge rate. : 


REFERENCES 


ALPERN, M., & Suciyama, S. Photic driv- 
ing of the critical flicker frequency. Jour- 
nal of the Optical Society of America, 
1961, 51, 1379-1385. 

Bartley, S. H. Some facts and concepts 
regarding the neurophysiology of the optic 
pathway. American Medical Association 
Pe iro of Ophthalmology, 1958, 60, 775- 

91. 

BamrLEv, S. H., & Netson, T. M. Some 
relations between sensory end results and 
neural activity in the optic pathway. 
Journal of Psychology, 1963, 55, 121-143, 

Carraway, E. III, & ALEXANDER, J. D., Jn. 
The temporal coding of sensory data: An 
investigation of two theories. Journal 
of General Psychology, 1960, 62, 293-309, 

GiNsBURG, N. Local adaptation to inter- 
mittent light as a function of frequency 
and eccentricity. American Journal of 
Psychology, 1966, 79, 296-300. 

Granit, R. A, & von Ammon, W. Com- 
parative studies on the peripheral and 
central retina: III Some aspects of local 
adaptation. American Journal of Physiol- 
ogy, 1930, 95, 229-241. 

LiwpstLEv, D. B. Psychological phenomena 
and the electro-encephalogram. Electro- 
encephalography and Clinical Neurophysi- 
ology, 1952, 4, 443-456. 

MAHNEKE, A. Flicker fusion thresholds. 
Acta Ophthalmologica, 1956, 34, 113-120. 

Pieron, H. Vision in intermittent light. 
In W. D. Neff (Ed.), Contributions to 
sensory physiology. Vol. 1. New York: 
Academic Press, 1965. 

Rewemetster, C, & GnÜssEm, O. J. Flim- 
merlichtuntersuchungen an der Katzen- 
retina, I.  On-Neurone und on-off 
Neurone. Zeitschrift für Biologie, 1959, 
3, 241-253. Cited by J. L. Brown. Flicker 
and intermittent stimulation. In C. H. 
Graham (Ed.), Vision and visual percep- 
tion. New York: Wiley, 1965. 

Ruppert, L. A. Local adaptation to flicker 
and its relation to light adaptation. Jour- 
nal of Physiology, 1935, 2, 111-120. 

SrwoNsoN, E. The fusion frequency of 
flicker as a criterion of central nervous 
system fatigue. American Journal of 
Ophthalmology, 1959, 47, 556-565. 


(Received July 22, 1967) 


Journal of Experimental Psychology 
1968, Vol. 77, No. 4, 613-619 


MEMORY PROCESSES AND THE SERIAL POSITION CURVE: 
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| 4 experiments are reported testing the hypothesis of 2 storage 
processes in recall. In 3 experiments, Ss were presented a series of 9 
numbers in a probe-type task at differing rates. As expected, rate 
(item duration or interitem interval) affected the primacy and middle 
portions of the curve and not recency. Lower rate presumably 
facilitated learning and consequently long-term memory (LTM). 
Another experiment (using 12 letters) varied rate and also inter- 
polated a 0-sec., 10-sec., or 10-sec. filled delay between presentation 
and the probe. Delays attenuated recency (a short-term memory, 
STM, effect), filled more than unfilled. Unfilled delay elevated 


critical theoretical problem in this area. 
The present studies provide informa- 
tion on this issue. 

Recently, Glanzer and Cunitz (1966) 
have reported evidence for two mem- 
ory processes in a free recall task. In 
one study, rate of presentation (3, 6, 
or 8 sec.) and repetition (1, 2, or 3 
times) was expected to affect long- 
term memory (LTM) as reflected in 
performance on the beginning and mid- 
dle items of a list of 20 common nouns 
presented with a tape recorder. Rate 
did have the expected effect ; repetition 
did not. In a second study a delay 
interval between presentation and re- 
call was expected to affect recency. A 
15-word list was shown with a slide 
projector, one word at a time, and re- 
called 0, 10; or 30 sec. later. Again, 
the prediction was upheld. Only the 

1 This research was supported by a grant 
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primacy and middle portions when rate was fast, but in no other 
instance. These results were viewed as compatible with the 2 storage 
hypothesis. Strong evidence for the role of rehearsal in this task was 
found. 
1 
Among the more important issues last few positions were affected by the 
facing a theory of memory is whether delay. These results were interpreted 
there are two types of memory proc- as support for a two process explana- 
esses or only one. Melton (1963) tion of the serial position effect; per- 
suggested that this may be the most formance on the primacy and middle 


portions depending upon LTM, and 
the recency effect upon short-term 
memory (STM). Glanzer and Cunitz 
also adduce evidence for the two 
process hypothesis from a number of 
experiments by other investigators. 
In the main, these are findings which 
show that some variables affect a pre- 
sumed index of one storage mechanism 
and not the other. 

The present studies investigated fur- 
ther the hypothesis of two storage 
processes and tested its generality in 
a structured recall situation. A second 
objective was to develop a "probe" 
technique for the study of memory 
which would provide for rigorous con- 
trol and manipulation of pertinent vari- 
ables, and also minimize the interfer- 
ing effects of the recall process itself. 
The task used was conceptually re- 
lated to a "card guessing game" de- 
vised by Atkinson, Hansen, and Bern- 
bach (1964) for the study of STM. 
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In their study, eight cards containing 
pictures of common objects were 
shown to Ss one at a time and then 
placed face down. A sample card 
like one of the eight was placed face 
up and S's task was to choose the 
match. Correct first choice perform- 
ance usuall yields a  bow-shaped 
serial position curve (e.g., see Calfee, 
Hetherington, & Waltzer, 1966; Ellis & 
Munger, 1966). 

The present paradigm consisted of 
a series of random numbers presented 
automatically one at a time in miniature 
projectors arranged in a row. A 
sample or "probe" number was pre- 
sented in another projector. The $'s 
task was to press the "position" where 
the sample number appeared within 
the series. 


EXPERIMENT I 


In this initial study, rate of pre- 
sentation was varied and it was pre- 
dicted that performance would be fa- 
cilitated on the primacy and middle 
portions of the curve with lower rates. 
On the other hand, the recency seg- 
ment of the curve was expected to be 
invariant with change in rate. 


Method 


Subjects.—The Ss were 24 undergraduates 
who participated to fulfill a psychology 
course requirement. 

Apparatus.—The S console was a cabinet 
with a sloping front, 12.5 in. high X 30 in. 
wide. Nine miniature projectors (Indus- 
trial Electronics; one-plane digital readouts), 
were mounted in a row 4 in. from the bottom 
and 3 in. apart. The sample projector was 
mounted in the horizontal center of the 
panel 2 in. from the top and 6.5 in. above 
the row of nine. (All measurements are 
from center to center on projectors.) Lucite 
response buttons (Grason-Stadler) were 
mounted over the nine projectors which al- 
lowed S to see the numbers through the 
push buttons. The numbers were about 1 in. 
high and illuminated on a dark background. 
A door chime within the S console signalled 
a correct response. The S console sat on a 
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table. The E sat at another table behind 
S and the electromechanical programming — 
apparatus was mounted in a relay rack on 
E's table. By presetting a rotary switch 
any one of nine randomized orders of the 
nine numbers (one through nine was pre- 
sented automatically). No number appeared 
twice in any series. The sound of a stepping 
switch and a recycling timer in syncrony 
with the onset of each number was clearly 
audible to S. 

Procedure.—The Ss were told that this 
was a study of memory, and that they would — 
see a sequence of numbers, one in each of 
the "windows" in the row which would be 
followed by a probe number in the top 
window. They were further instructed to _ 
try to remember where each number had 
appeared and to press the position where 
the number like the sample had been. The 
chime sounded if S’s choice was correct. 
The Ss continued to respond until the correct 
response was made. All responses were 
recorded by E. 

Each S received 36 trials, four repetitions 
of the nine different orders. Each of these 
nine-trial blocks was presented at a differ- 
ent rate .1-, .5-, 1-, and 2-sec. item duration, 
with a .1-, .5-, 1-, and 2-sec. interitem inter- 
val, respectively, ie, item duration and 
interitem interval were confounded. The 
probe also appeared .1, .5, 1, and 2 sec., re- 
spectively, following the last item and re- 
mained on until S responded correctly. Each 
of the 24 Ss was assigned to one of the 24 
permutations of the four rates. 


Results 


Figure 1 shows the percentage of 
correct first choice responses for the 
four conditions across the nine posi- 
tions. Position 1 is the first item pre- 
sented and Position 9 the last. An 
analysis of variance showed that rate, 
serial position, and their interaction 
were statistically significant, Rate, F 
(3, 72) = 22.06, p < .001 ; Serial Posi- 
tion, F (8, 72) = 10.50, p< .001; 
Rate X Serial Position, F (24, 72) = 
2.35, p < .005. Further analyses re- 
vealed that the interaction derived from 
the absence of rate effect on Positions 
7, 8, and 9, F (3, 18) = 2.86. Rate 
did not influence performance on the 
recency segment of the curve. 
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Serial Position 


Fic. 1. Mean percentage of correct first 
choice responses as a function of presentation 
rate and serial position. 


EXPERIMENT II 


Experiment II explored another 
aspect of rate. Item duration was 
varied, with no interitem interval. 


Predictions similar to those of Exp. I 
were again made. 


Method 


Subjects—The 48 Ss were recruited from 
the same source as those in Exp. I. None 
had participated in the previous study. 

Procedure—The procedure was identical 
to that used in Exp. I, except item duration 
was .1, .5, 1, and 2 sec. and the offset of one 
item was contiguous with the onset of the 
next. Also, there was no delay (actually 05 
sec.) between Item 9 and the onset of the 
probe. Two Ss were assigned to each of 
the 24 permutations of rates. Each S re- 
ceived 36 trials. 


Results 


Figure 2 presents the results. Gen- 
erally, these findings are quite similar 
to those in the previous study. An 
analysis of variance yielded significant 
effects for both Rate, F (3, 72) = 18.41, 
p < 001, and Serial Position, F (8, 
72) 210.99, p< 001. Unlike the 
finding in Exp. I the interaction was 
not significant, F (24, 72) — 1.08. 
However, a similar trend is quite ap- 
parent. Further analyses showed the 
rate main effect to be significant for 
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Positions 1 through 6, F (3, 45) — 
19.30, p < .001, and for Positions 7, 8, 
and 9, F (3, 18) 2 4.24, p < .025. 
The latter is due to performance under 
the .1-sec. condition; the .5-, 1- and 2- 
sec. conditions do not differ on these 
positions. Apparently the .1-sec. rate 
without an interitem interval is not 
adequate from a perceptual standpoint ; 
subjectively this seemed to be the case. 
Comparisons between performance 
under each condition in this study and 
the analogous condition in Exp. I re- 
vealed that the interitem interval fa- 
cilitated performance in each instance. 


EXPERIMENT III 


In Exp. III interitem interval was 
varied and the item duration was con- 
stant. Longer interitem intervals were 
expected to be functionally equivalent 
to longer item duration. 


Method 


Subjects—The 60 Ss were recruited in a 
similar fashion. None had participated in 
either of the two previous studies. 

Procedure.—The Ss were assigned without 
bias to four groups of 15 each. Each group 
received one of four conditions, an interitem 
interval of 0, .5, 1, or 2 sec. The item 
duration was 1 sec. An S received 54 trials 
under the one condition. The onset of the 
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Fic. 2. Mean percentage of correct first 
choice responses as a function of presentation 
rate and serial position. 
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probe was contiguous with the offset of 
the ninth item. 


Results 


Figure 3 presents the results. Serial 
position and the interaction of serial 
position and rate were statistically sig- 
nificant, F (8, 449) = 1171, p < .001; 
F (24, 449) = 1.96, p < .005, respec- 
tively. Rate differences failed to reach 
statistical significance, F (3, 56) = 
2.03. However, these data show clear 
trends similar to those seen in the pre- 
vious studies. In this instance pre- 
sentation rate was a between-Sss effect. 
(In the two previous studies it was a 
within-Ss effect.) 


EXPERIMENT IV 


This experiment was designed to 
show the effects of rate of presentation 
and a delay interval interpolated be- 
tween presentation and recall, as well 
as their interaction. The task dif- 
fered from the previous ones in that 
12 letters were used instead of the nine 
numbers. The delay was expected to 
affect only the recency segment of the 
serial position curve. 


Method 


Subjects and apparatus—The 120 Ss were 
recruited in a manner similar to that used 


Serial Position 


Fic. 3. Mean percentage of correct first 
choice responses as a function of presentation 
rate and serial position. 
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Percent Correct First Choice Responses 


Seriol Position 


Fic. 4. Mean percentage of correct first 
choice responses for the three .75-sec, presen- _ 
tation rate groups. 1 


in the previous studies. A new apparatus E 
presented 24 different orders of 12 letters 
(A, C, E, H, K, L, N, P, S, U, XZ) 
The S console was larger so that similar 
spacings between projectors could be res 
tained. In other respects the apparatus was 
functionally similar, & 

Procedure.—The Ss were randomly as- 
signed to six groups of 20 each. For thr 
of these item duration was .75 sec. and { 
the other three, 1.50 sec. There was | 
interitem interval. Two groups, one ro 
each rate, received the probe letter im- 
mediately following the last stimulus letter, 
For two others, a 10-sec. delay was inter- 
polated between the offset of the last stimu- 
lus letter and the onset of the probe. Th 
two remaining groups received the 10-sei 
delay and were instructed to count back- 
wards by threes from a three digit number 
until the probe appeared. The E presented 
the number immediately following the off. 
set of the last stimulus letter. 


Results 


Figures 4 and 5 show the main re- — 
sults, Fig. 4 the .75-sec. rate groups — 
and Fig. 5 those for the 1.50-sec. 
groups. An analysis of variance of — 
these data yielded significant main 
effects for Delay, F (2, 114) = 14.83, — 
p < .001, Rate, F (1, 114) = 1749, 
P < 001, and Serial Position, F (11, 
1254) = 37.27, p < .001. The Rate X 
Delay, F (2, 114) = 2.66, interaction 
was not significant (p <.10). Delay - 
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X Serial Position, F (22, 1254) — 6.66, 
p «001, and Rate X Serial Position, 
F (11, 1254) = 1.91, p < .05, are sig- 
nificant and sois Rate X Delay X Serial 
Position, F (22, 1254) — 1.81, p < .01. 
The last is due to the elevated primacy 
performance in the .75-sec. group re- 
ceiving the 10-sec. delay. 

The effect of rate is quite similar to 
that seen in the previous studies. With 
the lower rate performance is elevated, 
with the exception of the terminal 
positions. This generalization is quali- 
fied by the different delay conditions. 
Performance on Positions 1 through 
5 of the .75-sec. rate group receiving 
the 10-sec. unfilled delay coincides with 
that of the 1.50-sec. groups. Perform- 
ance of the other two .75-sec. groups 
falls substantially below that of the 
1.50-sec, groups in this part of the 
curve. A separate analysis of variance 
of the data of Positions 1 through 5 for 
the three .75-sec. rate groups yielded 
significant F values for delay, F (2, 
57) = 12.63, p < .001, and serial posi- 
tion, F (4, 228) = 10.07, p < .001. 
The interaction was also significant, F 
(8, 228) = 2.35, p < .025. The effects 
of delay are orderly as expected. An 
analysis of variance of the six groups’ 
performance on Positions 11 and 12 
showed that delay only had an effect, 
F (2, 114) = 4809, p < .001. Rate, 
F (1, 114) = 1.15, and the Rate X De- 
lay interaction, F (2, 114) = 178, 
were not significant. Serial position 
was collapsed in this analysis. 


DISCUSSION 


In the main, these results are similar 
to those of Glanzer and Cunitz (1966) 
and could be viewed as support for the 
“two storage mechanism” hypothesis of 
memory. Moreover, the generality of 
their findings are extended to a probe 
type, structured recall task involving both 
numbers and letters as stimulus materials. 
The orderliness of the present data as well 
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Percent Correct First Choice Responso 


Fic. 5. Mean percentage of correct first 
choice responses for the three 1.50-sec. 
presentation rate groups, 


as their concurrence with expectation at- 
test to the feasibility of the methodology. 

Rate of presentation defined as item 
duration or interitem interval influences 
the primacy and middle segments of the 
serial position curve, ie. parts of the 
curve viewed as reflecting LTM processes. 
Performance. in the last two or three 
positions (recency) is not affected by 
changes in rate. A delay between pre- 
sentation and recall clearly has an effect 
upon recency. The delay with the inter- 
polated task, counting backwards, has 
even more profound effects. Delay ef- 
fects are not independent of presentation 
rate. Figure 5 shows that a “filled” or 
“unfilled” delay with the lower presenta- 
tion rate affects only the recency seg- 
ment of the curve. 

Low presentation rate seems to pro- 
vide opportunity for rehearsal during 
presentation hence better LTM as re- 
flected by the primacy and middle por- 
tions of the curves. The delay has no 
effect upon primacy with low rate. On 
the other hand a 10-sec. unfilled delay 
enhances the primacy effect when pre- 
sentation rate is high perhaps by provid- 
ing opportunity to rehearse (see Fig. 4). 
Indeed, the delay is functionally equiv- 
alent to the lower rate in so far as the 
primacy effect is concerned. Preventing 
rehearsal by filling the interval is equiv- 
alent to no delay. 
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In terms of a "limited channel" con- 
ception of STM, such as those proposed 
by Broadbent (1958) and Waugh and 
Norman (1965), the 10-sec. unfilled delay 
provides the occasion for items to be lost 
from short-term storage, as a result of 
either (or both) spontaneous decay or 
interference. Since an even greater loss 
occurs during the filled delay, some re- 
hearsal of old as well as new items may 
occur during the unfilled delay. For the 
filled delay conditions rehearsal may be 
minimal, if it occurs at all, and items in 
STM are replaced by more recent ones, 
in this instance three-digit numbers, 

A number of studies (e.g., Glanzer & 
Cunitz, 1966; Mackworth, 1962; Mayzner 
& Schoenberg, 1965; Pollack, Johnson, 
& Knapp, 1959) report findings re- 
garding rate of presentation. Signifi- 
cantly these studies have not attempted to 
control rehearsal, Waugh and Norman 
(1965) attempted to control rehearsal in 
a study employing a probe-digit paradigm. 
They state: “The subjects received ex- 
plicit instructions to control rehearsal by 
‘thinking only of the last digit you have 
heard and never of any of the earlier 
ones." A series of 15 numbers were 
presented with a tape recorder; a probe 
digit identical to one in the series fol- 
lowed immediately. The S was to report 
the number which had followed the probe 
in the series. Digits were presented at 
a rate of either 1 or 4/sec. Rate of 
presentation did not affect forgetting. 
Waugh and Norman concluded that inter- 
ference could account for their finding. 
This conclusion seems appropriate only if 
they succeeded in controlling rehearsal. 
In view of the pronounced recency effect 
and near total loss of memory for digits 
at the beginning of the series, it is likely 
that rehearsal was appreciably attenuated. 
Absence of rehearsal could account for 
their failure to find a primacy effect, a 
finding which contrasts sharply with the 
authors’, 

Norman (1966) using the task proce- 
dure devised by Waugh and Norman 
(1965) found substantial recency effects 
due to rate of presentation (1, 4, 7, and 
10 items/sec). Performance was sig- 
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nificantly facilitated by slower rate. How- 
ever, these differences may reflect per- 
ceptual difficulties rather than differences 
in memory. No primacy differences are 
apparent in his data though performance 
on the initial four items is not determined, 
On the surface Norman's results do not 
agree with the authors, However, a 
number of methodological differences pre- 
cludes meaningful comparisons, 

On the basis of these data as well as 
those of others, a hypothesis of two mem- 
ory processes seems tenable. A “pri- 
mary memory,” to use the Waugh and 
Norman terminology, may provide a 
transient store for only a few items, in 
the authors’ case the last two or three 
items. A “secondary memory” is un- 
limited and is more stable. The transfer 
of items to secondary memory seems to 
depend upon rehearsal or other strategies. 
Primary memory would appear to depend 
mainly upon perceptual clarity. This con- 
ception of memory does not deny the role 
of interference in tasks such as the one 
presented here. It may provide a fresh 
point of departure for studies of human 
memory. 
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INFERENCE ABOUT A NONSTATIONARY PROCESS? 


JAMES O. CHINNIS, Jr. ano CAMERON R. PETERSON 
University of Michigan 


This paper. presents optimal models for inferential performance in a 
task characterized by both (a) a probabilistic relation between data 
and the process generating the data and (b) a possibility that a 
change can occur in that process. An experiment introduced the 
possibility of change halfway through each sequence of data. The Ss 
were conservative in making inferences prior to the possibility of 
change; they were equally as conservative after the possibility of 
change. The Ss reacted appropriately to information about moderate 
and large probabilities of change; they underacted to information 
about small probabilities. They were only partially sensitive to differ- 
ential diagnostic value of data when revising estimates about whether 


or not a change had occurred. 


Much of man’s world is uncertain. 
Observation of events may provide in- 
formation about the underlying pro- 
cess that generated the events, but the 
information is often fallible. The 
theory of statistical inference is a set 
of formal procedures for using fallible 
data in order to make inferences about 
underlying processes, and a body of 
psychological research has profitably 
used that theory as a framework within 
which to investigate the way people in- 
tuitively make such inferences (Peter- 
son & Beach, 1967). 

Processes about which inferences are 
made may change with time. A pro- 
cess that remains the same through 
time is called stationary and one that 
changes is called nonstationary. Non- 
stationarity may increase the difficulty 
of statistical inference. Temporal 
fluctuations in a sequence of data gen- 
erated by a stationary process may be 
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assumed to be random fluctuations. In 
a nonstationary situation, however, the 
temporal fluctuations can be either ran- 
dom or the result of a change in the 
process generating the data. Previous 
research has shown that people are 
sensitive to change ; a change in a data- 
generating process is tracked by a cor- 
responding change in inferences made 
by Ss (Peterson & Beach, 1967, pp. 
4142). The purpose of the present 
experiment was to relate inferences 
about nonstationary processes to cor- 
responding optimal inferences. 

The theory of statistical inference has 
been developed for the purpose of deal- 
ing with fallible data generated by a 
stationary process, but it can also be 
applied to a nonstationary situation. 
One way of making such an application 
consists of “looking at a changing world 
so that it seems to be unchanging [Ed- 
wards, Lindman, & Phillips, 1965, p. 
310].” For example, even though it is 
unknown whether a change in the data- 
generating process will occur, the 
probability of the change may be 
known. If that probability of change 
does not vary, the change itself may 
be considered as generated by a prob- 
abilistic process that is stationary. It 
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seems reasonable to assume that many 
of the dynamic aspects of a person's 
environment may be characterized as 
change that is in turn controlled by a 
higher-order process that is stationary. 
Accordingly, this hierarchical structure 
was designed into the stimulus situation 
and made available to Ss in the present 
experiment, 


MopeEts 


The models presented below pre- 
scribe optimal performance for the spe- 
cific task that was used in the experi- 
ment. In that task, one of two different 
binomial populations, either 4 or B, 
was chosen at random to generate each 
sequence of data. A possibility of 
changing the population was introduced 
halfway through the sequence. With a 
specified probability of change, P(C), 
the other population was selected to 
complete the sequence; with probability 
1 — P(C) the original population com- 
pleted the sequence. After observing 
each datum, S revised his odds estimate 
about which population was generating 
the sequence; during the last half of 
the sequence he also revised his esti- 
mates about whether or not a change 
had occurred. 

Model of a stationary process.— 
Bayes's theorem prescribes the optimal 
revision in odds about which stationary 
process has generated a datum. Given 
two populations, A and B, and the oc- 
currence of a datum, D, the odds-likeli- 
hood ratio form of Bayes's theorem 
prescribes 


P(A|D) P(D|A) P(A) t 
P(B|D) P(D|B) P(B) 
or more simply, 
0,5 La, [2] 


where Q, is the odds in favor of Popu- 
lation A over Population B prior to 
the observation; Q, is the correspond- 
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ing odds posterior to observing D ; and 
L is the likelihood ratio, the ratio of the 
probability of sampling D from Popu- 
lation A to the probability of sampling 
D from Population B. Inferences are 
additive when Equation 2 is expressed 
in the following logarithmic form 


[3] 


Several experiments have shown that 
people make conservative inferences. 
The log L inferred by subtracting the 
log of estimated prior odds from the 
log of estimated posterior odds is of 
smaller absolute value than the cor- 
responding Bayesian log L. Further- 
more, the amount of conservatism in- 
creases as data become more diagnostic, 
i.e., as the Bayesian log L departs from 
zero (Peterson & Miller, 1965). 
Hierarchical model of a nonstation- 
ary process.—Bayes's theorem applies 
to a stationary process but can be ex- 
tended to the present nonstationary sit- . 
uation as follows. At any time prior 
to the possibility of change, there is 
only one way in which Population 4 
could be generating the data, and that 
is to have been selected at the begin- 
ning of the sequence. After the pos- 
sibility of change, however, there are 
two ways for A to be generating the 
data: (a) to have been selected origin- 
ally or (b) for B to have been selected 
originally and then changed to 4. 
Bayes's theorem can be used to revise 
the probability of sampling from 4 un- 
der each of these assumptions. It can 
also be used to calculate the probability 
that each assumption is correct, i.e. 
whether or not a change occurred : 


P(D|C) P(C) 
P(D) [4] 


where D refers to the datum upon 
which the revision is based and 


P(D|C) = P(D|B,) P(Ay) + 
P(D|A,) P(By) 


log 9, = log L + log Q,. 


P(C|D)- 


[5] 
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where A, means that Population A was 
generating the data prior to the pos- 
sibility of change and A, means that 
Population 4 is generating the data 
after the possibility of change. 

This set of probabilities permits a 
calculation, at any time after the pos- 
sibility of change, of the probability 
that Population A is generating the 
data. The P(A,|D) is equal to the 
following expected value: 


P(A,|D) = P(A,|D,NC) P(NC|D) 
+ P(B,D,C) P(C|D) [6] 


where P(A,|D,NC) is revised by 
Bayes's theorem under the assumption 
of no change. 

The hierarchical model captures the 
flavor of making inferences about a 
changing process by looking at the pro- 
cess from a point of view from which 
it appears stationary. The model is 
cumbersome, however, because it re- 
quires that each datum be processed 
three separate times: once with respect 
to which population is generating the 
data under the assumption of no 
change, again under the assumption of 
change, and yet again with respect to 
whether or not the change really oc- 
curred. This complexity would, of 
course, increase greatly if it were pos- 
sible for change to occur at more than 
one time in a sequence. 

Amunesic model of a nonstationary 
process.—A model formally equivalent 
to the hierarchical model, but one far 
simpler in conception, is the amnesic 
model. The essence of the amnesic 
model is that information, whether it 
be information about a sampled datum 
or about a probability of change, is 
incorporated into the current opinion 
about which population is generating 
the data ; the source of that information 
is thereafter forgotten or ignored. 

The model applies to the present ex- 
perimental situation as follows. Prior 
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to the chance of change, data are pro- 
cessed by Bayes's theorem (Equation 
1). The introduction of a chance of 
change revises the current probability 
that Population A is generating the 
data. This revision is given by the 
following expected value: 


P(A,) = P(Ay) P(NC) + 
P(B.) P(C) [7] 


where P(A,) is the probability, im- 
mediately prior to the chance of change, 
that 4 was generating the data and 
P(A,) is the probability of A immedi- 
ately after learning the probability of 
a change. 

After this revision, as data are ob- 
served, the appropriate revision rela- 
tion is again expressed by Bayes's 
theorem, 


P(D|A1) P(A:) 
PD) [8] 


where the prior, P(A,), is the resultant 
of taking the expected value in Equa- 
tion 7. The amnesic model avoids con- 
cern with change after incorporating 
information about the chance of change. 


P(A,|D) = 


METHOD 


Experimental — design.—The experiment 
compared inferential performance with per- 
formance prescribed by the models described 
above. This required Ss to revise, after 
each datum, their estimates of odds about 
which population was being sampled. In 
addition, a test of the hierarchical model re- 
quired a revision of estimates of odds about 
change vs. no change. { 

The experimental design used three prob- 
abilities of change, .2, .5, and .8. It 
also used two levels of data diagnosticity, 
manipulated by varying symmetric-likeli- 
hood ratios; they were 2:1 and 1:2 for the 
pair of hypotheses used in Cond. L2, 4:1 and 
1:4 for the pair of hypotheses used in Cond. 
L4. Each combination of probability of 
change and likelihood ratio contained 3 
data sequences for a total of 18. All Ss 
saw the same 18 sequences. For half of the 
Ss, the 9 sequences in Cond. L4 preceded the 
9 in L2; the order was reversed for the 
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other Ss. Each sequence consisted of 11 
items: five binomial data samples from one of 
two possible populations, a statement of a 
probability that a change occurred, and then 
another sequence of five binomial data. The 
sequences faithfully represented information 
displayed to Ss about the binomial processes 
and about the probabilities of change. 

Subjects.—Forty undergraduate men at the 
University of Michigan served as Ss in 
groups of five or less at a fixed hourly pay 
rate. 

Apparatus and procedure,—The experi- 
mental design required two different random 
processes, one to generate sequences of data 
and the other to determine whether or not a 
change occurred in the data-generating pro- 
cess. Condition L2 used two different urns 
to generate sequences of data; one con- 
tained two black chips and a single white one, 
the other contained one black chip and two 
white ones. Condition L4 also used two 
urns; one contained four black chips and one 
white one, the other contained one black 
chip and four white ones. A random sample 
with replacement from an urn determined the 
sequence of data. The random process that 
determined change or no change was a 
spinner. The appropriate percentage of one 
color under the pointer on a spinner repre- 
sented the probability of change; the other 
color represented no change. 

Each of two practice sequences proceeded 
as follows. A display indicated the number 
of black chips and the number of white chips 
in each of the possible urns. One of the 
urns was selected by the toss of a coin; a 
sequence of five chips was sampled randomly 
and with replacement from that urn. The 
Ss observed the color of each sampled chip 
and revised their odds estimates about which 
of the two urns was being sampled. Next, 
the spinner dictated whether or not E 
switched the urn from which he was sam- 
pling to the other urn. The Ss observed 
only the percentage of the colors represent- 
ing change and no change, respectively, and 
not the outcome of the spin. Immediately 
after the spin, Ss again revised their odds 
estimates about which urn was now being 
used. Finally, a sequence of five chips was 
sampled from the current urn and Ss re- 
vised their odds estimates about the popu- 
lations after each chip was sampled. In 
addition, after each of the five data in the 
second half of the sequence, they revised their 
odds estimates about whether or not a change 
had occurred. 

The E actually used these random devices 


in full view of Ss only for two practice 
sequences prior to beginning the experiment. 
He then instructed Ss that all remaining 
sampling had been conducted and recorded 
prior to the experiment. This had been 
done in order to accelerate the experimental 
procedure and to insure that all groups of 
Ss observed the same sequences. Thus, 
during the experiment, E merely displayed 
the outcomes of the prior sampling. The 
actual outcomes of the sequential sampling 
were controlled to correspond to the proba- 
pilities displayed to Ss. 


RESULTS 


Inferences about populations —Fig- 
ure 1 compares the revision in odds 
by the mean S with corresponding 
Bayesian revisions. The amount of 
revision as a result of each datum was 
calculated by finding the absolute dif- 
ference between the log of S’s esti- 
mated prior odds and the log of his 
estimated posterior odds (Equation 2 
in logarithmic form). This measure 
of revision, the absolute inferred 
log-likelihood ratio (see Peterson, 
Schneider, & Miller, 1965, for the ra- 
tionale of this measure), was averaged 
within each of the 10 trials (data 


MEAN CUMULATIVE URN |LLRÍ OVER 
ALL SEQUENCES AND SUBJECTS 


TRIAL 


Fic. 1. Cumulative values of absolute 
log-likelihood ratios about which population 
was being sampled. (The solid line shows 
optimal values for Cond. L4 and the dashed 
line for L2. Horizontal bars indicate +1 SE 
around each mean.) 
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items) in Cond. L2 and also in LA. 
The averaging was accomplished by 
calculating means across the nine se- 
quences for each S, and then by cal- 
culating means and standard errors 
across the 40 Ss. The resulting mea- 
sure on each trial was then transformed 
to a cumulative measure of revision 
by adding it to the sum of the measures 
for all preceding trials in the sequence. 
This yielded a cumulative log-likeli- 
hood ratio. 

The data points in Fig. 1 also ap- 
pear to form linear functions with 
intercepts of zero, and with a greater 
slope for the L4 than the L2 data 
points. Since the empirical slopes are 
smaller than the Bayesian slopes, S's 
were conservative ; on the average they 
revised their estimates less than the 
optimal amount. But the important 
new result in Fig. 1 is not that Ss 
are conservative; it is that revisions on 
Trials 6 through 10, after the possible 
change, are a nearly perfect linear 
extrapolation of Trials 1 through 5, 
prior to the possible change. The 
amnesic model prescribes that likeli- 
hood ratios based upon data sampled 
after the chance of change should be 
the same as those prior to the chance 
of change. That is what happened. 

Odds revisions based upon prob- 
ability of change.—Figure 1 does not 
display how Ss revised their odds 
estimates as a result of the informa- 
tion provided by a probability of 
change that occurred between Trials 5 
and 6. By recovering probabilities 
from odds estimates immediately before 
and after the chance of change, and 
substituting them into Equation 7, it is 
possible to calculate the probabilities of 
change and no change S would have 
been using if he were actually revising 
his odds estimates according to Equa- 
tion 7. The median inferred prob- 
abilities of change, across all Ss and 
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MEAN CUMULATIVE CHANGE |LLRÍ OVER 
ALL SEQUENCES AND SUBJECTS 
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Fic. 2. Cumulative values of absolute 


log-likelihood ratios about whether a change D 


occurred. (The solid line shows optimal 
values for Cond. L4 and the dashed line for 
L2. Standard errors of the displayed means 


range .036-.045 for Cond. L2, and .063- — 


091 for L4. Predicted and obtained points 
coincided for Trial 8 of Cond. L4.) 


conditions, were .05, .50, and .82, — 
for displayed probabilities of .2, .5, 
and .8, respectively. There was some — 
variability from condition to condition, 
but it was not systematically related 
to the L2-L4 variable or to the mag- 
nitude of the Bayesian probability prior — 
to the possibility of change. The Ss 
reacted quite appropriately to the mod- 


erate and large probabilities of change, — 
but underreacted to the small probabil- 


ity. In none of the six different ex- 
perimental conditions did the median- | 
inferred probability of change asso- 
ciated with .2 exceed .1. 

Inferences about — change.—The. 
sampling of data leads to revisions in. 
odds about the populations. Follow. 
ing the chance of change, the hier- 
archical model implies that correspond- 


ing revisions should occur in odds 


about the change itself. Figure 2 
compares Bayesian with S's revisions 


of odds favoring change. The depen- 


dent variable is the mean cumulative 
absolute log-likelihood ratio as in Fig. | 
l. The upper solid line represents 
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mean normative values for Cond. L4 
and the lower dashed line for L2. 
Note that these functions are mot 
linear as in Fig. 1. If a Bayesian were 
uncertain about which population were 
being sampled prior to chance of 
change, then he must also remain un- 
certain about whether or not there has 
been a change even though later data 
become highly diagnostic about which 
population is finally being sampled. 

The circles are also theoretical values 
based, not on the actual composition of 
the urns in the two conditions but, 
upon the mean inferred log-likelihood 
ratios during the first five trials in 
Fig. 1, prior to the chance of change. 
The hypothesis that a structure of sub- 
jective probabilities is internally con- 
sistent, and remains consistent through- 
out the process of change (Beach, 
1966 ; Peterson, Ulehla, Miller, Bourne, 
& Stilson, 1965), implies that the tri- 
angles in Fig. 2 should conform to 
the corresponding circles. 

The data points, triangles, associated 
with L4 go up at a greater rate than 
the data points associated with L2. 
In the more diagnostic condition Ss 
extracted more information about 
whether or not there had been a 
change. The Ss, while sensitive to the 
difference in diagnostic value of the 
data in the two conditions, were not 
adequately sensitive. The circles in 
Fig, 2, derived from the degree of con- 
servatism in Fig. 1, do not adequately 
account for this departure from opti- 
mality. 

Discussion 


Previous research has shown that Ss 
track changes introduced into probabilistic 
stimulus sequences. This experiment used 
two optimal models to evaluate different 
components of the overall reaction to 


change. Both models prescribe how Ss 
should react to direct information about 
the probability that a change has occurred. 
The Ss reacted appropriately to moderate 
and large probabilities, but underreacted 
to the small probability. The amnesic 
model prescribes that, after processing in- 
formation about the probability of change, 
the nonstationary aspect of the task can 
be ignored and subsequent data my be 
processed just as if the situation were 
stationary. In accord with the amnesic 
model, Ss revised estimates of which pop- 
ulation was being sampled in almost 
exactly the same manner both before and 
after the introduction of a possibility of 
change. The hierarchical model necessi- 
tates using data to revise estimates about 
whether or not a change occurred. Data 
that are more diagnostic about which 
population is being sampled are also more 
diagnostic about a change. The Ss were 
sensitive to that differential diagnosticity, 
but only partially so. 
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EFFECTS OF MAXIMIZING AVAILABILITY AND 
MINIMIZING REHEARSAL UPON ASSOCIATIVE 
SYMMETRY IN TWO MODALITIES' 


KEITH A. WOLLEN 
Washington State University 


The principle of associative symmetry was examined using highly 
available materials presented at fast rates in both acquisition and 
recall. In each of 2 experiments, 80 Ss were given a series of 
consecutive study trials (with no tests) on 5 pairs of even-odd or 
odd-even single digit numbers. In both experiments, 1 using visual 
presentation and the other auditory, S-R recall was significantly 
greater than R-S recall. Moreover, S-R associations were retained 
better from 1 trial to the next. 


The data were interpreted as 


inconsistent with the symmetry hypothesis. 


Two experiments were run to test 
the principle of associative symmetry 
(Asch & Ebenholtz, 1962) that back- 
ward (R-S) associations are as strong 
as their forward (S-R) counterparts. 
Previous attempts to test for symmetry 
have all suffered from one or more 
problems which preclude interpretation 
of the data. Thus it is important to 
specify these problems and the methods 
used to circumvent them. 

Differential S and R availability.—As 
Asch and Ebenholtz pointed out, the 
fact that R-S recall is typically less 
than S-R recall does not necessarily 
mean that R-S associations are weaker. 
It may be that R-S associations are 
equally strong but remain latent since 
stimulus members are typically less 
available for recall than response terms. 
Although the problem of unequal stim- 
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Penny Parmenter who gathered the data of 


Exp. II. 


ulus and response-term availability is 
recognized, no completely satisfactory 
method has been used to eliminate it. 
The procedure of using familiarization 
trials is inadequate since the subsequent 
S-R acquisition trials may reintroduce 
differential availability (Asch & Eben- 
holtz, 1962). The procedure of using 
double-function lists in which both 
stimulus and response terms must be 
learned (eg. Young & Jennings, 
1964) introduces the possibility that 
backward associations become inhibited 
to reduce competition with forward 
associations; e.g., the backward tend- 
ency of D to elicit C may become in- 
hibited because it interferes with 
the acquisition of D-E. This would re- 
sult in spuriously low R-S recall. Prob- 
lems also arise when conventional uni- 
directional training is compared with 
bidirectional training in which pairs are 
presented in the S-R direction on some 
trials, and in the R-S direction on 
others (e.g., Schild & Battig, 1966). 
The apparent asymmetry in such ex- 
periments may have resulted from the 
fact that both stimulus and response 
members must become available (i.e., 
learned) with bidirectional training, 
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whereas only response terms need be 
acquired when training is unidirec- 
tional. 

Rate.—The importance of a fast re- 
call rate has been emphasized by Ek- 
strand (1966), and the importance of 
a fast presentation rate during acquisi- 
tion by Wollen and Gallup (1965). 
Except for Murdock's (1962) studies, 
which are of questionable relevance to 
symmetry, the fastest acquisition rate 
in tests of symmetry has been 2:2 sec., 
and rates as slow as 6 or 8 sec. per pair 
are frequent (e.g, Asch & Ebenholtz, 
1962; Horowitz, Norman, & Day, 
1966).. With such slow rates, Ss 
would have time to rehearse the “back- 
ward” associations in the forward di- 
rection; eg., in learning A-B, S 
might overtly or covertly say A-B-A- 
B-A-B in which case A would follow 
as well as precede B. The fact that 
Ss do look back and forth has been 
demonstrated (McCormack & Halt- 
recht, 1966). Thus, a comparison of 
S-R and R-S recall might really be a 
comparison of two forward associations 
rather than of forward and backward 
associations. It is interesting that all 
Es reporting evidence of associative 
symmerty have used slow rates of pre- 
sentation. 


EXPERIMENT I 


To avoid the indicated shortcomings, 
the following procedures were adopted. 
The problem of unequal stimulus and 
response availability was handled by 
using single-digit, number-number 
pairs in which the odd numbers served 
as stimuli and the even as responses, 
or vice versa, and by telling Ss which 
specific numbers would be on each 
side. Thus both stimulus and response 
terms were highly available. Acquisi- 
tion consisted only of study trials to 
preclude the possibility that the re- 
sponse terms would become more avail- 
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able from being the only ones pro- 
duced (as suggested by Asch & Eben- 
holtz, 1962). The absence of recall 
tests during acquisition also eliminated 
the possibility of superior S-R. recall 
due to practice of S-R but not R-S re- 
call during acquisition (Ekstrand, 
1966, p. 59). Thus Ss had no recall 
practice in either direction prior to the 
S-R or R-S test. The use of a very 
fast rate which permitted just enough 
time for Ss to pronounce each stimulus 
and response term reduced the possi- 
bility of uncontrolled forward rehearsal 
of the R-S associations. 

Ekstrand (1966, p. 50) pointed out 
that R-S Ss might run through the 
pairs covertly in the S-R direction 
until they encounter the pair contain- 
ing the response member in the win- 
dow. It would then be a simple mat- 
ter to "recall" the stimulus term even 
in the absence of backward association. 
Another scheme was reported by a 
pilot S who covertly said each of the 
stimulus numbers until he found the 
stimulus that “sounded right.” He 
then said the stimulus number. 
Clearly such devices could result in 
spuriously large amounts of R-S re- 
call. In order to preclude such devices, 
the time available for recall was ap- 
preciably less than in previous studies. 


Method 


The Ss were 80 freshmen írom intro- 
ductory psychology classes whose participa- 
tion was partial fulfillment of a course re- 
quirement. The learning materials consisted 
of the following odd-even (O-E) number 
pairs: 3-0, 5-8, 1-6, 7-4, 9-2, or. the same 
pairs in the reverse (E-O) order. Each S 
was given instructions for acquisition and 
practice, practice, more instructions for ac- 
quisition, acquisition, instructions for recall, 
and recall, in that order. 

The acquisition instructions emphasized the 
following points. Each pair would consist 
of an odd number on the left and an even 
on the right (or E-O, depending on the 
condition), The five pairs would be pre- 
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sented at a very rapid rate. The task would 
be to pronounce each number of the pair 
just as soon as it appeared, and to learn 
which even number was paired with each 
odd number, for O-E acquisition, or which 
odd number was paired with each even 
number in the case of E-O training. Aware- 
ness of which specific numbers would ap- 
pear on the left and which on the right was 
insured by requiring Ss to verbalize the 
arrangement prior to training. The Ss were 
told they would be tested later, but they were 
not told anything about the test until the 
end of acquisition. 

Each S was informed that there would 
be a series of practice trials using pairs of 
letters in place of pairs of numbers. He 
was instructed to pronounce each letter as it 
appeared in order to become accustomed to 
the fast rate. Then three practice trials 
were administered using five pairs of single 
letters presented on a Lafayette drum. The 
stimulus was presented alone for 3 sec. and 
was paired with the response term for 3 
sec.; there was no intertrial interval, Recall 
of the letters was not required. 

Following practice, S was reminded of the 
major points of the instructions and given 
six consecutive study trials at the same rate 
as in practice. The pairs were presented in 
the O-E direction for 40 Ss, and in the E-O 
direction for the remaining 40 Ss. 

After acquisition, the recall instructions 
were read, resulting in a delay of approxi- 
mately 50 sec. from the end of acquisition to 
the start of recall. The Ss were told that 
each of the even (odd) numbers would be 
shown, and that they were to say the odd 
(even) number that was previously paired 
with it. They were reminded again that 
the missing numbers would be even (or odd). 
All Ss were given two consecutive recall 
trials with no intertrial interval and no 
knowledge of results. Half of the O-E 
and half of the E-O groups were shown 
each of the stimulus terms in the left window 
with the right (response) window closed and 
were asked to recall the responses. The re- 
maining Ss were shown the response terms in 
the right window with the left (stimulus) 
window closed and recalled the stimuli. 
Each number was exposed for 1 sec. during 
which time Ss had to pronounce the number 
in the window and then say the missing num- 
ber. The importance of guessing when un- 
certain and not falling behind was stressed. 
Six orders of presentation were used on 
study trials, and two different orders in 


recall. 
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Results and Discussion 


The total number correct on both . 
recall trials was 162 for S-R Ss as 
compared to only 83 for R-S Ss. The 
respective means of 4.05 and 2.08 were 
significantly different, F (1, 78) = 
11.51, p < .01. 

It is also possible to examine short- 
term retention by determining the per- - 
centage of pairs correct on the first - 
recall trial that were also correct on 
the second. According to associative 
symmetry, R-S asso"iations should be 
retained as well as S-R since both are 
equal in strength. However, the S-R 
Ss retained 60% of the 86 pairs they 
got correct on the first trial, whereas 
the R-S retained only 36% of 47 pairs. 


Experiment IT 


Whenever pairs are presented vis- 
ually, as with the memory drum used 
in Exp. I, there is the possibility of an — 
artifact. The R-S Ss may be at a di 
advantage since they have been trai 
to look to the left after each drum - 
advance but in recall they must loo 
to the right. The procedure of cover- 
ing the left window for R-S recall 
should have reduced this problem, 
but perhaps not enough. An alterna- 
tive procedure of presenting the re- 
sponse terms in the left side for R-S 
Ss would also have been potentially 
problematic since this would have re- 
sulted in a change of contextual cues | 
from those present in acquisition. 
Exp. II used auditory presentation to 
achieve temporal direction without 
spatial cues. Thus the symmetry hy- 
pothesis was tested without the prob- 
lems associated with spatial directional- 
ity. 

Method 


The lists were identical to those of Exp. 
I, and there were 20 Ss in each list-recall — 
condition. The procedure for Exp. II was 
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the same as for Exp. I, except for changes 
necessitated by the use of auditory presenta- 
tion. The first member of each pair was 
at a slightly higher pitch than the second to 
make the pairs distinguishable. A possibly 
important difference between the two ex- 
periments was that Ss did not pronounce the 
stimulus or response terms, as in Exp. I, 
since pronunciation would have interfered 
with listening to the presentation. Thus 
there was less control over Ss' rehearsal. In 
recall, Ss simply provided the missing num- 
ber of the pair after hearing the stimulus (or 
response) term. After two recall trials at 
a l-sec. rate, two additional recall trials 
were given at a 2-sec. rate. The purpose 
of this was to determine whether symmetry 
obtained for the slower (2-sec.) rate but 
not for the faster (1-sec.) one. 


Results and discussion 


The total number correct on the 
first two recall trials was 124 for S-R 
vs. 81 for R-S. The respective means 
of 3.10 and 2.02 were significantly dif- 
ferent, F (1, 78) = 5.14, p«.05. 
Moreover, the S-R Ss retained 49% of 
the 65 pairs they got correct on the 
first test trial, as compared to only 
21% of 51 pairs for the R-S Ss. One 
might suspect that Ss would learn the 
pairs as one two-digit number (e.g., 
30) rather than as two single digits 
(eg. 3-0). However, only 10 of 80 
Ss reported using this strategy in Exp. 
IL Although not recorded in Exp. I, 
the proportion was probably even 
lower since those Ss were required to 
to verbalize the pairs as two single 
digits. 

The argument might be advanced 
that the difference between S-R and 
R-S recall is due to a greater tendency 
to guess when recall is in the forward 
direction, If this were the case, S-R 
Ss should have had more overt errors 
since guessing would produce more 
wrong than correct responses. How- 
ever, the total number of errors on 
Trials 1 and 2 was 42 for S-R Ss and 
47 for R-S. The error data for Exp. 
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I appeared highly similar but the totals 
are not presented since E used the 
same symbol for both omissions and 
errors in some instances. This was 
necessitated by S's pronunciation of 
both members of the pair at such a fast 
rate. Since S-R recall did not result 
in more overt errors than R-S recall, 
it does not seem reasonable to attribute 
the observed S-R superiority to guess- 
ing. 

The fact that asymmetry was ob- 
tained using a fast presentation rate 
whereas findings of symmetry have 
typically used slow rates suggests that 
rate may be a critical variable. In 
evidence of this, the S-R superiority 
was reduced when the recall time was 
increased from 1 sec. to 2 sec. per 
item on Trials 3 and 4 of Exp. II. 
The mean correct on these two trials 
was 6.32 for S-R vs. 570 for R-S. 
Thus, fast recall rates do appear im- 
portant in obtaining asymmetry. 

The use of a fast rate will not neces- 
sarily insure asymmetry, especially if 
Ss are not required to pronounce the 
pair members. Murdock (1962, Exp. 
II) reported nearly symmetrical recall 
despite the use of a fast rate. How- 
ever the relevance of his data as a 
test of symmetry may be questioned. 
To compare backward and forward 
associations implies that it is meaning- 
ful to talk about direction. In the ex- 
periments reported herein, the pair 
members were presented directionally, 
and S was required to verbalize them 
directionally and overtly. In contrast, 
Murdock eliminated directionality by 
presenting both members of a pair 
simultaneously and by requiring no 
verbalization of any kind. In addition, 
Murdock’s Ss were instructed that 
either forward or backward recall (but 
not both) would follow each of 240 
lists. These instructions, together with 
the fact that Ss did not have to pro- 
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nounce the terms, might well have en- 
couraged rehearsal in the backward 
direction in some instances and in the 
forward direction in others. Backward 
rehearsal would be especially likely af- 
ter two or three successive recalls in 
the forward direction. Thus it is not 
surprising that Murdock found little 
evidence of directionality. 

Ekstrand (1966, p. 60) has con- 
cluded that “the difference between for- 
ward and backward associative learn- 
ing has been drastically overestimated 
and ... if symmetry is not the rule, 
asymmetry will be very small.” How- 
ever, the data obtained in the present 
experiments provide clear evidence of 
asymmetry both in total recall and in 
retention. 
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Time judgments of 1-10 sec. by the methods of estimation, production, 


and reproduction were obtained 
phrenic patients. 
upon whether individual response: 


from 20 normal Ss and 54 schizo- 


The latter were classified into 2 groups, depending 


s met a criterion for significance of 


regression upon the given time values. Slopes of the Response X Time 
Functions for Ss exhibiting significant regressions conformed to 
expectations of consistency among the methods based upon an internal- 


clock analogy. Individual mean 


responses for Ss failing to meet the 


criterion for significance of regression were highly reliable and highly 


correlated with the intercepts of 


the functions, indicating that level 


of response is not necessarily a valid measure of time judgment. 


Many accounts of time perception 
involve some form of analogy to a 
clock. However, different methods of 
obtaining time judgments have not 
yielded the straightforward mutual 
consistencies one might expect if such 
an analogy were useful (Clausen, 1950; 
Du Preez, 1963; Kruup, 1961; Och- 
berg, Pollack, & Meyer, 1965; Sieg- 
man, 1962; Wallace & Rabin, 1960, 
pp. 217-221; Warm, Morris, & Kew, 
1963). Since the apparent inconsis- 
tencies may be due to a lack of explicit- 
ness as to just what the analogy im- 
plies, an attempt to set forth its essen- 
tial requirements, at least for rela- 
tively short time intervals, is made in 
this experiment. 

The analogy consists basically of a 
comparison of one clock, a hypothetical 
internal one, with another, E's real 
clock. There is therefore only one 
fundamental variable to be assessed, 
namely the rate at which the internal 


1The authors wish to acknowledge the 
kind generosity of N. Beckenstein, Super- 
intendent, and W. Keup, Director of Re- 
seach, Brooklyn State Hospital, in raaking 
patients available for this study, and the 
excellent assistance of S. Farber, L. Fein- 
gold, and S. Weinstein in testing the patients. 


clock runs relative to the rate of the 
real clock. For convenience the “run- 
ning” of the internal clock is referred 
to as counting on the part of S, but 
the clock itself should be understood 
theoretically to mean whatever process 
may exist that renders S capable of 
counting at a regular rate whether he 
counts overtly or not. 

One may illustrate the consistencies 
required among the three main meth- 
ods of time judgment (Bindra & 
Waksberg, 1956) by reference to 
Fig. 1. The time interval presented or 
defined by E (the independent vari- 
able) is represented on the abscissa, 
and the response of S (the dependent 
variable) is represented on the ordi- 
nate. In the method of estimation, E 
presents an actual time interval and S 
names the number of seconds he judges 
to have elapsed. If the given interval 
is 4 sec. and S counts at the rate of 
2/sec, he will count to 8 during the 
interval and E will plot 8 against 4. 
If E presents 2 sec., S will count to 4 
and E will plot 4 against 2. In gen- 
eral, the slope of the response function 
is determined directly by the rate of 
the internal clock, p. If p= l/sec 
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Fic. 1. Hypothetical response as a func- 


tion of given time interval. (The slope of 
Function 1 is equal to 1. The slopes of 
Functions 2 and 3 are reciprocally related. 
A, Bs, By', C represents the sequence for re- 
production if Function 2 represents estima- 
tion and Function 3 production. A, Bs, By', 
C represents the sequence if Function 3 
represents estimation and Function 2 produc- 
tion. See text.) 


the function labeled 1 in Fig. 1 will be 
obtained, if p — 2/sec, the function 
labeled 2 will be obtained, and if p — 
.5/sec, the function labeled 3 will be 
obtained. 

In the method of production, E 
names the given time interval and .$ 
is required to hold down a key (or 
otherwise cause a real clock to operate) 
for the amount of time he judges to be 
equivalent to the stated interval. If 
the given interval is 4 sec. and S counts 
at the rate of 2/sec, he will release 
the key at the end of 2 sec. and E will 
plot 2 against 4. In general, the slope 
of this function is determined by 1/p 
and by-— l/b, where b, =the slope 
of the function for production and be = 
the slope of the function for estimation. 
The clock analogy provides no basis 
for assuming that the rate of the inter- 
nal clock will necessarily be the same 
for both methods of judgment. But if 
the rate is the same and if Function 2 
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is obtained for estimation, Function 3 
represents the logically consistent re- 
sult for production. If Function 3 is 
obtained for estimation, Function 2 
represents the logically consistent re- 
sult for production. 

In the method of reproduction E 
presents an actual time interval, opera- 
tionally as in the method of estimation, 
and S holds the key down for a judged 
equivalent interval, operationally as in 
the method of production, neither E 
nor S overtly naming the interval. If 
E presents 4 sec. and S counts at the 
rate of 2/sec, then, in order to be con- 
sistent with the result for estimation, 
8 becomes the effective given time 
interval to be produced. At the rate 
of 2/sec S will count to 8 in an actual 
interval of 4 sec, and E will plot 4 
against 4. In Fig. 1 the dashed lines 
trace this sequence, with 4 represent- 
ing the presented actual interval, B, 
and B; the effective given interval for 
production required for consistency 
with the method of estimation, and: 
C the overt response required for 
consistency with the method of pro- 
duction. If Function 3 represents esti- 
mation and Function 2 production, the 
sequence is A, B,, B,', C. In general, 
br, the slope of the function for repro- 
duction, will be equal to 1 for all values 
of p and the function labeled 1 will be: 
obtained for reproduction no matter 
what the positions of 2 and 3 and irre- 
spective of which one represents esti- 
mation and which one production. 

The expected empirical consequences 
of the foregoing may be summarized 
by the following proposition: If the 
rate of the internal clock varies from 
one S to another but remains constant 
within Ss for the several methods of 
judgment, then over Ss b, and b, must 
be reciprocally related and the between- 
Ss variance for b, can consist only of. 
error variation around a value of 1. 
The between-Ss variance of b, must be 
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less than the between-Ss variances of be 
and b, (variation in the latter reflect- 
ing variation in p among Ss as well as 
error variation) and, based only upon 
error, the between-Ss variability of b, 
must exhibit zero reliability (test-retest 
replication) and no correlation with the 
between-Ss variability of be or bp. 
Since the operations involved are 
different for estimation and production, 
the rate of the internal clock for esti- 
mation, p, may differ from that for 
production, py, and the extent of this 
difference may itself vary among Ss. 
If so, the reciprocal relationship be- 
tween b, and b, must be attenuated, 
the variance of b, increased, and a 
relationship introduced among b, be, 
and b, It follows directly from the 
A, B, By, C or A, B,, Bs’, C sequence 
in Fig. 1 that b, must vary as pe/pp and 
consequently as the product b, X bp 
(since b, — p, and by=1/py). Tf, 
therefore, the relationship bp = 1/b, is 
less precise for one group of Ss than 
another, then the correlation between 
br and b, X bp should be higher and 
the variance and reliability of b. rela- 
tive to those of be and b, should also 
be higher for that group. 
Theoretically if the given time inter- 
val is zero, only zero can be registered 
on any kind of clock regardless of what 
its rate might be for nonzero intervals. 
That is, the intercepts of the functions 
in Fig. 1 must be zero. For empirical 
functions variations in level of response 
(height of the function above the axis 
of the independent variable at any 
designated time value for which re- 
sponses are obtained) may be more 
highly associated with variations in 
intercept values than with variations 
in slope. In particular, a higher cor- 
relation with the intercepts can be ex- 
pected if Ss differ with respect to con- 
stant errors of response but not with 
respect to p. The question of inter- 
cept variation and the relation of other 


measures to it thus becomes important 
if there are systematic errors involved. 

A systematic error would occur if 
S always began his count with 1 in- 
stead of O0 at the instant defining the 
beginning of an interval, and there 
are various possible kinds of anticipa- 
tions and delays which could occur 
in S's manner of responding that 
would introduce constant errors in the 
functions. It seems likely in the ex- 
treme case, where responses were only 
randomly related to the given time 
intervals, that individual levels of re- 
sponse might still vary reliably, Even 
so, this variation could not logically 
be assumed to reflect corresponding 
variations in the rate of an internal 
clock. It is even possible that such 
systematic errors might themselves be 
affected by factors that also affect p, 
complicating the problem analytically ; 
but there is no necessary a priori rea- 
son why this should be so, and hence 
no necessary reason why consistency 
among the methods of time judgment 
should obtain in terms of level-related 
measures. 

The preceding discussion has 
assumed that the functions are linear. 
Although the kinds of consistency ad- 
duced for the linear case could in 
principle be extended to curvilinear 
functions, the appropriate calculations 
and analysis would be different and the 
difficult problem of whether curvi- 
linearity were due to changes in p or 
to changes in systematic errors as a 
function of the given time interval 
would arise. 

The present investigation was di- 
rected toward determining (a) whether 
the functions are linear for the range 
of time intervals from 1 to 10 sec., 
(b) whether b, and bp are reciprocally 
related over this range, (c) whether 
b, varies as a function of b, X bp, and 
(d) whether individual mean responses 
to the time values 1-10 sec. are re- 
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lated to slope or to intercept values. 
In addition to normal Ss, schizo- 
phrenic patients were tested for two 
reasons, First, it was hoped that they 
would manifest greater individual in- 
consistencies between estimation and 
production and hence allow a check on 
the expected higher correlation be- 
tween b, and b, Xb, and on the ex- 
pected relatively greater variance and 
reliability of b». The opposite actually 
occurred in relation to the normal 
group, but this circumstance is just 
as useful for the present purposes. 
Secondly, it was presumed to be un- 
likely that a sufficient number of nor- 
mal Ss would fail to achieve a statis- 
tically significant relationship between 
their responses and the given time in- 
tervals, and schizophrenic Ss offered 
a more appropriate population in which 
to ascertain whether level of response 
is reliable even when the responses are 
of questionable validity as far as time 
judgment is concerned. 


METHOD 


Subjects —Ten male and 10 female junior- 
college students constituted the normal 
group. They were paid for serving as Ss, 
and the median age for each sex was 19,5 
yr. Testing was tried with a heterogeneous 
sample of 60 schizophrenic patients, unse- 
lected with respect to subtype, 6 of whom 
proved to be incapable of performing the 
task. Of the 54 able to do the task, 26 
were males, age range = 17-66 yr., mdn. = 
36.5 yr., and 28 were females, range — 16-52 
yr., mdn. — 35.5 yr. 

Apparatus.—A. response key was connected 
to a circuit permitting the key to operate an 
interval timer or an elapsed timer, depending 
upon the method of time judgment. For 
estimation the response key started the 
interval timer, which turned on a light visible 
to S for the duration of the selected time 
interval. For production the response key 
operated the elapsed timer and light for as 
long as the key was held down. For repro- 
duction the first depression of the key oper- 
ated the interval timer and the second de- 
pression operated the elapsed timer, the light 
coming on at the first depression, going off 
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at the end of the timed interval, coming on 
again at the second depression, and going 
off at the second release of the key. A mis- 
trial signal was conveyed to E if the re- 
sponse key was released before the termi- 
nation of an interval being produced by the 
interval timer or if the key was released and 
depressed again during an interval being 
registered on the elapsed timer. A second 
key was available to E, which could operate 
the interval timer and light in lieu of opera- 
tion by the response key. One selector 
switch determined the kind of operation that 
the response-key would accomplish, and an- 
other selector switch determined the dura- 
tion to be produced by the interval timer, 

Procedure.—Before each trial S was told 
whether that trial was to be one of estima- 
tion, production, or reproduction, and the 
value of the interval if production. For 
normal Ss, on an estimation trial S pressed 
the key, turning the light on, and held the 
key down until the light went off. He then 
stated how long he thought the light had 
been on. On a production trial S held the 
key down, maintaining the light on, for as 
long as he judged to be equivalent to the 
stated interval. On a reproduction trial, 5 
held the key down until the light went off 
and released the key, as on an estimation 
trial, and then pressed the key again, as on a 
production trial but attempting to equate 
the second duration of light on with the 
first. 

The essential features of this procedure 
were: (a) The S determined the exact time 
at which each trial began and the delay be- 
tween the first and second parts of each 
reproduction trial. (b) The mechanics of 
the procedure were as similar as possible for 
the three methods of time judgment. 

This procedure would, however, be un- 
duly difficult to get across to many schizo- 
phrenics, and a simpler, more conventional 
procedure was used with the schizophrenic 
Ss. On estimation or reproduction trials E 
started the interval timer and turned the 
light on for the presented durations, the 
procedure otherwise being the same as for 
normal Ss. Since the schizophrenics’ re- 
sponses were more consistent in spite of the 
greater operational differences among meth- 
ods, it was not deemed worthwhile for the 
present purposes to determine whether re- 
sults for normal Ss are significantly affected 
by these details of procedure or not. 

Instructions.—The task was presented to 
Ss in terms of judging how long the light 
was on (estimation), keeping the light on 


VARIATIONS IN TIME JUDGMENT AND THE INTERNAL CLOCK 635 


for the designated number of seconds (pro- 
duction), and keeping the light on during the 
second part of the trial for the same amount 
of time as it was on during the first part 
of the trial (reproduction). All Ss were 
told to go about the task in whatever way 
they felt to be the simplest, easiest, and most 
natural In particular, Ss were never told 
either to count or not to count, and any 
explicit suggestion that S should make esti- 
mates on reproduction trials or otherwise try 
to achieve consistency from one trial or 
method to another was deliberately avoided. 

Schedule.—Each session was made up of 
30 trials plus repetitions of any trials on 
which mistrials occurred. The integer time 
values, 1-10 sec., were presented randomly 
once each for each method within the restric- 
tion that neither the same method nor the 
same time value was allowed to occur more 
than twice on successive trials. The cri- 
terion for mistrial was failure to respond or 
improper operation of the response key, and 
when either of these conditions occurred the 
trial was repeated. All Ss were tested on 
two sessions with at least 1 day and usually 
several days intervening. 


RESULTS 


A regression analysis was performed 
on the 10 time responses for each S 
for each method of judgment and each 
session. The sum of squares for linear 
regression was tested against the mean 
square for deviations from linear re- 
gression (df = 1, 8), and the difference 
between the sums of squares for linear 
and quadratic regression (indicating 
curvilinearity) was tested against the 
mean square for deviations from quad- 
ratic regression (df — 1, 7). The cri- 
terion used for significance of regres- 
sion was p < .05 for either test for all 
three methods on both sessions. All of 
the normal Ss met this criterion easily. 
Thirty of the. schizophrenic Ss (15 
males, mdn. age = 37 yr., 15 females, 
mdn. age = 36 yr.) met the criterion 
and are referred to as the schizophrenic 
group with significant regression. 
Twenty-four Ss (11 males, mdn. age 
—36 yr, 13 females, mdn. age — 
35 yr.) failed to meet the criterion and 


are referred to as the schizophrenic 
group with nonsignificant regression. 
in the latter group 16 Ss failed on 
estimation and 16 on production, 10 of 
these Ss failing on both. Eight Ss 
failed on reproduction, 6 of whom 
failed also on estimation or production 
or both. 

It was originally intended to classify 
Ss according to the significance or non- 
significance of curvilinearity in the 
functions. It became apparent, how- 
ever, that significant curvilinearity 
occurred at about the chance expected 
frequency (596 of the instances at the 
.05 level). Only one normal S and 
two schizophrenic Ss showed signif- 
icant curvilinearity (p < .05) in the 
same direction for both sessions for 
any method, and the linear regressions 
were highly significant in these three 
instances (p < .001). The intent to 
form a group of Ss with curvilinear 
functions was therefore abandoned, 
and the distributions of linear slope 
scores were simply inspected to see 
whether these three Ss were extreme. 
Their scores were well within the dis- 
tributions and hence could not unduly 
affect the results. 

Overall regression analyses were 
then performed on the time scores 
separately for each method and sepa- 
rately for the normal Ss and schizo- 
phrenics with significant regression, 
evaluating the linear regression and 
quadratic, cubic, and higher-order re- 
sidual deviations from linear regres- 
sion, using in each case the correspond- 
ing component of the pooled (male, 
female) Ss X Time Values interaction 
as the error term (df=18 for the 
normal Ss, 28 for the schizophrenics). 
The linear regressions were all highly 
significant but none of the deviations 
from linear regression reached signif- 
icance. There was therefore no con- 
traindication to use of the linear inter- 
cept and slope coefficients as adequate 
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TABLE 1 
AVERAGE INTERCEPT AND SLOPE VALUES 
Intercept (sec.) Arctan Slope (deg.) 
Time nd Subject Group* 
Normal Schizophrenic Normal Schizophrenic 

Estimation 

M 35 AT 40.10 49.92 

SD 43 NI 4.85 10.94 

ru 23 54 51 16 

t 3.67** 9:208 4.52** 2.46* 
Production 

M —.36 —.01 48.26 37.10 

SD 40 .64 4.29 11.45 

nu 17 .63 .32 .82 

te 4,05** 12 3.40** 3.78** | 
Reproduction | 

M —.22 .07 43.28 41.02 | 

SD 33 51 2.67 4.39 | 

n 29 42 40 .59 

t 3.03%* 18 2.88** 4,97** 
Estimation X 

Production? 
m Am 43.31 41.45 

SD = = 3.62 6.50 

n Ed = 435 .32 

1 = — 2.09* 2.98** 


a 20 normal Ss, 30 schizophrenic Ss with individually significant regressions. 
» Correlation between like measures for Sessions 1 and 2. 
* Associated with difference from zero for intercept, difference from 45° for arctan slope. 
: pes product of nontransformed slopes for estimation and production, 
bs .05. 


p< 01. 


representations of the Response X 
Time Functions. Since slope coeffi- 
cients can vary from zero to infinity, 
however, the population distributions 
must be presumed to be skewed, so the 
individual slope coefficients were trans- 
formed to their corresponding arctan 
equivalents. The arctan values are 
simply the slopes expressed in degrees, 
with 45? equal to a slope of 1 and 
the degree values distributed more 
nearly normally around the mean. 
The arctan slope scores were next 
used in a separate analysis of variance 
for each of the two groups of Ss, 
evaluating sex and session differences. 
The deviations of the slopes from 45° 
for estimation and production were 
greater for the schizophrenic females 
but in the same directions as those 


for the schizophrenic males. This re- 
sult appeared in a significant Sex X 
Methods interaction tested against the 
Pooled Ss x Methods mean square, 
F (2, 56) = 5.18, p<.01. For the 
normal Ss the deviations from 45? for 
estimation and production were greater 
for Session 2 but in the same directions 
as those for Session 1. This effect pro- 
duced a significant Sessions x Meth- 
ods interaction tested against the 
Pooled Ss x Sessions x Methods 
mean square, F (2, 36) = 475, p< 
.025. In each group the main effect of 
Methods was highly significant, F (2, 
36) = 16.6, p < .001, for normals and, 
F (2, 56) = 11:8, p < .001, for schizo- 
phrenics. The corresponding response- 
time values for Sessions 1 and 2 were 
then averaged together and the linear 
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intercept and arctan slopes recomputed 
on the averages for each S and method. 

The overall results are summarized 
in Table 1. In terms of the slopes 
and rates corresponding to these aver- 
ages, the internal clock for the schizo- 
phrenics ran faster than real time, more 
so for production (p — 1.322) than for 
estimation (p = 1.188). The estima- 
tion/production ratio of these two rates 
(.899) and the slope corresponding to 
the Estimation X Production average 
(.883) agreed closely with the slope for 
reproduction (.870). The internal 
clock for the normals ran slower than 
real time but also faster for production 
(p = 892) than for estimation (p= 
.842). The ratio of these rates (.944) 
and the slope corresponding to the 
Estimation X Production average 
(.943) were essentially identical with 
the slope for reproduction (.942). 

The relationship between individual 
arctan slopes for estimation and pro- 
duction was linear for both groups, and 
the product-moment correlation was 
— 70 for normals and —.87 for schizo- 
phrenics (5 «.001 in both cases). 
The criterion for consistency between 
judgments for estimation and produc- 
tion was specifically asserted to be a 
reciprocal relationship between the 
nontransformed slopes, however, and 
nonsignificant departure from linearity 
between the arctan values does not 
necessarily imply a significant recipro- 
cal relationship between the slopes. 
The nontransformed slopes for the 
schizophrenics are shown in Fig. 2. 
For this set of points the difference 
between the sums of squares for linear 
regression and hyperbolic regression, 
best-fitting function of the form y= 
a+b/(x—c), was significantly 
greater than the residual mean square, 
F (1, 27) = 28.3, p < .001. The non- 
transformed slopes for the normal Ss 
did not depart significantly from a 
linear relationship. 
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ESTIMATION (SLOPE) 


Fic. 2. Relationship between slope co- 
efficients for estimation and production for 
schizophrenic Ss with individually significant 
regressions of response upon the given time 
values. (Each point represents a different 
S, N = 30.) 


The less closely reciprocal relation- 
ship for the normal Ss therefore im- 
plies the expectation of a higher cor- 
relation for that group between the 
slope for reproduction and the product 
of the slopes for estimation and pro- 
duction. The correlation between the 
arctan slope for reproduction and the 
arctan of the product of the slopes for 
estimation and production was .56 
(p= .01) for normals and 34 (p< 
.10) for schizophrenics. Expectation 
of greater variance and reliability (cor- 
relation between like measures for Ses- 
sions 1 and 2) for reproduction rela- 
tive to the variances and reliabilities 
for estimation and production is also 
implied for normal Ss. The ratio 
of reproduction variance to the average 
of the estimation and production vari- 
ances was .34 for the normals and .15 
for the schizophrenics. The ratio of 
the reliability coefficient for reproduc- 
tion to the average of the reliability co- 
efficients for estimation and production 
(averaged via Fisher’s z transforma- 
tion) was .95 for the normals and .75 
for the schizophrenics. The variances 
and reliability coefficients for Estima- 
tion X Production relative to those for 
estimation and production (Table 1) 
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TABLE 2 


RELIABILITY oF INDIVIDUAL MEAN TIME RESPONSES AND THEIR 
CORRELATION WITH INTERCEPT AND SLOPE VALUES 


Correlation^ with 
Reliability 
Intercept Arctan slope 
Subject Group 
Method of Time Judgment: 
E P R E P R E P R 

Normal (N = 20) -70 | .58 | .44 | .43 -36| .03 | .86 | .92 | .74 
Schizophrenics with Significant 

Regression (V = 30) 86 | .80 | .82 | 48 | —.07| 49 | .91 | .92 | 76 
Schizophrenics with Non- 

significant Regression 

(N = 249 90 | .79 | .87 | .75 «82| .91 | (31) | (.77) | (12) 
E Note.—p = .01 at r = .56, .46, .51, respectively, for N = 20, 30, 24. 3 


* Correlation 
b Individual 
* Estimation 
4 Individuals 
values (p > .05). 


were also consistent with these ex- 
pectations and results for reproduction. 

Although the absolute variances and 
reliabilities for the schizophrenics were 
greater than those for the normals, a 
larger proportion of the between-Ss 
variation in estimation was consistent 
with variation in production and, cor- 
respondingly, the proportion of the 
variation associated with reproduction 
was smaller and relatively less replic- 
able. No basis for expecting a correla- 
tion between reproduction and either 
estimation or production taken sepa- 
rately was derived, the only necessary 
requirement being that none could exist 
if judgments for estimation and pro- 
duction were perfectly consistent with 
each other. These correlations (be- 
tween the arctan slopes) were not sig- 
nificant, with the exception of the cor- 
relation between reproduction and esti- 
mation for the normal Ss, r= 45 
(p < .05). 

The means of the 10 time responses 
for each S were not less reliable for 
Ss with nonsignificant regression than 
for other Ss and were clearly cor- 


between individual mean time responses for Sessions 1 and 2. 

measures averaged for Sessions 1 and 2. 

(E), production (P), reproduction (R). 

manifesting one or more instances of nonsignificant regression of response upon the given time 


related with the intercept values, as 
they must be if individual differences 
in level of response are reliable in the 
absence of a consistent relationship be- 
tween response and the given time 
values (Table 2). The significant cor- 
relation between individual mean re- 
sponse and slope for production (.77) 
is not theoretically meaningful, since 
if the selection criterion for nonsignif- 
icance of regression were perfect this 
coefficient could not be significant on 
other than a chance basis. For the 
normal Ss variations in individual 
mean response were related more 
closely to variations in slope than to 
variations in intercept, and the schizo- 
phrenic Ss with significant regression 
resembled the normal Ss in this respect 
more than they did the schizophrenic - 
Ss with nonsignificant regression. 


Discussion 


Conformity of the results to a simple 
linear clock model was remarkably good, 
and it is pertinent to consider whether this 
apparent simplicity might be artifactual. 
The most likely source of spurious inter- 
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action among the methods of judgment 
would seem to lie in the possibility that 
S's appraisal of the intervals presented 
for estimation might be extraneously in- 
fluenced by the statement of time values 
for production. If, e.g., an S were dis- 
posed to estimate some intervals as being 
15 or 20 sec. but never heard an inter- 
val greater than 10 sec. stated for 
production, he might assume that he 
should be counting at a slower rate for 
all methods. With practice the task might 
reduce to one in which Ss attempted to 
identify the estimation intervals in a rank- 
order sense and to adjust their counting 
rates to agree with the rank values. 

Such an approach would imply an in- 
crease in absolute accuracy and a de- 
crease in between-Ss variance with prac- 
tice. The variances for the normal Ss 
did decrease somewhat from Session 1 
to Session 2, but on the average these Ss 
counted too slowly on Session 1 (slope 
<1 for estimation, 2 1 for production) 
and tended to count even more slowly on 
Session 2. This result also suggests that 
Ss did not practice with a real clock be- 
tween sessions, or, if they did, the prac- 
tice did not serve to improve average per- 
formance. The authors feel nevertheless 
that the clock model cannot be general- 
ized beyond this particular experimental 
situation until questions of this kind are 
explored more fully. 

The level of response at any single time 
value may be a useful measure in studies 
of time judgment with normal young 
adults, since mean response is apparently 
correlated more highly with slope than 
with intercept values for these Ss. The 
results with the schizophrenic Ss indicate, 
however, that level of response cannot 
simply be assumed to be a valid measure 
of time judgment. Even with normal Ss, 
the assumption may be misleading. The 
average intercept for normals was posi- 
tive for estimation and negative for pro- 
duction (Table 1); since slope was <1 
for estimation and >1 for production, 
both functions crossed the line of absolute 
accuracy (intercept = 0, slope = 1). The 
crossing point may seem to have some 
special significance as the time interval 


over which experienced time is equal to 
real time or as the point below which 
time is overestimated and above which 
time is underestimated. The basic prob- 
lem has been elaborated extensively in 
connection with the concept of the “in- 
difference interval" (Woodrow, 1951, pp. 
1225-1227). But in terms of a simple 
linear clock model, the crossing point 
exists by virtue of a nonzero intercept 
due largely to systematic errors of mea- 
surement. In order to encompass the “in- 
difference interval" as an intrinsic aspect 
of time perception some additional hypo- 
thetical mechanism must be postulated, as 
has been done by Treisman (1963). 

In general, since the intercepts of the 
functions differ from zero in more than 
chance fashion, it is not possible to inter- 
pret the mean of a set of responses to a 
single time value or to a set of time 
values as an indicator of internal clock 
rate. There may, of course, be a true 
nonzero intercept in the sense that there 
exists a finite duration too short to be 
registered by the human O. ‘The order of 
magnitude of that duration, however, ap- 
pears to be negligible in relation to the 
means and variabilities of the intercept 
values in the present data (Robinson, 
1967). 

It has not been the authors' purpose to 
investigate differences in time judgment 
that may have implications for under- 
standing the nature of schizophrenia. Any 
suggestions in that regard would have to 
be tested with normal Ss who were more 
comparable to the schizophrenics in age 
and other factors, Nonetheless three 
points are worth noting. First, the re- 
versal in slope values, as compared to the 
normals, is consistent with previous re- 
ports of time overestimation (faster in- 
ternal clock rate) for schizophrenics 
(Goldstone & Goldfarb, 1962; Wallace & 
Rabin, 1960, pp. 224-225). In the data 
this result was more pronounced for the 
female schizophrenics than for the males. 
Second, the lesser sensitivity of the 
schizophrenics to differences in method 
(greater consistency between estimation 
and production) suggests that compari- 
sons with normals may depend upon 
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method, not because the schizophrenics 
tend to respond differentially but because 
the normals may. Third, it is important 
to examine separately the results for Ss 
whose responses vary systematically with 
variation in presented time intervals from 
the results for those whose responses do 
not. Probably no single generalization 
could meaningfully apply to both groups. 
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DELAYED RECOGNITION AND THE SERIAL 
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ed to measure the recognition of 


7-digit strings after delays of 4, 8, or 12 sec. Operating characteristics 


showed that retention was poorer at 


longer delays and for items from 


the middle of the string. Recognition rates (d) appeared to drop most 
rapidly for terminal items as delay increased, in agreement with 


analogous results obtained with recall procedures. 


However, the 


effects of recency were always greater than those of primacy and the 
form of the serial-position curve was essentially unchanged by 
increasing delay, in contrast to analogous results obtained with recall 


procedures. 


Studies of short-term memory, es- 
pecially those concerned with the serial 
organization of memory, have relied 
heavily on recall as a measure of reten- 
tion. Such studies have shown that 
the nature of serial organization is in- 
fluenced by a variety of variables (see 
Posner, 1963; Postman, 1964), includ- 
ing the order in which items are 
emitted in recall (Deese, 1957; Mur- 
dock, 1963; Tulving & Arbuckle, 1963, 
1966). ^ Traditional recall measures 
inevitably raise the problem, therefore, 
of removing, or assessing, the influence 
of order of recall on S’s output from 
memory and on the distributions of 


1'The research reported in this paper was 
sponsored by the Aerospace Medical Re- 
search Laboratories, Aerospace Medical 
Division, Air Force Systems Command, 
Wright-Patterson Air Force Base, Ohio, 
under Contract No. AF 33(615)-2224 with 
Miami University. This paper is identified 
as AMRL Technical Report No. AMRL- 
TR-67-182. Further reproduction is author- 
ized to satisfy needs of the United States 
Government. The helpful criticisms made 
by the reviewer of this paper and by D. J. 
Weintraub are gratefully acknowledged. 
Gary A. Schneider collected the data and 
assisted in their analysis. 
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memory-trace strengths which pre- 
sumably underlie that output. Fur- 
ther, recall scores presumably are in- 
dicative of S’s strongest responses; 
since the decision criteria imposed by 
S on the emission of responses is un- 
known, it may be argued that recall 
may provide little information about 
the strengths of the memory traces in 
question (Murdock, 1966). 
Recognition-memory procedures 
which afford a measure of trace 
strength unbiased by a variable de- 
cision criterion are now available (e.g., 
Egan, 1958). Such procedures can 
permit the testing of individual items 
and thus avoid in large measure the 
problems associated with the overt re- 
call of a series of items. For these 
reasons, the present study used a 
probe-stimulus technique to address 
the serial organization of short-term 
recognition memory (STRM). 
Earlier studies, which have em- 
ployed a constant delay, have shown 
the potent effects of recency on 
STRM; however, the evidence for 
the effects of primacy is more equivocal 
(cf. Corballis, 1967; Norman & Wic- 
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kelgren, 1965; Wickelgren & Norman, 
1966). The present study differs 
from other studies of STRM in that it 
is concerned primarily with the effects 
of delay on the serial-position effect 
and asks whether delay affects in the 
same way the recognition of items 
which occupied different serial posi- 
tions in presentation. It is already 
known that delay affects primarily the 
short-term recall of terminal items of a 
list (Glanzer & Cunitz, 1966; Jahnke, 
1968; Postman & Phillips, 1965), and 
it is of interest to know if such is also 
the case for short-term recognition. 


METHOD 


Subjects.—The Ss were 30 male Miami 
University students who were paid for their 
participation. None of the Ss had previously 
served in an experiment on recognition 
memory. 

Apparatus and materials—The stimulus 
materials were derived from a total of 140 
seven-digit strings. Strings were drawn from 
a table of random numbers with the restric- 
tion that no digit was repeated within any 
string. The strings were constructed and 
ordered such that in successive strings no 
particular digit occupied the same serial posi- 
tion and such that the last digit of a string 
was not the same as the first digit of an 
immediately subsequent string. No ascend- 
ing or alternating ascending sequences, such 
as 2, 4, 6,..., greater than Length 2 
were allowed. By error there were two 
descending sequences and a single descending 
alternating sequence of Length 3. The in- 
terpolated materials were randomly ordered 
permutations of the first four letters of 
the English alphabet. Both the stimulus 
and interpolated materials were presented 
visually and sequentially at a rate of 4 sec/ 
item on an Industrial Electronics Engineers, 
Inc., Model 089 Bina-View operated by a 
Friden Model S-2 paper tape reader. A .01- 
sec. Standard Electric clock was used to 
record response latencies. 

Procedure.—On each trial S was pre- 
sented with a string of seven digits and was 
then occupied for a period of either 4, 8, or 
12 sec. with the interpolated task (the read- 
ing of 2, 4, or 6 quadrigrams). A probe 
digit was then presented, and S indicated 
whether or not the probe digit had appeared 
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in the original set of digits and rated his 
confidence in his judgment on a 5-point 
scale. The digits and letters were pre- 
sented in an unbroken series, and all items 
but the probe digit were read out loud as 
they were presented. The Ss were instructed. 
to make their decisions as rapidly as possible, 
consistent with what they thought was a 
good degree of correctness. Approxi- 
mately 8 sec. were given S on each trial to 
indicate a decision and confidence rating; a 
S-min. rest followed Trial 70. After each 
trial, Ss were informed whether or not their 
decision had been correct. 

Half the probe digits were "old," that is, 
were digits which had appeared in the 
original set; half were "new." When the 
probe digits were old, they were equally 
often those occupying each of the seven 
serial positions in the original string. There 
were, therefore, 10 tests at each of the 
seven serial positions for cach S. Each 
serial position was tested once in each block 
of 14 strings; the remaining tests in that 
block were made with probe stimuli which 
were new. Each of the 10 decimal digits 
were employed as probe stimuli about equally 
often. Response latencies were recorded 
to the nearest .01 sec. from the timer, which 
was activated 720 msec. after the pulse 
which displayed the probe digit and was 
stopped when S depressed either of two ap- 
propriately labeled (“Yes” and “No”) tele- 
graph keys. The left-right position of the 
two keys was counterbalanced in an abba 
order. The Ss were assigned to only a 
single condition of delay; Ss were assigni 
to these conditions at random, except that 
one of each set of three successive Ss was 
assigned to a different delay condition. 

Data analysis—A response placed in con- 
fidence category “yes-5” of the present study 
represents O’s application of the strictest 
criterion, “yes-4” the next strictest, etc. 
With the 10 ratings of the present study, 
there were nine decision criteria rang- 
ing from strict to lax. Plots which show 
the cumulative probability of the assignment 
of a response to Category j or stricter, 
given that the probe digit was old (So), as 
a function of the cumulative probability of 
assignment to Category j or stricter, given 
that the probe digit was new (Sz), define the 
operating characteristic; in contexts such as 
the present, these plots have been called 
memory-operating characteristics (Norman 
& Wickelgren, 1965). In the present study, 
memory-operating characteristics (Moc) 
were obtained for each condition separately 
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from the data summed across all Ss in that 
condition, 


RESULTS 


Correct and false recognition rates. 
—The proportions of correct recogni- 
tions when the probe digit was old 
were .693, .581, and .551 for Cond. 
4, 8, and 12, respectively. Correspond- 
ing proportions of total correct recog- 
nitions (accepting an old stimulus cor- 
rectly and rejecting a new stimulus 
correctly) were .757, .703, and .669. 
False-alarm rates (incorrect accept- 
ance of a new stimulus) were .178, 
.175, and .213 for Cond. 4, 8, and 12, 
respectively. 

Serial-position analyses based on 
recognition data when the probe digit 
was old are presented separately in 
Fig. 1 for each delay condition. The 
effects of both primacy and recency 
may be noted in Fig. 1; recency ap- 
pears to be rather stronger than pri- 
macy.. In general, performance for 
items at Serial Positions (SP) 2-5 
and, possibly, 2-6, appears to be much 
the same, Mixed-factorial analyses 


Serial-position curves for each experimental condition. 


of variance of the data on which Fig. 
1 is based showed that the main effects 
of delay and SP were significant, F 
(2, 27) = 4.51 and F (6, 162) = 6.38, 
respectively, p < .01 in both cases, but 
that their interaction was not, F < 1. 
A Newman-Keuls analysis showed 
that performance at SP 7 was signifi- 
cantly better (p< .01) than that at 
any other position but SP 1, where 
the difference approached significance 
at the .05 level, Performance at SP 1 
was significantly better than perform- 
ance at SP 4 and 6, p < .05 and p< 
.01, respectively, while performance at 
SP 2-6 was much the same, p > .05 in 
all comparisons. For these reasons, 
data for SP 2-6 were combined for 
subsequent analysis and were treated 
separately from those for SP 1 and SP 
7, which were also treated separately. 

Memory-operating characteristics.— 
The operating characteristics for SP 1, 
2-6, and 7 are shown separately in the 
three panels of Fig. 2. Each panel 
presents data for one, of the SP. condi- 
tions; within each panel delay is the 
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Fic. 2. Memory-operating characteristics 
for the experimental conditions. 


parameter. The data are presented on 
linear coordinates in Fig. 2, and the 
smooth curves were fitted by inspec- 
tion. As can be seen in Fig. 2, detec- 
tion rates were superior at the shorter 
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delays. Also, performance was best 


for items at SP 7, next best for those _ 


at SP 1, and poorest for those at SP 
2-6. Plots on normal-normal proba- 
bility paper of these data were fitted 
relatively well by straight lines of near 
unit slope; curve fitting was again by 
inspection. Deviations from unit slope 
were not large and were always in the 
direction of slope<1. Following 
Wickelgren and Norman (1966), val- 
ues of d, a measure of signal detectabil- 
ity, were obtained for each condition 
by estimating the coordinates of the in- 
tersection of the straight lines with the 
negative diagonal. 
estimation has the advantage of mini- 
mizing the effect of deviations of the 
slope of the operating characteristic 
from unity. These values of d were 
1.41, 1.35, and .94 for the data, given 
in order of increasing delay, for SP 1. 
Corresponding values were 1.16, .94, 
and .77 for SP 2-6 and 1.76, 1.54, 
and 1.11 for SP 7. These data show 
clearly that d is a function both of de- 
lay and SP. There is some suggestion 
that d falls more quickly as a function 
of delay for SP 7 than for SP 1. In 
turn, d may fall more quickly with de- 
lay for SP 1 than for SP 2-6. 

A posteriori probabilities, the prob- 
abilities with which a probe digit was 
called old when S’s response was as- 
signed to a particular confidence cate- 
gory, are presented separately for the 
delay conditions in Table 1. For con- 


TABLE 1 


A PosTERIORI PROBABILITIES FOR THE 
Conpitions or DELAY 


Rating 
Delay 
44,45] +2.43] +1,-1] 22,73] -—4,-5 
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8 | 89 | 69 | 41 | .38 | .26 
12 84 | 64 | 44 40 | .30 


i 


This method of ` 


SERIAL ORGANIZATION 


IN SHORT-TERM MEMORY 645 


TABLE 2 
Means or MEDIAN RECIPROCAL LATENCIES FOR ALL CORRECT RECOGNITIONS 


OLD NEW 
Delay 
SP1 SP2 SP3 SP4 SP5 SP6 SP7 = 
4 .691 .670 .670 .607 .641 .672 749 .620 
8 492 490 468 557 -576 468 544 529 
12 :523 .583 .531 .592 531 .601 557 542 


venience in presentation, data were 
grouped both across serial position and 
successive pairs of confidence cate- 
gories. The data of Table 1 indicate 
that a decreasing linear function relates 
a posteriori probability and response 
category for each condition; the slopes 
of the functions appear to be quite simi- 
lar. The trends shown in Table 1 were 
also obtained with the ungrouped data. 
Both Fig. 1 and Table 1 show that Ss 
were apparently able to use effectively 
the various decision criteria. When Ss 
were quite positive that stimuli were 
old, they were very likely to be correct ; 
when Ss were less positive, they were 
less often correct. It appears that all 
confidence categories convey informa- 
tion; that is to say, there appears to be 
no threshold below which Ss are un- 
able to assign their responses with ac- 
curacy to the categories. 
Latencies—Means of median recip- 
rocal response latencies were computed 
separately for each condition and sepa- 
rately for the distributions of “hits” and 
"correct rejections.” These data ap- 
pear in Table 2. Inspection of Table 2 
suggests that latencies for hits were 
shorter for the 4-sec. delay condition 
than for the 8- and 12-sec. conditions. 
For the 4-sec. delay condition, there ap- 
pears to be a bowed serial-position ef- 
fect for latencies which parallels that 
for correct recognitions in Fig. 1. Re- 
sponse speeds were fastest for items in 
SP 7, next fastest for items in SP 1, 
and slowest for items in SP 4. How- 
ever, this trend is not apparent in the 


data for the 8- and 12-sec. conditions, 
and a mixed-factorial analysis of vari- 
ance of the overall data from which the 
means for hits were derived showed 
that the effects of delay, SP, and their 
interaction were not significant, F (2, 
27) = 2.09, p > .05; F (6, 162) <1; 
F (12, 162) = 1.75, p > .05, respec- 
tively. Table 2 shows that latencies 
were shorter for hits than correct rejec- 
tions at the 4-sec. delay for all SPs but 
SP 4. For the 8- and 12-sec. delays, 
respectively, latencies were shorter for 
hits than correct rejections for three of 
seven and four of seven SPs, but no 
particular relation between SP and 
speed was obvious. 

Latencies were also obtained for 
"misses" (incorrect rejections of old 
stimuli) and false alarms (incorrect 
acceptances of new stimuli). For the 
4-sec. delay condition, latencies for 
misses were always greater than those 
for hits. Sign tests showed that these 
differences were large, p < .07, at all 
SPs but 3, 4, and 5, for which p > .11. 
For the 8- and 12-sec. delays, at no 
SP were the latencies for misses sig- 
nificantly different at the .05 level from 
those for hits. Correct rejections were 
faster than false alarms, but not sig- 
nificantly more so, p > .05, by separate 
sign tests of the data for the 4-, 8-, and 
12-sec. conditions. 


Discussion 


The MOC analyses and the analysis 
of a posteriori probabilities support the 
notion that memory-trace strength is a 
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smooth, continuously graded function at 
all delays. There was no evidence for a 
single, fixed threshold below which O's 
responses carried no information about the 
stimulus, although the data presented here 
do not contradict the possibility that there 
are multiple (fixed) thresholds. 

In the present study, recognition (d) 
was a function both of delay and serial 
position. Although d appeared to de- 
crease most rapidly with delay for items 
at SP 7, performance was best at that 
position, and the form of the serial-posi- 
tion curve was largely unchanged by de- 
lay. These results are not wholly con- 
sistent with those of studies of short-term 
recall. Glanzer and Cunitz (1966), 
Jahnke (1968), and Postman and Phillips 
(1965) have shown that the result of in- 
creasing retention interval was to weaken 
the effect of recency and to leave the 
effect of primacy largely unchanged. The 
apparent rapid decline with delay in the 
strengths of traces for terminal items and 
the apparent stability of trace strengths 
for the remaining items, as evidenced by 
their probabilities of recall, has been at 
least partly responsible for the suggestion 
that there are two identifiably different 
components of memory—a short-term, or 
primary, and a long-term, or secondary, 
memory (eg. Glanzer & Cunitz, 1966). 
However, present results indicate that the 
strengths of all items declined with delay 
and that the strength of the final item 
remained superior to that of any other 
item, even after relatively long delays 
(and relatively large numbers of inter- 
polated items). It is true that recognition 
procedures were used here and that the 
dual-component model of memory is prop- 
erly a model of recall. It is also true that 
the effect of recency was restricted to but 
the final item. Nonetheless, present re- 
sults may prove not easily accommodated 
by a dual-component model such as con- 
sidered here. 

Finally, in the theory of signal de- 
tectability, d decreases with decreases in 
the signal-to-noise (S/N) ratio. Pres- 
ent data, therefore, suggest an identifi- 
cation of forgetting with a decline in 
the S/N ratio, an identification which 
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was made earlier by Brown (1959), 
Such a suggestion implies that forgetting 
is not so much a function of the trace 
strength of the target (to-be-remem- 
bered) item as it is the strength of the 
target item relative to those of other 
items from which it must be differentiated, 
It then follows that any variable which — 
improves retention does so by raising | 
the strength of the target item relative 
to those of "competing" items and/or by ' 
lowering the strengths of competing items 
relative to that of the target. Such a 
conceptualization suggests that any vari- 
able affecting retention does so by affect- 
ing the S/N ratio. Such a conceptualiza- 
tion appears to have much in common 
with a number of proposals, including 
that by Underwood and Ekstrand (1967) 
which states that proactive interference 
results from S's inability to differentiate. 
target items from prior items. 
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EFFECT OF A SIMULTANEOUS CONDITIONING PROCEDURE 
UPON SUBSEQUENT EXTINCTION AND ACQUISITION? 
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The possibility that the failure to obtain classical conditioning with 
simultaneous presentations of CS and UCS may be the result of the 
inadvertent use of a partial schedule of reinforcement was tested in 
2 experiments. In one experiment the test occurred in extinction; in 
the other experiment the test was provided in subsequent acquisition. 
The results of the Ist experiment supported this interpretation; the 


results of the 2nd did not. 


Studies of classical conditioning with 
strictly simultaneous presentations of 
CS and UCS have usually employed 
nonreinforced test trials to provide a 
measure of conditioning (Fitzwater & 
Reisman, 1952; Fitzwater & Thrush, 
1956; Wolfle, 1930, 1932). The intro- 
duction of such trials has the unfor- 
tunate consequence of introducing a 
partial-reinforcement schedule into the 
experiment, a procedure that inter- 
feres with the progress of classical con- 
ditioning. It could be, therefore, that 
the failure to obtain conditioning with 
a 0.0-sec. interstimulus interval (ISI) 
is a function of the particular experi- 
mental procedure. The two studies re- 
ported in this paper were designed to 
explore this possibility. In Exp. I the 
effects of simultaneous pairings of CS 
and UCS were examined in extinction ; 
in Exp. II the same effects were 
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studied in subsequent acquisition of the 
CR using a more favorable IST. 


EXPERIMENT I 
Method 


Subjects and design—In partial fulfill- 
ment of course requirements, 50 students 
from an introductory psychology course at 
Duke University served as Ss and were 
randomly assigned to one of two groups. 
One S was discarded for failing to follow 
instructions and another S was discarded 
because of an E error. Thus, there were 24 
Ss in each of the final groups. Group P: 
(pseudoconditioning) contained 16 males and 
8 females; Group Si (simultaneous pairings) 
contained 14 males and 10 females. 

Group P; received 40 presentations of the 
CS and 40 presentations of the UCS in an 
irregular fashion with no more than two 
successive trials to each stimulus and with 
an equal number of CS and UCS presenta- 
tions in each block of 10 trials. Group S: 
received its first 20 trials with the CS and 
UCS paired with simultaneous onsets. Im- 
mediately following these 20 trials, there 
were 20 CS and 20 UCS presentations under 
the pseudoconditioning procedure employed 
from the beginning with Group Pa. Both 
groups received two preliminary CS trials 
and two preliminary trials with UCS im- 
mediately prior to the experimental series. 

Apparatus and procedure.—The CS was a 
1,000-c. tone of 56 db. (SPL) intensity pre- 


*In Exp. I and II no correction was made 
for the lag in the UCS (30-40 msec.). 
Thus the ISI was actually a very slightly 
forward interval. 
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sented for 600 msec. It was generated by a 
Hewlett-Packard audio oscillator (Model 
200CD) and presented through a speaker 
located behind S’s chair. The UCS, an air 
puff sufficient to support 120 mm. Hg, lasted 
for 100 msec. The air puff was delivered to 
S’s right cornea through a nozzle (1 
mm. id.) attached to a headpiece. The on- 
set and duration of the CS and UCS were 
controlled by Grason-Stadler timers (Model 
E1100H). The intertrial interval during 
the paired (simultaneous) trials for Group 
Sı was 15, 25, or 35 sec. with a mean of 
25 sec, and during the unpaired trials for 
both groups it was 7.5, 12.5, and 17.5 sec. 
with a mean of 12.5 sec. These intertrial 
intervals were controlled by a Gerbrands 
film-tape programmer. 

Eyelid responses were recorded following 
the procedure employed at the Duke Univer- 
sity conditioning laboratory. Each S was 
seated in a dental chair in a moderately lit, 
sound proofed IAC chamber. Recorded in- 
structions delivered after E had left the 
chamber were a variation of those used by 
McAllister and McAllister (1958) and were 
designed to establish a neutral set. 


Results and Discussion 


No standards are available for de- 
fining a CR in a situation where CS 
and UCS presentations are simultane- 
ous. The evidence (e.g. Boneau, 
1958) indicates that there is a positive 
correlation between the length of the 
ISI and the latency of the CR. This 
would suggest that the latency of the 
CRs in this experiment might be short. 
For this reason, it seemed appropriate 
to include very short latency responses. 
Preliminary examination of the data 
provided no basis for a more exact 
definition. Accordingly, Es counted 
as a CR any response that occurred 
in the presence of the CS with an 
amplitude greater than 1 mm. on the 
polygraph record. 

Figure 1 presents the mean number 
of such responses for both groups. 
Data were available for Group P, from 
the beginning of the experiment but no 
measures were obtainable for Group 
S, on Trials 1-20 because these were 


ponses 
E 
2 
Cs 


d ge rj Res 

7 

SÉ 

tet 

; 
( 

x 

b 


Meon Percentai 


p 
uo 


mio 
Blocks of Five Trials 

Fic. 1. Mean percentage of responses for 

pseudoconditioning and simultaneous condi- 


tioning groups to nonreinforced CS presenta- 
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paired CS-UCS trials. The data for 
Group P, show some evidence of habi- 
tuation but a ¢ test for related measures 
on Trial Blocks 1 and 4 showed that 
the decrement was not significant. 
The right side of the figure shows re- 
sponses for Groups P, and S, and, as 
is evident, responding was greater for 
Group S, than for Group P,. Assessed 
by analysis of variance, the difference 
is significant, F (1, 46) = 4.48, p< 
.05. This analysis was for the first 
two trial blocks in order to minimize 
the effects of extinction. The effect 
of trials and the interaction were not 
significant. 

On the basis of this conditioning 
procedure, it can be concluded that, 
following simultaneous conditioning 
trials, performance is significantly su- 
perior to that which results from the 
typically employed pseudoconditioning 
control procedure. These results are 
contrary to those found by other in- 
vestigators (e.g., Fitzwater & Thrush, 
1956) who failed to find evidence for 
simultaneous conditioning. The dis- 
crepancy may be attributable to the 
different experimental test procedures. 
It must be noted, however, that the 
absolute magnitude of the effect is 
small. 


EXPERIMENT II 


With the evidence from Exp. I that 
simultaneous conditioning can occur, 
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an attempt was made in this second ex- 
periment to assess the effect of a simul- 
taneous conditioning procedure upon 
subsequent forward-order conditioning. 


Method 


Subjects and design—Forty-four Ss from 
the same population as those used in Exp. I 
served as Ss and were randomly assigned 
to one of two groups. Four Ss were dis- 
carded: two because of apparatus failure; 
one because of a very fast blink rate; and 
one for giving two responses on preliminary 
trials. Thus, there were 20 Ss in Group S; 
receiving simultaneous conditioning trials (13 
males and 7 females) and 20 Ss in Group Pe 
receiving pseudoconditioning trials (13 males 
and 7 females). ` 

Group S; initially received 20 trials during 
which the CS and UCS were presented with 
simultaneous onsets and immediately fol- 
lowing these trials received 30 acquisition 
(CS-UCS) trials with an ISI of 500 msec. 
Group Pz received 20 unpaired presentations 
each of the CS and UCS under the same 
conditions as in Exp. I. Immediately fol- 
lowing these trials, 30 acquisition trials were 
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presented under conditions that were iden- 
tical to those for Group Ss. 

Apparatus and procedure—The intensity, 
duration, and mode of presentation of the 
auditory CS and the air puff were the 
same as in Exp. I, as were the distribution 
of simultaneous and pseudoconditioning - 
trials. In subsequent acquisition the dura- 
tion of the CS continued to be 600 msec, 
but now the last 100 msec. of the interval. 
contained the UCS. : 


Results and. Discussion 


An eyeblink deflection of 1 mm. or 
more in amplitude occurring from 
0-517 msec. following CS onset on the’ 
forward-conditioning trials was counted © 
as a CR for the reasons described in - 
connection with Exp. I. The interval 
employed was from CS onset to 17 
msec. after UCS onset. Figure 2 pre- 
sents the percentage of responses thus 
defined occurring in blocks of five trials 
for Groups P, and S,. These data in- 
dicate that performance improved with 
practice for both groups, and the re- 
sults of a trend test indicated that trials 
was the only significant factor, F (5, 
190) = 14.99, p < .05, although there 
was a Trials X Groups interaction 
which failed to reach an; acceptable sig- 
nificance level, F (5, 190) = 201, p< 
.10. Thus this experiment failed to 
provide any strong evidence for si- 
multaneous conditioning, although the 
interaction of borderline significance 
may hint of such an effect. It is to be - 
noted that, if the simultaneous condi- 
tioning procedure did have the effect 
suggested by this interaction, it occurs 
after several conditioning trials. Al- 
though there was no reason to expect 
such an effect, Es examined responses 
in more limited latency ranges (0-199 
msec. 200-299 msec., and 200-517 
msec.) in order to determine the gen- 
erality of the finding. The same quali- 
tative effect appeared in these three 
analyses but statistical tests again: 
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failed to reach accepted levels of sig- 
nificance. 

The results of Exp. I supported the 
idea that a small amount of condition- 
ing occurs with a O-sec. ISI when the 
effect is tested with a pseudocondition- 
ing extinction procedure. Tests dur- 
ing acquisition in Exp. II revealed no 
such conditioning but suggested the 
possibility that a delayed effect occurs 
later on. 
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EFFECTS OF INTERSTIMULUS INTERVAL AND 
DISCRIMINATION LEARNING IN EYELID 
CONDITIONING USING BETWEEN- 

AND WITHIN-Ss DESIGNS ' 


GREGORY A. KIMBLE, THOMAS B. LEONARD, III,? 
AND LAWRENCE C. PERLMUTER* 
Duke University 


80 human Ss were conditioned, 3 with a between-Ss procedure and 
3 with a within-Ss procedure in an eyelid-conditioning experiment 
where the interstimulus interval (ISI) was either .5 sec. or 1.2 sec. 
The overall level of conditioning was significantly higher in the 
between-Ss groups. In these groups performance was superior at the 
.5-sec interval but not significantly so. In the within-Ss procedure, 
conditioning was significantly inferior at the .5-sec. ISI. This 
lowered level of performance was attributed to the fact that Ss in the 
within-Ss design made a type of discrimination that led to the develop- 
ment of an inhibitory process. Data on generalization and discrimi- 
nation obtained from the same Ss during and following acquisition 


lent detailed support to this interpretation. 


Recent conditioning studies (Beck, 
1963; Cermak, 1967; Grice & Hunter, 
1964; Wickens & Harding, 1965) 
have shown that the effect of manip- 
ulating certain variables is not the same 
for experiments in which the same Ss 
receive all experimental treatments as 
it is for experiments where different 
Ss receive different treatments. For 
example, Grice and Hunter demon- 
strated that the effect of the CS inten- 
sity in eyelid conditioning depends 
upon the type of design employed. 
Using a 2X2 design in which CS 
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intensity was one factor and the num- 
ber of stimulus intensities (one or two) — 
presented to S was the other factor, 
Grice and Hunter found that the effect 
of CS intensity was significant only 
for the double-intensity condition and 
that the effect was five times as large 
in that condition as in the single-inten- 
sity condition. I 
The striking difference obtained by : 
Grice and Hunter suggests that direct 
comparisons of between-Ss and within- — 
Ss designs involving other independent : 
variables should be made. This study 
is such an investigation of the inter- 
stimulus interval (ISI) with a design 
similar to that used by Grice and - 
Hunter. The primary purpose of this 
study was to determine the level of 
conditioning to long and short ISIs for 
Ss who receive only one ISI and Ss 
who receive two ISIs. Secondary pur- 
poses which turned out to have a di- 
rect bearing upon the main question. 
were to study generalization and dis- 


crimination during and following these — 


two types of treatment. 
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METHOD 


Apparatus.—The eyelid conditioning appa- 
ratus at Duke University was used in this 
experiment. The Ss were seated in a dental 
chair inside a well-lighted IAC chamber. 
The CSs were a green and a red light pre- 
sented on a Grason-Stadler multiple-stimu- 
lus projector (Model E4580) mounted ap- 
proximately at eye level 3 ft. in front of S. 
The CS had a duration of either .5 or 1.2 
sec. and was immediately followed by an 
air puff having a duration of .1 sec, The 
air puff was sufficient to support 100 mm. Hg 
and was delivered from approximately .5 in. 
to S's right cornea through a nozzle having 
an inside diameter of 1 mm. The onsets and 
offsets of the CS and the UCS were con- 
trolled by Grason-Stadler timers. The mean 
intertrial interval was 20 sec, with each 
successive set of three trials containing 
randomly ordered intervals of 15, 20, and 
25 sec. The intertrial intervals were con- 
trolled by a Gerbrands film-tape program- 
mer. Eyelid responses were recorded on a 
Grass Model 5 polygraph and were detected 
by the usual false-eyelash microtorque-po- 
tentiometer system. 

Subjects.—Seventy-eight Duke undergrad- 
uates participating in order to fulfill a course 
requirement and 11 high-school students 
participating for pay served as Ss in the 
experiment. Of the total of 89 Ss, 9 were 
discarded for the following reasons: 1 be- 
cause a response was given to each of two 
CS presentations prior to conditioning; 2 
because of E's error; 2 for apparatus fail- 
ure; and 4 because data could not be obtained 
for S on at least half of the trials in a 10- 
trial block, due to the occurrence of short- 
latency responses characterized by com- 
plete closure of the eyelid during the period 
in which a CR might otherwise occur. These 
latter Ss were all in the within-Ss condition. 
The remaining Ss were randomly assigned 
to one of four groups with the restriction 
that the numbers of males and females be 
approximately the same for each group. 
There were 4 or 5 males and 15 or 16 fe- 
males in each group of 20 Ss. 

Procedure.—A ter S was seated in the 
dental chair in the IAC chamber, an adjust- 
able headpiece carrying the microtorque 
potentiometer and the air-puff nozzle was 
mounted on S’s head, and an artificial eyelid 
was taped to S's right upper eyelid. The E 
then left S’s room and played tape-recorded 
instructions designed to induce a neutral set 
toward blinking. Next, each S was given 
two presentations of each of two CSs to be 


used and two presentations of the UCS, 
followed immediately by 80 acquisition and 
60 discrimination trials. 

Design.—During acquisition each group re- 
ceived 80 trials, 40 with a green CS and 
40 with a red CS presented in a random 
order with the restrictions that each color 
of CS be presented five times in each block 
of 10 trials and no more than two successive 
trials be given with the same color CS. 
For one group (Group LL, » —20) the ISI 
associated with both CSs was long (LL). 
The ISI was 1.2 sec. on all reinforced trials 
throughout acquisition and discrimination. 
A second group (Group SS, »=20) was 
conditioned with two short (SS) CSs with 
an ISI of .5 sec. for all acquisition and 
discrimination trials. ^ Forty additional S's 
were conditioned with both long and short 
ISIs. For these Ss one stimulus (red for 
half and green for half) was associated 
with the long ISI; the other with the short 
ISI. For analysis of acquisition data, this 
group was subdivided into two groups. 
Group S and Group L. Acquisition data 
for Group S (n=20) were obtained from 
trials when the ISI was short. For Group 
L (n=20) performance to the stimulus 
associated only with the long ISI was ana- 
lyzed. In forming Groups S and L, Ss 
were assigned randomly with the restriction 
that there be an equal number of Ss in each 
group for which the responses counted were 
to the red or green CS. The Ss in Groups 
SS and LL were also randomly subdivided, 
so that responses to just the red CS were 
analyzed for half the Ss and responses to 
just the green CS were analyzed for the 
other half. By using these procedures, 
four independent groups were created in 
which color of the CS was counterbalanced. 
These groups formed a 2X2 factorial de- 
sign with ISI and within vs. between pro- 
cedures defining the marginals. As Grice 
and Hunter (1964) pointed out, procedures 
of this type are essential to meet the assump- 
tions of the analysis of variance 

Immediately following the 80 acquisition 
trials, 60 discrimination trials were given, 30 
with each CS, in which the UCS was main- 
tained for one CS and omitted for the other 
CS. The order of presentation of CSs and 
their corresponding ISIs were the same in 
acquisition and discrimination. 

Definition of the CR.—For all trials hav- 
ing an ISI of .5 sec., eyelid closures having 
an amplitude of at least 1 mm. on the oscil- 
lograph recording and beginning between 
250 and 517 msec. after the onset of the 
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CS were scored as CRs, For trials hav- 
ing an ISI of L2 sec, any closure of at 


least 1 mm. and beginning between 600 and , 


1,217 msec. aíter CS onset was scored as a 
CR. In short, for the two intervals a CR 
was defined as any response in the last half 
of the ISI plus the first 17 msec. of the 
UCS. Various types of evidence within the 
experiment indicate that these definitions are 
appropriate. In any event, the results to be 
described are not explainable merely in terms 
of these criteria. 


RESULTS 


Conditioned responses during acqui- 
sition.—The major result of the experi- 
ment appears in one form in Fig. 1. 
The Ss who received only one ISI 
performed at a significantly higher gen- 
eral level than did Ss receiving two 
ISIs, F (1,76) = 748, p < 01. Anal- 
ysis of simple effects reveals that this 
difference was significant at the short 
ISI, t (38) = 3.42, p < .002, but not 
at the long ISI value, £ (38) < 1.00. 
The ISI factor itself was not signif- 
icant, F < 1.00; but, as Fig. 1 suggests, 
the interaction between the number of 
ISIs presented to S and ISI was signif- 
icant, F (1, 76) = 4.50, p < .05. For 
groups receiving two ISIs, performance 
was significantly higher for the long 
ISI, t (38) = 208, p< .05, while for 
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the single ISI groups, performance was 
higher for the short ISI, although the 
difference was not significant, t (38) < 
1.00. Approximately equal perform- 
ances at CS-UCS intervals ranging 
from 400 to 1,200 msec. have been a 
common finding in this laboratory 
(Fowler, 1964; Ominsky, 1967), using. 
the between-Ss type of design. 
Figure 2 shows conditioning curves 
for all groups and indicates that the 
inferior level of conditioning with the 
.5-sec. ISI shown by the group of | 
Ss receiving two ISIs was present 
from the first block of conditioning 


trials on. An analysis of variance re- 


vealed that the effect of trial blocks 
was significant, F (3, 228) — 11,59, 
p < .005, as is indicated by the general 
increase in percentage of CRs. In 
addition, a Blocks X ISI interaction 
was significant at the .05 level, F (3, 
228) — 3.55. 

Discrimination.—For purposes of 
studying discrimination, the 40 Ss re- 
ceiving two ISIs were divided into two 
separate groups of 20 Ss each follow- 
ing original acquisition. For Group 
L+S— the long stimulus continued 
reinforced (L+) at a 1.2-sec. ISI 


Mean Percent CR 
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Fic. 2. Mean percentage of conditioned - 
responses during blocks of 10 acquisition 
trials for four treatment conditions. 
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throughout the discrimination trials 
while the short stimulus was not rein- 
forced (S—) during these trials. For 
Group L—S+ the short stimulus con- 
tinued to be reinforced at a .5-sec. 
ISI while the long stimulus was no 
longer reinforced. For the groups that 
had previously experienced only a 
short ISI (S+S—) or a long ISI 
(L+L—), one of these stimuli was no 
longer reinforced during the discrimi- 
nation trials. In all of the groups the 
color of the reinforced stimulus was 
red for half the Ss and green for half. 

The discrimination data appear in 
Fig. 3. The top part of Fig. 3 shows 
the discrimination performance for 
Groups L+L— and S+S—. The four 
curves in this portion of the figure 


represent mean CR performance to the , 


two long and two short CSs and ISIs 
in Groups L--L— and S4 S—. Per- 
formance was essentially similar at the 
end of acquisition and, as is indicated, 
the introduction of discrimination trials 
had the effect of lowering performance 
to both reinforced and nonreinforced 
CSs. Four ¢ tests, £ (19) > 2.12, 
p < 05, in all cases, for related mea- 
sures indicated reliable decrements to 
all CSs from the end of acquisition to 
the first trial block in discrimination. 
This phenomenon regularly occurs 
with a shift from acquisition to dis- 
crimination and is usually attributed to 
the accumulation of inhibition to CS— 
(as a result of nonreinforcement) and 
its generalization to CS+ (Gynther, 
1957). 

To evaluate differential performance 
during the discrimination trials, dif- 
ference scores were used to compare 
performance on the last 10 trials of 
acquisition with performance on the 
last 10 discrimination trials. For 
Group L+L— the difference between 
these differences was significant, t (19) 
— 808, p< .005, revealing the de- 
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Fic. 3. Mean percentage of responses dur- 
ing the last block of acquisition trials and 
three blocks of discrimination trials for re- 
inforced (R) and nonreinforced (NR) CSs 
of Groups L--L— and S+S— (top) and 
Groups L+S— and L—S+ (bottom). 


velopment of discrimination. A simi- 
lar analysis for Group S+S— failed to 
show reliable discrimination, Analyses 
involving only the last block of 10 dis- 
crimination trials tell the same story. 
There was a significantly higher level 
of performance to the reinforced stim- 
ulus than to the nonreinforced stim- 
ulus in Group L+L—, t (19) = 246, 
p < 025, but not in Group S+S— (t 
« 10). In terms of either measure, 
these data confirm the results of Hart- 
man and Grant (1962) that discrimina- 
tive eyelid conditioning is established 
more readily with relatively long ISIs 
than with a short ISI. 

Performance at the end of acquisi- 
tion and during the discrimination 
trials is shown in the bottom part of 
Fig. 3 for Groups L+S— and L-S+. 
At the end of acquisition these groups 
show different levels of responding to 
the short and long stimuli, a result de- 
scribed previously, although for a dif- 
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ferent subdivision of Ss. The intro- 
duction of the discrimination procedure 
may have had some tendency to lower 
performance to both CSs. The only 
reliable decrement (as evaluated by a 
t test for realted measures) from the 
end of acquisition to the first trial block 
of discrimination occurred to the short 
nonreinforced CS in Group L+S—. 
Using difference scores, statistical 
analysis revealed that there was a sig- 
nificant change in performance in both 
groups. Inspection of Fig. 3 reveals, 
however, that this change produced 
an actual discrimination only in the 
case of Group L+S—. On the last 10 
trials there was a significant difference 
in the performance of this group to 
the two stimuli, ¢ (19) = 5.65, p< 
.005, but not in the performance of 
Group L—S+ (t < 1.0). In the case 
of this group only a more extended 
series of discrimination trials would 
reveal whether a discrimination could 
be established at all. For reasons to 
be developed in the discussion of this 
paper, however, it is thought to be un- 
likely that discrimination will occur 
under these circumstances. 
Generalized responses.—The Ss con- 
ditioned with two ISIs were subjected 
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Fic. 4. Mean percentage of responses 
meeting a 250-517-msec. latency criterion 
for long and short CSs for all Ss receiving 
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Fic. 5. Mean percentage of responses 
meeting a 250-517-msec. latency criterion and 
occurring during blocks of trials with a 1.2- 
sec, ISI for three groups. 


to a type of discrimination procedure, 
even during original acquisition. As 
CRs were defined for each interval, 
there was no overlap in the latency dis- 
tributions for responses counted for 
.5-sec. trials (250-517 msec.) and 
those in the 1.2-sec. trials (600-1,217 
msec.). It had been anticipated that 
Ss would make this discrimination and 
produce responses with latencies ap- 
propriate to the interval signaled by 
the particular CS. This did not hap- 
pen, however. In the presence of the 
long-interval CS, Ss gave as many re- 
sponses appropriate to the short inter- 
val as they did responses in the pre- 
sence of the short CS. This fact is de- 
picted graphically in Fig. 4 where per- 
centage of CRs appropriate to the short 
ISI are plotted separately for the short 
and long intervals during acquisition. 
These responses shall be treated as 
generalized responses because of their 
dependence upon the reinforcement of 
the short CS (see Fig. 5). A process 
of disinhibition may also be involved, 
however, because these responses occur 
under conditions where overall level 
of responding is relatively low. 

The discrimination trials provide the 
opportunity to obtain further data on 
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the generalization of short-interval re- 
sponses to the long-interval CS. Fig- 
ure 5 shows the percentage of short- 
interval CRs occurring on each block 
of trials during acquisition and dis- 
crimination for the three groups re- 
ceiving a long ISI. For Groups 
L4-S— and L—S+, each point on the 
graph represents 10 trials, while for 
Group L--L— each point represents 20 
trials. During acquisition both Groups 
L+S— and L—S+ received a short 
ISI, showing an increase in percent- 
age of generalized responses. Group 
L+L— provides an appropriate con- 
trol condition and showed essentially 
no change. The total percentages of 
responses for the three groups were 
significantly different, F (2, 57) — 
7.87, p < .01, and a Tukey test showed 
Group L+L— to be responding sig- 
nificantly below the levels of the other 
two groups. On this basis it was con- 
cluded that the short-interval responses 
during acquisition were indeed general- 
ized responses and not simply mis- 
classified CRs to the long CS. 

During the discrimination trials, the 
curves showing percentages of general- 
ized responses indicate a superior per- 
formance of Group S+L— the only 
group to have a short CS reinforced 
during the discrimination phase. Per- 
formance of Group S—L+ decreased 
to the level of Group L+L—, indi- 
cating that reinforcement of the short 
CS was necessary for the level of gen- 
eralized responses to remain high. An 
analysis of variance showed differences 
among the three groups to be signif- 
icant, F (2, 57) = 14.92, p < .01, and 
no significant block effect or inter- 
action. A Tukey test showed perform- 
ance of Group L—S+ to be signif- 
icantly higher than that of the other 


two groups. 
The materials presented in Fig. 4 
suggested that the generalization 


gradient for the short-latency responses 
was essentially flat : that the percentage 
of generalized reactions to the long 
stimulus was the same as the percent- 
age of nongeneralized reactions to the 
short stimulus. This led to the ques- 
tion of the shape of the generalization 
gradient for the long-interval re- 
sponses. Data on this point could 
reasonably be obtained only from the 
records of Group L+S— on discrimi- 
nation trials when the short CS was 
not followed by the UCS. For this 
group, the percentage of responses was 
36 during the interval 600-1,217 msec. 
(which had been used to define a CR 
in the long interval). On the same 
trials the percentage of CRs of the 
same latency to the reinforced stimulus 
was 40. If this analysis were valid, it 
would suggest, of course, that the 
generalization gradient for these re- 
sponses was also flat. Being aware, 
however, that these responses were 
obtained following CS cessation and 
could be short-latency responses made 
to that event, the records of Groups 
S+L— and SS were examined during 
discrimination and following the termi- 
nation of the nonreinforced CS and 
obtained percentages of 26 and 29. 
For this reason it was concluded that 
the majority of the 36% responding 
mentioned above and perhaps all of 
it was provided by a set of short-la- 
tency reactions to CS cessation, and 
this experiment provides no useful evi- 
dence pertaining to the generalization 
of long-latency responses.* 


Discussion 


This experiment was conceived in terms 
that were mainly empirical rather than 
theoretical. The authors wanted to de- 


4Dr. B. W. Feather (personal communi- 
cation) has observed a similar tendency for 
human Ss to produce salivary reactions to 
the cessation of the US. The phenomenon 
appears to be worth further investigation. 


658 


termine whether the type of experimental 
design would have an effect upon the ISI 
function as it does upon the effects of 
other variables in eyelid conditioning, but 
no particular outcome of this experiment 
was strongly anticipated. Similarly, there 
was interest in the phenomena of gen- 
eralization and discrimination because 
they could be studied in a somewhat un- 
usual context. Therefore, the authors 
remain quite close to the data in inter- 
preting the results just presented. 

The finding of chief importance to come 
from the acquisition portion of this ex- 
periment was that the ISI function differs 
significantly for the between-Ss and 
within-Ss designs. With the between-Ss 
procedure the expected higher level of 
conditioning at an ISI of .5 sec. rather 
than at 1.2 sec. was obtained, although 
the difference was not significant. With 
the within-Ss design a significantly supe- 
rior conditioning at the 1.2-sec. interval 
was obtained. Examination of Fig. 1 and 2 
reveals that the main factor responsible 
for this outcome was a marked reduction 
in level of responding in the short ISI 
in the case of Ss conditioned at both 
intervals. This should somewhat simplify 
the problem of explanation. 

Before considering what was regarded 
as the main explanatory possibility, how- 
ever, it should be mentioned that there 
was reasonable certainty that the dif- 
ference in ISI functions comes about be- 
cause of this reduction of responding in 
the short interval. One strength of the 
design of this experiment was that the 
within-Ss data not employed in the analy- 
sis of variance could be used to provide 
an independent check on points of this 
type. When these data are considered, 
the results remain very much the same 
as they are in Fig. 1 and 2. 

Turning now to the factors responsible 
for the reduction in responding in the 
short ISI, it is important that the low 
level of responding in Group S appears 
in the very first block of 10 acquisition 
trials. Elsewhere (Kimble, 1967) it has 
been shown that such immediate reduc- 
tions in percentage of CRs are often as- 
sociated with the operation of cognitive 
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or attitudinal variables. In this case, the 
experimental design involving two stimuli 
forces S to discriminate between the in- 
tervals and S tends to suppress his re- 
sponses until that discrimination is in 
some sense complete, something that takes 
more than .5 sec. (Kimble, 1967, p. 652). 

In various ways the data on generaliza- 
tion and discrimination suggest further 
points about the way in which this process 
operates, First, the generalization data 
plotted in Fig. 4 show that Ss do indeed 
fail to discriminate between the two CSs 
during the first 517 msec. of the ISI. 
They give the same number of responses 
to each. The level of responding, how- 
ever, is significantly lower than in the 
case of Group SS where both stimuli are 
reinforced at the same interval as is 
shown in Fig. 2. This fact, thus, sub- 
stantiates the conception that, while fail- 
ing to discriminate, Ss inhibit their re- 
sponses. 

These considerations support the idea 
that the presentation of the CS immedi- 
ately elicits some variety of inhibition, 
It will be argued in a moment that this 
inhibition is a by-product of the process 
of discrimination. It is possible that it is 
also the Pavlovian inhibition of delay and 
that inhibition of delay emerges with the 
development of a temporal discrimination 
that would be required of 5 with any long 
ISI. In any event, it is quite clear that 
responses are inhibited early in the ISI. 
In Group LL this leads to a very low level 
of responding in the interval 250-517 
msec. following CS presentation (Fig. 5). 
In the groups conditioned with two ISIs, 
however, the short-latency responses con- 
ditioned with the short interval generalize 
widely and automatically break through 
this inhibitory barrier. The fact that as 
many responses of this type occur as do 
responses appropriate to the short ISI is 
the basis for referring to them as auto- 
matic and for maintaining that they hap- 
pen before discrimination is complete. 

Performance during discrimination adds 
further elements to the interpretation. 
Confirming the results of Gynther (1957) 
and others, a significant inhibitory decre- 
ment was obtained when Ss in Groups 
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LL and SS were shifted to the discrimina- 
tion procedure. In the cases of the groups 
conditioned with two ISIs, however, 
decremental performance occurred dur- 
ing acquisition and the shift to discrim- 
ination produced a further decrement only 
in the case of responses in the short ISI 
when they were not reinforced. All other 
responses either showed a slight, insig- 
nificant decrement or an increase, as is 
shown in the bottom part of Fig. 3. 

These facts tend to tie the inhibitory 
process directly to the making of a dis- 
crimination. It now seems that the oper- 
ation of nonreinforcement is probably not 
the sole basis for the development of dis- 
criminative inhibition as traditional “con- 
ditioning-extinction” theories (e.g., Gyn- 
ther, 1957) usually imply. It is believed 
instead that nonreinforcement is one of 
many possible bases for the development 
of a discrimination. Others might be that 
different stimuli signify different ISIs (as 
in this experiment), different delays of 
reinforcement, different magnitudes, or 
different schedules. In any such case, 
it is suspected that an inhibitory state 
would be associated with the process of 
discrimination. In this experiment, Ss in 
the groups conditioned with two ISIs had 
been required to make a discrimination 
during the acquisition phase of the ex- 
periment. They showed a substantial de- 
gree of inhibition there but, in general, 
showed little additional inhibitory decre- 
ment when the discrimination was en- 
hanced by the nonreinforcement of one 
CS. By contrast, Groups LL and SS, for 
whom discrimination began following ac- 
quisition, showed a pronounced and sig- 
nificant decrement. A special experiment 
directed at this point (Posnick, 1967) has 
subsequently verified the existence of an 
inhibitory mechanism related to discrim- 
ination as well as certain other findings 
of this study. 

Discriminative performance. itself sup- 
ported the findings of Hartman and Grant 
(1962) and also revealed something of 
the mechanism that apparently controls 
the ease of establishing differential eyelid 
conditioning. Discrimination occurred 
with long ISIs in Group L+L— but not 


with short ISIs in Group S+S—. In the 
case of the groups with two ISIs, dis- 
crimination occurred only in the case of 
Subgroup L+S—. It is to be noted that 
discrimination failed to occur in any con- 
dition involving S+, that is, reinforce- 
ment of the short-interval responses. In 
the case of Group S+S—, this is not 
hard to understand because it has al- 
ready been seen that short-interval re- 
sponses generalize almost completely to 
other stimuli. In the case of Group 
L—S+, however, the situation is more 
complex. In this group discrimination 
failed to occur because the level of re- 
sponding to L— remained high, as if 
somehow supported by the reinforcement 
of S+, although the responses to the two 
stimuli had nonoverlapping ranges of la- 
tencies. In order to show how this comes 
about, one final result of the experiment 
shall be reported. 

During the 40 presentations the stim- 
ulus associated with the long ISI, during 
acquisition for the group conditioned with 
two ISIs, various analyses revealed a 
tendency for the occurrence of a short- 
latency response to be more strongly as- 
sociated with the occurrence than with 
the nonoccurrence of a second long-la- 
tency response on the same trial. This 
is in spite of the fact that there probably 
is a tendency not to blink immediately 
following a blink. To make this point, 
10 Ss from Group LL and 10 Ss from 
the group conditioned with two ISIs who 
(a) gave at least some long-latency CRs 
during acquisition and (b) had produced 
records on which all trials were scorable 
were selected. For the 10 Ss in Group 
LL a long-latency CR was preceded by a 
short-latency response only 9% of the 
time, a fact that may be taken.as showing 
the normal tendency for a blink to inhibit 
immediately subsequent blinking. By 
contrast, in the case of the 10 Ss from 
the group with two ISIs, 32% of the 
long-latency CRs followed short-latency 
generalized responses. This percentage is 
nearly four times the value obtained for 
Group LL (p < .02), supporting the idea 
that the widely generalized responses pro- 
duced by reinforcement at the short IST 
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indirectly supported responding at a 
longer interval. 


CONCLUSIONS 


The authors believe that the results of 
this experiment justify the following main 
conclusions: (a) The ISI function ob- 
tained with a within-Ss design is different 
from that obtained with a between-Ss 
design. (b) This effect occurs because of 
the inhibition of responses in the short 
ISI in the case of the within-Ss design. 
(c) This inhibitory process (possibly in- 
hibition of delay) results from making a 
discrimination and does not depend ex- 
clusively upon nonreinforcement. (d) In 
spite of this inhibition, there is a strong 
tendency for generalized short-latency 
CRs to occur, a fact that explicates the 
meaning of Hartman and Grant's con- 
clusion that making a discrimination takes 
time. (e) These short-latency responses 
also tend indirectly to support responses 
at longer ISIs and to retard the develop- 
ment of differential performance. 
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FOCUSING STRATEGY FOR EIGHT CONCEPT RULES* 


PATRICK R. LAUGHLIN 
Loyola University, Chicago 


192 college students solved 3 concept-attainment problems in an 
8X2X3 repeated-measures factorial design with the variables (a) 
concept rule: conjunctive (CONJ), conjunctive absence (CONJ AB), 
exclusion (EXC), inclusive disjunctive (IN DIS), exclusive dis- 
junctive (EX DIS), disjunctive absence (DIS AB), conditional 
(COND), and biconditional (BIC); (b) memory: paper and pencil 
allowed or not allowed; (c) problems: (3 per S). Major results 
were: (a) The use of focusing strategy generally supported the order 
hypothesized from considerations of card selection, memory, and 
inference requirements: (CONJ = EXC) > CONJ AB > (BIC = EX 
DIS) > (IN DIS=DIS AB)>COND. Rules requiring com- 
plementary focusing (IN DIS, DIS AB, COND) had less focusing 
than rules requiring a type of simple: focusing. (b) The order of 
difficulty of the rules as measured by increasing card choices to 
solution was CONJ, CONJ AB, EXC, EX DIS, BIC, DIS AB, 
IN DIS, and COND. (c) No difference between paper allowed and 


not allowed conditions, 


Bruner, Goodnow, and Austin 
(1956) originally distinguished two 
general selection strategies in concept 
attainment: (a) scanning, a hypothesis- 
testing strategy, and (b) focusing, an 
attribute-testing strategy. ^ Laughlin 
(1965, 1966, 1968) and Laughlin and 
Jordan (1967) have developed quanti- 
tative scoring rules for focusing 
strategy for conjunctive, inclusive dis- 
junctive, biconditional, and conditional 
concepts. The following study ex- 
tended the scoring of focusing to con- 
cepts defined by the rules of conjunc- 
tive absence, exclusion, exclusive dis- 
junction, and disjunctive absence 
(Haygood & Bourne, 1965; Neisser & 
Weene, 1962). The relative order of 
focusing for the eight concept rules was 
predicted from analytic considerations 
of the differential instance selection, in- 
ference, and memory requirements 


1 Supported by Grant No. GB 5369 from 
the National Science Foundation. The 
author wishes to express his appreciation 
to Richard P. McGlynn, Jon A. Anderson, 
Everett S. Jacobson, and Michael Schmitt 
for assistance in the collection of the data. 


necessary to employ focusing with each 
rule. Secondly, the influence of 
memory, as manipulated by allowing 
or not allowing the use of paper and 
pencil, was compared for the eight 
rules. The problems involved two 
relevant attributes from a six-attribute, 
two-value stimulus array. 

With conjunctive concepts (CONJ), 
focusing requires S to select a card that 
differs in only one attribute from the 
given card. If this selected card is 
positive, the changed attribute is irrele- 
vant to the concept, while if the card is 
negative, the changed attribute is rele- 
vant. Previously developed scoring 
rules for CONJ (Laughlin, 1966) can 
be directly extended to apply to con- 
junctive absence (CONJ AB) and ex- 
clusion (EXC), with appropriate vari- 
ations in the inferences from positive 
and negative instances due to the nega- 
tive statement of both relevant values 
for CONJ AB and one relevant value 
for EXC. 

With two-attribute disjunctive con- 
cepts the given card may contain both 
relevant values, only one, or neither, 


661 


662 PATRICK R. 
Thus, with inclusive disjunctive (IN 
DIS) concepts the positive given card 
may include both relevant values or 
only one; with disjunctive absence 
(DIS AB) it may include one relevant 
value or neither one; while exclusive 
disjunctive (EX DIS) requires that 
only one relevant value be on the given 
card. Consequently, with IN DIS and 
DIS AB the type of possible focusing 
strategy depends upon S's knowledge 
of the composition of the given card. 
If S knows that the given IN DIS card 
contains both relevant values, or that 
the given DIS AB card contains 
neither relevant value, he can solve 
the problem by selecting cards that 
change all but one attribute from the 
given card, i.e., five attributes for the 
six-attribute concept universe in the 
present experiment. For these cards a 
positive instance means that the un- 
changed attribute is relevant, while a 
negative instance means that it is ir- 
relevant. This negative focusing 
strategy (Bruner, Goodnow, & Austin, 
1956) is theoretically as efficient as 
positive focusing (CONJ, CONJ AB, 
EXC), although Laughlin and Jordan 
(1967) found less focusing for IN DIS 
than CONJ. 

However, if the given IN DIS or 
DIS AB card contains only one rele- 
vant value, this strategy is impossible 
since all cards that change all but one 
attribute will be positive. The card 
which changes all but the relevant value 
present on the given card will be posi- 
tive because of the presence of this 
relevant value, while the other cards 
that change all but one attribute will 
be positive because the change results 
in the presence of the relevant value 
that was not on the given card. Such 
concepts may be solved by a more elab- 
orate strategy of complementary focus- 
ing, which combines aspects of conjunc- 
tive focusing and disjunctive focusing 
for given cards known to contain both 
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relevant values (IN DIS) or neither 
relevant value (DIS AB). Both 
the card that alters only one attribute 
from the given card and the card that 
alters all but that attribute are chosen. 
If both of these complementary cards 
are positive, the attribute in question 
is irrelevant, If the attribute is rele- 
vant and the relevant value was on the 
given card, the card that alters only 
that attribute will be negative while the 
card that alters all but that attribute 
will be positive. If the attribute is rele- 
vant and the relevant value was not 
on the given card, the card that alters 
only that attribute will be positive, 
while the card that alters all but that 
attribute will be negative. Thus, com- 
plementary focusing proceeds by a 
sequence of inferences from a series of 
pairs of card selections that are the 
exact opposite or complement. of each 
other. Finally, it may be noted that 
with two-value attributes S can deter- 
mine whether the given IN DIS card 
contains both relevant values or only 
one by selecting the complement of the 
given card, ie, the one card that 
changes all attributes. If this card is 
negative, the given card contains both 
relevant values, while if this card is 
positive, the given card contains only 
one relevant value. An analogous 
process can be used to determine 
whether the given DIS AB card con- 
tains neither relevant value nor one 
relevant value. 

Exclusive disjunctive concepts may 
be solved either by selecting cards that 
change only one attribute from the 
given card, or by selecting cards that 
change all but one attribute. If one 
attribute is changed, a positive card 
indicates that the changed attribute is 
irrelevant, while a negative card indi- 
cates that the changed attribute is rele- 
vant. If all attributes but one are 
changed, a positive card indicates that 
the unchanged attribute is irrelevant, 
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while a negative card indicates that it 
is relevant. Thus, the card that changes 
one attribute and its complement that 
changes all but that attribute give re- 
dundant information on the attribute in 
question. Therefore, complementary 
focusing is not required, and EX DIS 
should be easier to solve than IN DIS 
or DIS AB if S does not know the 
composition of the given card. 
Conditional concepts (COND) en- 
tail two types of positive instances: (a) 
those that satisfy both the "if factor" 
and the "then factor," (b) those that 
do not contradict the if factor. For 
example, for the concept “if blue plus, 
then red plus," an instance with a 
blue plus and red plus is positive for 
Reason a, while any blue minus in- 
stance (whether red plus or red minus) 
is positive for Reason b. Thus, three 
types of given positive cards are pos- 
sible: Case 1: given card has both the 
relevant if factor and relevant then 
factor. Case 2: given card does not 
have the if factor but has the then fac- 
tor. Case 3: given card does not 
have the if factor or the. then factor. 
Therefore, a complementary focusing 
strategy must be used. If both the card 
that changes only one attribute and the 
card that changes all but that attribute 
are positive, the attribute in question is 
irrelevant, irrespective of which case 
the given card entails. In Case 1 a 
positive card that changes one attribute 
indicates that the changed attribute is 
irrelevant to the then factor, and a 
negative card that changes one at- 
tribute indicates that the changed attri- 
bute is relevant to the then factor. A 
positive card that changes all but one 
attribute indicates that the unchanged 
attribute is irrelevant to the if factor, 
and a negative card that changes all 
but one attribute indicates that the un- 
changed attribute is relevant to the if 
factor. In Case 2 the cards that change 
one attribute will all- be - positive, 
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whether the changed attribute is rele- 
vant or irrelevant. A positive card 
that changes all but one attribute in- 
dicates that the unchanged attribute 
is relevant, while a negative card that 
changes all but one attribute indicates 
that the unchanged attribute is irrele- 
vant. There will be two positive in- 
stances: (a) because the unchanged at- 
tribute is the if factor, which was not 
present on the given card; (b) because 
the unchanged attribute is the then 
factor, which was present on the given 
card. Hence the relevant value of the 
if factor is the opposite of the given 
card. In Case 3 a positive card that 
changes one attribute indicates that the 
changed attribute is irrelevant to the 
if factor, and a negative card that 
changes one attribute indicates that the 
changed attribute is relevant to the if 
factor. A positive card that changes 
five attributes indicates that the un- 
changed attribute is irrelevant to the 
then factor, and a negative instance that 
changes five attributes indicates that the 
unchanged attribute is relevant to the 
then factor. The relevant values of 
both the if factor and the then factor 
will be the opposite of those on the 
given card. 

With biconditional concepts (BIC) 
focusing is possible by selecting cards 
that either change one attribute or all 
but one attribute from the given card. 
For cards that change one attribute, a 
positive instance indicates that the 
changed attribute is irrelevant, while a 
negative instance indicates that the 
changed attribute is relevant. ^ For 
cards that change all but one attribute, 
a positive instance indicates that the un- 
changed attribute is irrelevant, while a 
negative instance indicates that the 
unchanged attribute is relevant. Thus, 
as with EX DIS, the card that changes 
one attribute and its complement that 
changes all but one attribute give re- 
dundant information. 
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From these considerations it is evi- 
dent that the card selection, inference, 
and memory requirements for the use 
of focusing strategy vary considerably 
for the eight concept rules.  Conjunc- 
tive, CONJ AB, and EXC focusing 
requires only selection of cards that 
change one attribute and an inference 
from the resulting positive or negative 
instance to the changed attribute. Fur- 
thermore, successful focus gambles, 
card choices that change more than one 
attribute and are positive (Bruner 
et al., 1956), are possible. However, 
CONJ AB may result in less focusing 
because it requires a more complex 
inference to two absent rather than 
present attributes; likewise, EXC re- 
quires one inference to an absent rele- 
vant value, Biconditional and EX 
DIS focusing likewise require only se- 
lection of cards that change one at- 
tribute and an inference to the changed 
attribute. However, to the extent that 
Ss attempt to solve these rules by 
changing all but one attribute and in- 
ferring to the unchanged attribute, 
focusing should be correspondingly 
more difficult. Furthermore, successful 
focus gambles are not possible with 
BIC and EX DIS. Inclusive disjunc- 
tive and DIS AB require comple- 
mentary focusing, which doubles the 
selection, inference, and memory re- 
quirements. Finally, COND focusing 
is extremely demanding, requiring com- 
plementary focusing and a complex pat- 
tern of inferences differing for three 
possible given cards. Thus, the hy- 
pothesized order of focusing strategy 
in the present experiment was: (CONJ 
= EXC) > CONJ AB > (BIC = EX 
DIS) > (IN DIS = DIS AB) > 
COND. 


METHOD 


Design and subjects —The experimental 
design was a 8X2*X 3 repeated-measures 
factorial with the variables : (a) concept rule 
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(CONJ, CONJ AB, EXC, IN DIS, EX 
DIS, DIS AB, COND, BIC), (b) memory 
(paper and pencil allowed or not allowed), 
(c) problems (three for each S). The Ss 
were 192 introductory psychology students, 
half males and half females, at Loyola Uni- 
versity. Six males and six females were 
randomly assigned to each of the 16 ex- 
perimental conditions. 

Stimulus display and problems.—The stim- 
ulus display was a 28 X 44-in. white poster- 
board containing an 8 X 8 array of 64 23 X 
4-in. cards drawn in colored ink with dark 
outlines. The 64 cards represented all pos- ' 
sible combinations of six plus and/or minus 
signs in a row. Each position was a differ- 
ent color, so that the color name was the 
attribute and plus or minus was the value of 
each color. The cards were arranged sys- 
tematically on the board; for example, the 
top four rows were blue plus and the bottom 
four blue minus. 

Three sets of three problems each were 
randomly selected from the total set of 
possible two-attribute concepts, and the same 
relevant attributes and values used for all 
eight concept rules. The rule with an ex- 
ample was on a reference card available to 
S throughout the experiment. Two males 
and two females in each condition were ran- 
domly assigned to each of the three sets of 
problems. The initial given positive in- 
stances were randomly selected from the 
subset of possible initial positive instances 
for each particular problem. With IN DIS 
half of the given instances contained both 
relevant values and half one relevant value. 
With DIS AB half of the given instances 
contained one relevant value and half neither 
relevant value. For COND one third of the 
given instances represented each of the 
three possible cases. For BIC half of the 
given instances contained both relevant 
values and half neither relevant value. All 
given instances for CONJ, CONJ AB, and 
EXC satisfied the specified presence and/or 
absence of the relevant values, while those 
for EX DIS contained only one of the rele- 
vant values. 

Procedure—The instructions thoroughly 
explained with an example the nature of the 
task and the particular concept rule. The 
problem began with a given positive card, 
although the particular subtype of positive 
card was not specified. The S then selected 
a card, E gave oral feedback of “yes” or 
“no” to indicate a positive or negative card, 
S made one hypothesis, and E gave oral 
feedback of yes or no on the hypothesis. 
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This cycle was repeated until § made the 
correct hypothesis. Depending upon the 
memory condition paper was allowed or not 
allowed. It was emphasized in all condi- 
tions that the problems were to be solved in 
as few cards choices as possible, regardless 
of time (Laughlin, 1964). 


RESULTS 


Focusing strategy.—Focusing stra- 
tegy was scored according to two rules. 


Rule 1: each card choice had to obtain 
information on a new attribute. Rule 
2: the hypothesis had to be tenable. 
The total number of instances that 
satisfied both of these two rules was 
divided by the total number of card 
choices on the problem to obtain a con- 
tinuous focusing score from .00 to 
1.00. The specific types of informative 


TABLE 1 


INFORMATIVE CARD CHOICES AND TYPES OF UNTENABLE HYPOTHESES 
For ErcHr Concert RULES 


Concept Rule Informative Card Choices 


Untenable Hypotheses 


CONJ 1. Cards changing one attribute 


CONJ AB | 1. cards changing one attribute 
one attribute 
EXC 1. cards changing one attribute 


one attribute 


2. positive cards changing more than 
one attribute (“focus gamble") 


2. positive cards changing more than 


2. positive cards changing more than 


1. both values of the hypothesis on a 
previous negative card 
2. opposite of either value on a previous 
positive card 
th values on a previous negative 
card 
- opposite of either value on a previous 
positive card 
. opposite value of the asserted factor 
on a previous positive card 
2. excluded factor on a previous posi- 
tive card 
3. both the asserted factor and the ex- 
cluded factor on a previous negative 
card 
1. either value on a previous negative 


=. ne 


2. opposite of both values on a previous 
positive card 


IN DIS 1. cards changing one attribute | 
2. cards changing all but one attribute* card 
EX DIS . cards changing one attribute 


1 
2. cards changing all but one at- 
tribute? 


DIS AB 1. cards changing one attribute 


1. opposite of either value (but not the 
opposite of both values) on a previous 
negative card 

2. both values on a previous positive 
card 

3. opposite of both values on a previous 
positive card 

1. both values on a previous positive 


2. cards changing all but one attribute* card 
2. opposite of either value on a previous 
negative card 
COND 1. cards changing one attribute 1. both values on a previous negative 
2. cards changing all but one at- card 
tribute* 2. if factor and opposite of then factor 
on a previous positive card 
3. opposite of if factor on a previous 
negative card $ 
4. then factor on a previous negative 
card 
BIC 1. cards changing one attribute 1. both values on a previous negative 
2. cards changing all but one at- card 
tribute? " 2. opposite of either value (but not the 


opposite of both values) on a previous 
positive card 


a Both of these complementary cases give information on each attribute. 
b These complementary cases give redundant information on each attribute. 
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TABLE 2 
MEANS AND STANDARD DEVIATIONS FOR FOCUSING STRATEGY 


Paper No Paper 
Concept Problem Problem 
One Two Three Total One Two Three Total 
CONJ 
M 55 ER 67 1.77 44 -99 45 1.48 
SD .66 .32 .28 -50 37 .32 34 55 
CONJ AB 
M 25 .52 40 1.17 27 36 AS 1.08 
SD .30 24 29 48 .30 35 35 -69 
EXC 
M 27 -70 71 1.68 -36 .56 48 1.40 
SD 22 35 33 68 29 35 27 63 
IN DIS 
M AS AT .26 58 ll .30 21 .62 
D 14 23 23 47 .09 23 .28 32 
EX DIS 
M 13 14 18 45 kd 32 49 .92 
SD 19 48 29 48 10 27 27 48 
DIS AB 
M 42 25 31 68 21 A3 12 46 
SD 42 27 26 44 2f. 45 A3 40 
COND 
M A7 444 2 A3 18 fle 21 .56 
SD 44 17 A1 .30 A3 21 27 37 
BIC 
M .16 .20 57 93 .16 24 40 .80 
SD 41 .16 .29 36 A5 2 10 53 


card choices and untenable hypotheses 
for each of the eight rules are given in 
Table 1. The means and standard 
deviations for focusing strategy are 
given in Table 2. 

The effect of the concept rule was 
significant, F (7, 176) =17.19, p< 
.001. The order of decreasing focusing 
was CONJ, EXC, CONJ AB, BIC, 
EX DIS, IN DIS, DIS AB, COND. 
The effects of memory and the rule 
by memory interaction were not signi- 
ficant, F (1, 176) «1; F (7, 176) — 
1.39, respectively. Means and levels 
of significance for Duncan multiple- 
range comparisons between the eight 
rules summing over the memory con- 
ditions are given in Table 4. The ef- 
fect of problems was significant, F (2, 
352) = 18.51, p < .001, representing 
more focusing on Problem 2 than 1 


(p € .001), with no difference between 
Problems 2 and 3. 

Number of card choices to solution. 
—The means and standard deviations 
for number of card choices to solution 
are given in Table 3. The effect of the 
concept rule was significant, F (7, 176) 
= 5.97, p < 001. The order of in- 
creasing card choices was CONJ, 
CONJ AB, EXC, EX DIS, BIC, DIS 
AB, IN DIS, and COND. The ef- 
fect of memory was not significant, F 
(1, 176) « 1. The Rule x Memory 
interaction was marginally significant, 
F (7, 176) 2219, p< .05. Means 
and results of Duncan comparisons 
summing over memory conditions are 
given in Table 4. ^ The effect of 
problems was significant, F (2, 352) 
— 2748, p « .001, representing fewer 
card choices on Problem 2 than ! 
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(p < .001), with no difference between 
Problems 2 and 3. 


Discussion 


The obtained order of focusing strategy 
was exactly as predicted, and indicates 
that the use of focusing strategy decreases 
as the instance selection, inference, and 
memory requirements necessary to utilize 
the strategy increase. The multiple-com- 
parisons indicate that the eight rules 
formed two clusters, with more focusing 
for CONJ, CONJ AB, EXC, and BIC 
than for EX DIS, IN DIS, DIS AB, 
and COND. In particular, the three rules 
which require complementary focusing 
(IN DIS, DIS AB, and COND) resulted 
in less use of the strategy than the rules 
which require only some modifications of 
simple focusing. 

The criteria for focusing were that the 
card choice gain information on a new 


attribute and be followed by a tenable 
hypothesis, ie. one consistent with all 
previous information. The scoring of 
untenable hypotheses therefore goes be- 
yond the information on the current in- 
stance to consider all previous instances, 
in a way analogous to the Cahill and 
Hovland (1960) distinction between per- 
ceptual inference and memory errors. It 
may be objected that this scoring method 
is more a measure of problem-solving 
efficiency than of focusing strategy as de- 
scribed by Bruner et al. (1956). This is 
true; the present criteria conceive of 
focusing as a continuous dimension rather 
than a strict dichotomy with scanning as 
proposed by Bruner et al. The crucial 
quantifiable aspects of focusing thus con- 
ceived are the degree to which the person 
gains information on attributes by his 
card selections, and the degree to which 
his present hypothesis is consistent with 
previous information. It seems to be 


TABLE 3 
MEANS AND STANDARD DEVIATIONS FOR NUMBER OF Carp CHOICES 
TO SOLUTION 
Paper No Paper 
Concept Problem Problem 
One Two Three Total One Two Three Total 

CONJ 

M 4.58 4.17 3.75 12.50 1:25 7.58 5.67 20.50 

SD 2.63 2.79 2.05 247 6.43 10.80 3.85 11.03 
CONJ AB 

M 8.50 5.58 3.75 17.83 8.67 4.92 7.00 20.59 

SD 5.14 3.41 2.43 6.06 7.33 5.92 5.87 10.63 
EXC 

M 7.83 3.25 4.67 15.75 9.33 6.92 7.08 23.33 

SD 4,42 1.79 2.45 5.02 6.59 5.48 3.26 10.68 
IN DIS 

M 15.92 5.42 7.25 28.59 14.67 7.15 7.42 29.84 

SD 9.91 3.01 5.31 13.78 7.16 5.80 7.66 12.64 
EX DIS 

M 10.42 6.25 7.25 23.92 9.33 5.58 5.25 20.16 

SD 8.93 4.78 7.38 18.50 3.78 3.21 3.27 7.35 
DIS AB 

M 11.50 6.25 6.50 24.25 8.50 8.75 8.75 26.00 

SD 6.76 2.28 3.33 8.82 5.56 7.63 3.00 11.77 
COND 

M 17.17 11.25 14.25 42.67 11.58 9.17 6.83 27.58 

SD 10.84 6.99 9.85 21.07 7145 5.06 3.37 10.26 
BIC | 

M 8.92 9.00 4.25 2247 11.25 6.00 6.33 23.58 

SD 5.48 7.28 2.20 10.47 6.88 3.56 3.89 10.71 
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TABLE 4 


MEANS AND LEVELS OF SIGNIFICANCE or DuNCAN MULTIPLE-RANGE 
COMPARISONS BETWEEN EIGHT CONCEPT RULES FOR FOCUSING 
STRATEGY AND NUMBER OF CARD CHOICES TO SOLUTION 


PATRICK R. LAUGHLIN 


Focusing Strategy 
COND DIS AB | IN DIS | EX DIS BIC CONJ AB EXC CONJ 
AT 49 .20 3 429 „37 E En 
COND ns ns ns .05 .001 .001 .001 
DIS AB ns ns ns .05 .001 .001 
IN DIS ns ns .05 .001 001 
EX DIS ns .05 .001 .001 
BIC ns .001 .001 
CONJ AB .05 .001 
EXC ns 
Number of Card Choices to Solution 
CONJ |CONJ AB EXC EX DIS BIC DIS AB IN DIS COND 
5.50 6.40 6.51 7.85 7.63 8,38 9.74 11,71 
CONJ ns ns ns ns .05 .001 .001 
CONJ AB ns ns ns ns .01 .001 
EXC ns ns ns .05 .001 
EX DIS ns ns ns .001 
BIC ns ns .001 
DIS AB ns OL 
IN DIS ns 


more profitable to thus consider focusing 
strategy a continuous dimension rather 
than an all-or-none strategy in opposition 
to scanning. 

The obtained order of difficulty for the 
eight concept rules as indicated by the 
number of card choices to solution ex- 
tends previous research and is largely 
congruent with it. The difficulty of dis- 
junctive concepts relative to conjunctive 
is comparable to studies by Bruner et 
al. (1956), Hunt and Hovland (1960), 
Conant and Trabasso (1964), and 
Laughlin and Jordan (1967). Further- 
more, both the finding that the three types 
of disjunctives (EX DIS, IN DIS, and 


DIS AB) did not differ from each other, ` 


and the finding that the three types of 
conjunctives (CONJ, CONJ AB, and 
EXC) did not differ from each other, 
extends the previous studies which were 
limited to IN DIS vs. CONJ. The ex- 
treme difficulty of COND is in accord 


with previous studies by Hunt and 
Kreuter (1962), Haygood and Stevenson | 
(1967), and Neisser and Weene (1962). - 
The only serious discrepancy is for 
BIC, which was relatively easy in the - 
present study and extremely difficult in - 
experiments by Haygood and Bourne - 
(1965) and Neisser and Weene (1962). 
The present study used a selection pro- - 
cedure while both Haygood and Bourne, 
and Neisser and Weene, used a reception — 
procedure. Thus, BIC may be relatively | 
easier than other concept rules under 
selection conditions than reception con- 
ditions, Secondly, the present study used 
a two-value set of attributes, while Hay- 
good and Bourne (1965) used three-value — 
attributes and Neisser and Weene used - 
-strings of any four of five possible con- — 
sonants. With two-value attributes the - 
absence of a value necessarily means the 
presence of the opposite value, while this 
relationship does not hold for three-value ' 
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or more attributes, and thus BIC may 
be relatively easier under the former 
condition. 

The nonsignificant effect of memory as 
manipulated by allowing paper may indi- 
cate that the previously demonstrated im- 
portance of memory in concept learning 
(Bourne, Goldstein, & Link, 1964; Cahill 
& Hovland, 1960; Dominowski, 1965; 
Hovland & Weiss, 1953; Hunt, 1961) is 
limited to reception procedures. Alter- 
natively, the present Ss may have failed 
to' use the paper effectively, or it may 
merely have been a distraction. 

Finally, it should be reemphasized that 
some of the theory of focusing for the 
various concept rules developed in the 
introduction is limited to two-value at- 
tributes and/or concepts with two relevant 
attributes. With two-value attributes the 
absence of a value necessarily means the 
presence of the opposite value, a relation- 
ship which does not hold for attributes 
with three or more values. Thus, many 
inferences are possible with two-value 
attributes that are not possible with three 
or more values per attribute. 
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THE A-B, B-C, A-C MEDIATION PARADIGM: A-C 
PERFORMANCE IN THE ABSENCE 
OF STUDY TRIALS* 


GEORGE E. WEAVER,? RONALD H. HOPKINS,’ ann RUDOLPH W. SCHULZ 


University of Iowa 


In a further investigation of the contribution of the test-list study 
interval (SI) to mediated facilitation in the A-B, B-C, A-C PA 
chaining (C) paradigm, a test-only procedure was introduced. Thus in 
both C and nonchaining (NC) paradigm (A-B, D-C, A-C), the third 
stage consisted only of a series of 10 multiple-choice test trials in the 
absence of study trials or knowledge of results. Test-list performance 
in Cond. C was reliably superior to that in Cond. NC, performance 
in the latter condition not differing from that expected as a result of 
random guessing. In addition, the strength of either Stage I (A-B) 
or Stage II (B-C) associations was a critical determinant of 
matching performance for corresponding A-C pairs in the C condition. 


Schulz and Lovelace (1964) have 
proposed that mediation in paired-asso- 
ciate (PA) learning consists of two 
differentiable processes: the “discov- 
ery” of mediating responses and the 
“utilization” of such responses. Data 
from two experiments led to the tenta- 
tive identification of the length of the 
study interval (SI) in the test list as 
a determinant of the discovery process 
and the length of the test-stage test 
interval (TI) as a main determiner of 
the likelihood that mediating responses 
could be utilized. 

Support for the presumed relation- 
ship between test-list TI and amount 
of facilitation in the A-B, B-C, A-C 
chaining (C) paradigm as compared 
to the A-B, D-C, A-C nonchaining 
(NC) control was provided by Schulz 
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GB2835 from the National Science Founda- 
tion. 

2 This research was conducted during the 
tenure of the first author's National Science 
Foundation Predoctoral Fellowship. He is 
now at Florida State University. 

3 The second author was an National De- 
fense Education Act Title IV Graduate Fel- 
low while conducting this research. He is 
now at Stanford University. 


and Weaver (1968). In this study the 
difference between A-C performance in 
the C and that in the NC conditions 
indicated that facilitation was reliably 
greater with a 3-sec. TI than with a 
TI of 1.5-sec. However, the factorial 
manipulation of SI (1 or 2.5 sec.) 
produced differences in overall level of 
A-C performance but did not lead to 
an interaction with paradigm, i.e., the 
amount of facilitation in the C as com- 
pared to the NC conditions was statis- 
tically equivalent under both SI dura- 
tions. Thus the hypothesized relation- 
ship between SI and discovery received 
little support. 

In an attempt to further specify the 
contribution of test-list SI to the dis- 
covery and subsequent utilization of 
mediating responses under conditions 
paralleling those used by Schulz and 
Weaver (1968), a test-only procedure 
(SI = 0) was employed with the third 
list (A-C) in the C and NC paradigms 
of the present study. Learning of the 
first two lists proceeded in the usual 
manner while the third stage consisted 
of a series of multiple-choice test trials 
over previously unpaired A and C 
terms. If discovery is a necessary pre- 
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condition for utilization of mediating 
responses, and if discovery is depend- 
ent on SI, no mediated facilitation of 
third-stage matching performance in 
the C paradigm would be expected 
under these conditions. The foregoing 
is simply a statement of the null hypo- 
thesis with respect to the postulated 
role of the discovery process in rela- 
tion to SI under the present conditions. 
Evidence for mediated facilitation has, 
of course, been obtained previously 
with test-only procedures (e.g. Clif- 
ton, 1966; James & Hakes, 1965; 
Peterson & Blattner, 1963). 


METHOD 


Design and subjects —The A-B, B-C, A-C 
chaining (C) and the A-B, D-C, A-C non- 
chaining (NC) paradigms defined the treat- 
ments of the present experiment. A total of 
28 University of Iowa undergraduates, seven 
males and seven females in each condition, 
served as Ss. Participation in research was 
a course requirement and all 5s were naive 
with respect to verbal learning experiments. 

Materials—The learning materials were 
similar to those employed by Schulz and 
Weaver (1968). The A terms in the eight- 
pair PA lists consisted of low m paralogs 
taken from Taylor's (1960) list. Low m 
Archer (1960) CVC trigrams served as C 
terms and common nouns with Thorndike- 
Lorge írequencies of AA were used as B 
and D terms. These nouns had a minimum 
of associative overlap in the Russell and 
Jenkins (1954) norms. Intra- and interlist 
similarity was kept at a minimum in both 
paradigms. 

Procedure.—The procedures employed 
during the first two learning stages have been 
described in detail by Schulz, Weaver, and 
Ginsberg (1965). Briefly the lists in both 
the first and second stages were learned by 
the study-test method to a criterion of two 
successive correct trials. On Stage-I (List 
A-B) test trials Ss were required to recall 
the correct response term. In Stage II (List 
B-C or D-C), however, the test trials took 
the form of a four-alternative multiple- 
choice test for each pair with the correct re- 
sponse plus three other response terms from 
a given list as alternatives. The selection 
of respohse alternatives was done randomly 
for each item on each trial with the restric- 
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tion that each alternative appear approxi- 
mately equally often in each position across 
blocks of four trials. The test-trial al- 
ternatives had the numbers 1 to 4 printed 
below them and Ss responded with the num- 
ber of the alternative they had chosen. A 
2-sec. presentation rate per item was em- 
ployed for both study and test trials during 
Stage I while the presentation rate was 2 
sec. per item on study trials and 3 sec. on 
test trials in Stage II. 

Stage III was the same for both conditions 
and consisted of a series of 10 multiple- 
choice A-C trials similar in form to those 
presented in Stage II. No study trials were 
presented at any time during the test stage. 
The Ss in both conditions were told to select 
the alternative that they felt should be 
paired with the stimulus item and were en- 
couraged to guess if not certain. The 
presentation rate was 3 sec. per item with 
a 3-sec. intertrial interval. 

Four different random orders of presenta- 
tion were constructed for each stage of the 
experiment and all materials were presented 
on a Stowe memory drum. 


RESULTS AND DISCUSSION 


The mean number of trials to reach 
criterion on Stage I was 11.29 for 
Cond. C and 10.93 for Cond. NC with 
SDs of 5.52 and 4.91, respectively. 
The difference between these means 
did not approach significance, t (26) 
« 1; hence, Ss serving under the two 
conditions may be regarded as compar- 
able in ability. 

Analysis of Stage-II performance 
indicated that the B-C list (Cond. C) 
was somewhat more difficult to learn 
than the D-C list (Cond. NC) as the 
mean numbers of trials to criterion 
were 7.86 and 5.14, respectively. The 
corresponding SDs were 4.15 and 3.21. 
The obtained difference in list difficulty 
is presumed to be the result of greater 
interference from B-A backward asso- 
ciations during B-C learning than dur- 
ing D-C learning. This finding agrees 
with results obtained previously by 
Schulz et al. (1965) and by Mandler 
and Earhard (1964) although the pres- 
ent difference in total trials to reach 
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Stage III as a function of trials and 
paradigms. 


criterion was not quite significant at 
the .05 level, ¢ (26) = 1.94. 

The mean number of correct re- 
sponses in Stage III is presented in 
Fig. 1 as a function of trials for both 
the C and NC conditions.  Perfor- 
mance in Cond. C may be seen to be 
consistently superior to that in Cond. 
NC and the difference in mean total 
number of correct responses for the 
two conditions resulted in ¢ significant 
at the .001 level when referred to the 
Behrens-Fisher distribution to adjust 
for heterogeneity of variance, t (14) = 
6.08. The obtained difference in vari- 
ances for the two conditions can most 
reasonably be attributed to the fact 
that the range of scores for total num- 
ber of correct responses was very small 
in Cond. NC (9-26) in comparison 
with that in Cond. C (15-66). Fur- 
thermore, there was no evidence that 
the statistical superiority of perfor- 
mance in Cond. C was due to extremely 
high performance on the part of only 
a few Ss in this condition. In fact, 
with only two exceptions, there was no 
overlap of scores for Ss in the C and 
NC conditions. Moreover, not only 
was performance in Cond. C superior, 
overall, to that in Cond. NC, but this 
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superiority was also reliably present on 
the very first test trial, ¢ (26) = 2.74, 
?«.02. 

It must be concluded in light of the 
foregoing that the presence of a formal 
SI is not a necessary precondition 
determining S’s ability to discover and 
utilize mediating associations with the 
materials and procedures used here and 
in previous studies. Also, the accord 
between the present results and those 
obtained under test-only conditions 
with other procedures and paradigms, 
a series of studies by Peterson and 
associates (e.g., Peterson & Blattner, 
1963), James and Hakes (1965), and 
Clifton (1966) suggest that this con- 
clusion has considerable generality. 

If the present results are considered 
in relation to those obtained previously 
by Schulz et al. (1965) with a 2-sec. 
SI and by Schulz and Weaver (1968) 
with 1- and 2.5-sec SIs, a rather 
complete picture of the role of the 
length of SI in Stage III performance 
under C and NC conditions emerges. 
The relevant data are shown in Table 
l. Though comparable in essential 
respects, the present study differs from 
the other two in that 8- rather than 
10-item lists were used; hence, the 
conversion to percentages in Table 1. 
It may be seen that performance was 
directly related to length of SI, as 
would be anticipated. However, the 
amounts of mediated facilitation did 


TABLE 1 


PERCENTAGES OF TOTAL NUMBER or CORRECT 
RESPONSES IN STAGE III UNDER C AND 
NC Conpitions as A FUNCTION 
oF LENGTH oF SI 


Length of SI 


Paradigm 
0 1.0 2.0 2.5 


Ras (C)| 56.07 | 60.35 | 73.95 | 77.80 
on 
(NC) 21.78 | 37.70 | 50.30 | 53.20 
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not covary similarily with SI. For 
intervals of 1, 2, and 2.5 sec., the 
difference between C and NC condi- 
tions remained virtually constant. This 
difference is somewhat larger at 0 sec. 
than at the other intervals, a result un- 
doubtedly attributable to the fact that 
the present NC Ss performed at a 
chance level since the total absence of 
an SI and informative feedback pre- 
cluded virtually all possible bases for 
learning in Stage III under NC con- 
ditions. 

While the evidence is rather defini- 
tive that the discovery process is not 
dependent upon the length of SI, it 


does not follow necessarily that the ` 


notion itself need be discarded. The 
usefulness of this concept does not de- 
pend solely on its postulated relation- 
ship with SI, rather it must be decided 
in subsequent analyses of mediational 
phenomena whether or not the under- 
standing of these phenomena can be 
facilitated by retaining this concept. 
For the present, it seems necessary to 
assume only that discovery may, de- 
pending upon the conditions, occur 
either during study trials and/or test 
trials in the PA learning situation. 

Returning to Fig. 1, another aspect 
of the data presented there which 
merits brief discussion is the apparent 
increase in A-C matching performance 
over trials in the C condition. The 
statistical analyses were in complete 
accord with the graphical evidence as 
both the main effect of trials and the 
superiority of performance on Trial 10 
as compared to Trial 1 were reliable 
sources of variation, F (9, 117) = 5.47 
and F (1, 117) = 12.78, respectively, 
p<.01 in both cases. 

For the NC condition there were 
no reliable differences associated with 
trials, F (9, 117) — 1.04, and the over- 
all performance in this condition was 
not significantly different from that 


expected as a result of random guess- 
ing £ (13)=1.70, p>.10. The 
latter finding indicates that the pairings 
of the A and C terms were not such 
as to allow for fortuitous matching in 
terms of similar letters, like sounds, 
etc. Similarly, the performance of this 
group also rules out the possibility that 
Ss’ performance under C conditions 
improved as the result of their having 
discriminated that the correct C terms 
remained paired consistently with their 
respective A terms while the “incor- 
rect” alternatives changed from trial to 
trial. It thus appears that Ss in the 
C condition not only were able to en- 
hance matching performance through 
mediation but were also able, during 
successive test trials, to discover and 
utilize or further strengthen mediating 
associations which did not lead to the 
correct response on the first test trial. 

The present experiment also pro- 
vided an opportunity to study the effect 
of A-B and B-C associative strength 
on the effective utilization of the medi- 
ating chain in the C condition. A di- 
rect relationship between strength of 
mediate associations and amount of 
mediated facilitation must be predicted 
from an associative interpretation of 
the mediation hypothesis. Schulz and 
Weaver (1968), however, were unsuc- 
cessful in an attempt to obtain differ- 
ences in the amount of mediated facili- 
tation of A-C learning with manipula- 
tion of number of presentations of A-B 
or B-C pairs. Similar null results 
were obtained by Horton and Hart- 
man (1963) and by Peterson (1964). 
However, since A-C performance in 
the present experiment is “uncon- 
founded" by study trials and the for- 
mation of “nonmediated” associations, 
conditions should be optimal for the 
investigation of the relationship of 
associative strength and mediated per- 
formance. 


+ 
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TABLE 2 


MEAN TOTAL NUMBER CORRECT on STAGE III 
AS A FUNCTION oF EITHER STAGE I OR 
Stace II STRENGTH 


Stage I Stage II 
Paradigm 
High Low High Low 
Chaining (C)| 25.57 | 19.28 | 25.21 | 19.64 
Nonchaining 
8.43 9.00 | 8.36 | 9.07 


(NC) 


Although no direct manipulation of 
degree of learning was introduced on 
either the first or second stages, the 
eight items were ranked for each S in 
terms of the number of correct re- 
sponses given to these items within the 
trials to reach criterion. Any differ- 
ences in the number of correct re- 
sponses can be taken as an index of 
the relative associative strength of the 
items. Hence, the four items above the 
median were designated as high (H) 
in associative strength and the four 
items below the median as low (L) in 
associative strength. Ties at the 
median were broken randomly. The 
number of correct responses for the 
corresponding test-list pairs were then 
tabulated and the data combined for 
all Ss in each condition. 

The mean total numbers of correct 
responses on the test list as a function 
of Stage I or Stage II associative 
strength are presented for both the C 
and NC conditions in Table 2. It 
may be seen that performance in Cond. 
C and H Stage-I or Stage-II strength 
was superior to that with L strength. 
On the other hand, strength of prior- 
list associations had little effect on test- 
list performance in Cond. NC. The 
difference in the total number of cor- 
rect responses for Cond. C was sta- 
tistically reliable both as a function of 
Stage-I (A-B) strength, F (L5) 
7.56, p<.05, and Stage-II (B-C) 
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strength, F (1, 13) = 12.35, p < .05. 
The analogous comparisons were not 
statistically significant for Cond. NC, 
F (1,13) < 1.0 in both cases. The 
acquisition curves (not shown), plotted 
as a function of paradigm and Stage-I 
or Stage-II item strength, were com- 
pletely consistent with the statistical 
findings reported above. 

The obtained relationship between 
A-C matching performance and 
strength of corresponding A-B or B-C 
associations in the chaining paradigm 
suggests that facilitation in this para- 
digm involves specific associations be- 
tween individual items; this finding is 
in complete accord with an associative 
interpretation of the mediation process. 
The failure of such a relationship to 
obtain in previous studies poses a prob- 
lem that must be resolved by further 
investigation. 
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RETENTION OF CONNECTED MEANINGFUL MATERIAL AS A 
FUNCTION OF MODES OF PRESENTATION AND RECALL* 


DAVID J. KING? 
Albion College 


This study compared the accuracy of retention for connected meaning- 
ful material (prose and poetry) under various modes of presentation 
and recall. 17 groups of Ss were run under each of the following 
conditions: (a) oral presentation-oral recall, (b) oral presentation- 
written recall, (c) visual presentation-oral recall, and (d) visual 
presentation-written recall. For prose passages, neither sex nor 
modes of presentation influenced the accuracy of retention. The 
written method of recall was superior to the oral-recall method for 
the retention of sequences of material. The retention of poems 
showed a similar superiority of written over oral recall but also 
revealed isolated but significant sex and methods of presentation effects. 


This study compares the accuracy 
of retention for connected meaningful 
material (prose and poetry) under 
various modes of presentation and 
recall. 

A history of visual and auditory 
comparisons recently prepared by Alli- 
son (1964) has shown that the major 
emphasis of previous research in this 
area has been in terms of the "best" 
method of comprehending a body of 
information. Even the most casual 
examination of the multitude of studies 
will lead one to accept the judgment 
of Henneman (1952) that: “The ex- 
perimental results have been far from 
conclusive. Whether visual or aural 
presentation proves to be superior de- 
pend largely upon the specific experi- 
mental conditions of a particular in- 
vestigation [p. 161].” 


1 The research reported in this paper was 
supported by Grant GB-2845 between the 
National Science Foundation and Albion 
College. 

Janice M. Page collected nearly all of the 
data on the recalls of the War of the Ghosts 
material. Lynette Hostetler collected all of 
the remaining data. The author is deeply 
indebted for their assistance. 

2Now at State University College at 
Oswego, Oswego, New York. 
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On the response side, oral and writ- _ 
ten recall have also been compared. 
Horowitz and Newman (1964) and 
Horowitz and Berkowitz (1964) have. 
found that oral recall “is more produc- - 
tive (wordy), more repetitious and 
elaborative, and produces more irre- 
levant and tangential material in all. 
aspects of analysis [p. 620]." How- 
ever, these differences are seen most 
clearly when one compares oral and 
written recall on a per unit time basis. 
As one obviously speaks much faster 
than one writes, the strength of these 
findings is considerably diminished. — 

The studies to be presented in this 
report give a complete comparison of 
modes of presentation, oral and visual, 
and of recall, oral and written. There 
has been one prior study (Horowitz & 
Berkowitz, 1967) somewhat similar in 


design. This study concluded that — 


Ss who listened : i 
produced a larger corpus, more ideas, fewer 
omissions of important units, more distor- 
tions, and a stylistically superior reproduc- 
tion than readers. Reproduction by speak 
ing produced a larger corpus, less diversity 
of expression, more additions, more subordi- 
nate ideas, and more signals than did repro- 
duction by writing [p. 207]. 
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These conclusions will be contrasted 
with the present findings in the discus- 
sion section. 

'This experiment examined four con- 
ditions of learning and recall: (a) oral 
presentation-oral recall, (b) oral pre- 
sentation-written recall, (c) visual pre- 
sentation-oral recall, and (d) visual 
presentation-written recall. 


METHOD 
Learning. Material 


Prose—There were three basic stories 
that served as the prose learning material. 
Each basic story had more than one version 
with respect to length. One story was 
Bartlett’s (1932) War of the Ghosts which 
had nine variations ranging from 20 to 
260 words in length (specifically, 20, 40, 60, 
80, 100, 140, 180, 220, and 260). The two 
other prose passages each had two lengths, 
80 and 180 words. The first of these, The 
Receptors of the Octopus, might be called 
scientific writing and was a brief description 
of the functioning of the olfactory and visual 
sense receptors found in the octopus. The 
second passage might be termed newspaper 
editorial writing. This passage, called The 
Myth of Addiction, was a short account of 
some of the misconceptions about drug ad- 
diction, There was, then, a total of 13 
prose passages. 

Poetry.—Two poems also served as learn- 
ing material, Both poems had versions of 
two lengths, 80 and 180 words. One poem, 
an edited version o£ Kipling's My Rival, re- 
flected a young girl's reaction toward an 
older woman whose great popularity over- 
shadows her. The other poem was an edited 
version of Whitman's Song of the Open 
Road, a poetic plea for the natural life 
uninhibited by the restrictions of society. 
There was, then, a total of four poem 
passages, 


Design and Procedure 


The basic design of this study was a 2 X 2 
X2 cube. One dimension represented mode 
of presentation (visual or oral), another 
mode of recall (written or oral), and the 
final dimension was sex. Each of the 
eight cells of the cube contained 12 Ss. 
The overall design of this experiment can be 
thought of as a cube of the above nature for 
each length of the various learning materials. 
There were, then, 17 such cubes, each con- 


taining 96 different Ss. In addition, each S 
gave two recalls. Finally, as a partial control 
over the necessity for collecting data indi- 
vidually in the oral recall treatment, for the 
100-word-length War of the Ghosts material, 
48 additional Ss were individually run in the 
written-recall treatment. 

The procedure for data collection was as 
follows: All of the written recalls except for 
the 100-word-length control group, were 
collected from groups of college students 
who acted as Ss. For a written recall after 
an oral presentation, E passed out two sheets 
of paper to each S. The E then read the 
stimulus material once and, when finished, 
asked Ss to write down as much of the 
material as they could remember as close 
to the original as possible. Unlimited time 
was allowed, and the first sheet of paper. was 
collected when everyone had finished writ- 
ing. The material was then read a second 
time, and a second recall was obtained. The 
visual-presentation procedure was the same 
except that S's were given dittoed copies of 
the stimulus material and asked to read it 
through at their normal reading rate from 
beginning to end and to “look-up” after. they 
had finished reading. When all had finished, 
a recall was obtained on the first sheet of 
paper. When this had been collected, they 
were asked to read the material again and 
to recall it once more, The control group 
was tested in the same way as shown except 
that the experimentation was done by testing 
Ss one at a time. 

The oral recalls were obtained by use of a 
Stenorette recorder, testing one S at a time. 
Each S was instructed regarding the pro- 
cedure to be followed with the recorder and 
was allowed to “hear his voice" if he 
wished (almost no one did). The S was 
then presented with the learning material 
(either orally or visually) and, following 
one exposure, was asked to recall it as close 
to the original as possible. After he finished 
with the first recall, he was again exposed to 
the material and a second recall obtained. 
All recorded recalls were transcribed by 
E who ran Ss. 


RESULTS 


Each of the recalls was then scored 
for accuracy by the following 10 mea- 
sures: number of words, number of 
letters, number of identical words (all 
words), number of identical words 
(content words only), number of idea 
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units, close procedure, number of sen- 
tences, word sequence No. 1 (5 
words in length), word sequence No. 
2 (8 words in length), and word 
sequence No. 3 (10 words in length). 
The details of these scoring procedures 
have been presented elsewhere (King, 
1960, 1966; King & Russell, 1966) and 
will not be repeated here. After pre- 
liminary tests of significance to col- 
lapse dimensions, factor analyses were 
done on the intercorrelations of the de- 
pendent variables. The results were 
consistent with prior two-factor solu- 
tions (see King, 1960, 1966; King & 
Russell, 1966). In keeping with these 
results, the analyses presented below 
will be restricted to the representatives 
of the quantitative (number of words) 
and organizational (sequence measure 
No. 2) dimension. Unless otherwise 
indicated, the .01 level of significance is 
used in all of the following analyses. 
Number of words.—There were 24 
F tests for main effect? Eight were 
significant for method of recall 3 
for method of presentation, and none 
were significant for sex. For the vari- 
ous interaction terms, there were 96 
possible F tests of significance. Only 
3 reached significance. These results 
rather clearly suggest a lack of sig- 
nificance for interaction effects and 
sex. Any effect of method of presenta- 
tion was weak and, while the effect of 
method of recall was strongest, it could 
hardly be considered as overwhelming. 
There were no systematic differences 
in locus of significance between the 
first and second recalls. The means 
will be presented following a discus- 
sion of the second dependent variable. 
Number of sequences.—Because of 
the marked heterogeneity of variance 


3 Not 26 as the 20-word-length recalls of 
the War of the Ghosts story were near per- 
fect in some conditions and hence produced 
zero or near-zero variability. 
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present in the sequence measures, a 
nonparametric analysis was performed. 
In doing this analysis, it was assumed 
that there were no sex differences. 
This was justified in terms of both a 
simple observation of the means and 
in view of the complete lack of sex 
differences found in the analysis with 
number of words. Kruskal-Wallis 
analyses on the four classifications of 
presentation recall resulted in 8 
of the 24 H values reaching signif- 
icance at the .001 level. There was a 
clear and consistent difference between 
method of recall (i.e., in all cases the 
written-recall mean was greater than 
the oral-recall mean) while no such 
consistent pattern emerged from the 
method of presentation. In contrast to 
the results obtained with the number 
of words, all but 1 of the significant 
H values was for the second recall. 
Table 1 presents the mean number of 
words and sequences for the recall of 
the War of the Ghosts material, for 
the two methods of recall and for both 
first and second recall. Table 2 pre- 
sents similar data for the recalls of 
The Receptors of the Octopus and The 
Myth of Addiction. The data for the 
100-word-length control group were 
not presented in the tables and are as 
follows: For the first recall, the mean 
number of words and sequences were 
73.1 and 2.3, respectively. For the 
second recall, in the same order as 
above, the means were 85.9 and 8.1. 
A brief mention of the influence of 
method of recall on variability should 
also be reported. For the number of 
words, for all sets of recalls, the oral- 
recall method has a greater standard 
deviation than the  written-recall 
method. Exactly the reverse is true 
for the sequence measures. That is, 
except for the 20-word-length recall 
of the War of the Ghosts, the method 
of written recall had a larger standard 
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TABLE 1 


MEAN NUMBER OF WORDS AND SEQUENCES RECALLED rom THE War of the Ghosts Material 
For Two METHODS or RECALL AND BorH First AND SECOND RECALL 


First Recall Second Recall 
Sau Oral Recall Written Recall Oral Recall Written Recall 
Words | Sequences | Words | Sequences} Words | Sequences} Words | Sequences 
20 15.8 39 18.9 5.1 18.5 6.2 20.1 8.6 
40 24.9 14 29.1 23 32.8 3.7 35.5 8.3 
60 36.6 1.0 41.3 2.3 48.3 24 51.6 6.8 
80 53.4 2 51.3 8 68.0 1.8 67.8 4.1 
100 73.8 1 73.3 24 87.3 2.8 87.6 12.5 
140 119.4 2.0 96.9 4.1 129.3 4.7 114.8 1237, 
180 137.6 1.6 122.9 3.6 169.8 6.1 149.0 14.2 
220 136.7 5 118.8 11 175.2 3:2 152.8 6.5 
260 161.6 6 132.1 sf 191.7 2:1 180.6 4.1 


deviation than the method of oral re- 
call for the sequence measure. 


Poetry Passages 


The results with the poems were far 
more complex than with the prose. 
Because of the relatively smaller num- 
ber of passages, a less detailed presen- 
tation of the tests of significance is 
warranted. There was a difference be- 


"tween the longer and shorter versions 


of the My Rival recalls. For the 


longer version, on the number of words 
dimension, there were no significant 
effects for presentation and sex but 
one out of two F tests was significant 
for method of recall. The sequence- 
measure analyses confirmed the signif- 
icance of method of recall. On the 
other hand, for the shorter version, 
both method of recall and sex were 
found to be significant. Consequently, 
it was only possible to eliminate 
the method-of-presentation classifica- 


TABLE 2 


MEAN NUMBER OF WORDS AND SEQUENCES RECALLED FOR The Receplors of the Octopus AND 
The Myth of Addiction MATERIAL FOR Two METHODS OF RECALL AND 
Bot First AND SECOND RECALL 


First Recall Second Recall 
SET Oral Recall Written Recall Oral Recall Written Recall 
Words | Sequences | Words | Sequences Words | Sequences | Words | Sequences 
The Receptors of the Octopus 
80 50.5 14 45.6 3.2 67.0 5:2 62.2 10.0 
180 86.6 a 80.2 2.2 121.2 1.8 114.7 6.3 
The Myth of Addiction 
80 33.5 9 42.8 5.5 51.3 | 5.3 64.1 19.8 
180 72.6 8 76.5 2.8 94.0 2.9 99.8 10.9 
D 
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TABLE 3 
MEAN NUMBER OF WORDS AND SEQUENCES RECALLED FOR THE My Rival MATERIAL 
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80-Word-Length Version 


First Recall 
Recall 
Words Sequences Words Sequences 
Male Female Male Female Male Female Male Female 
Oral 28.5 53.6 3.1 7.3 52.5 65.7 5.8 1726 — 
Written 36.8 50.8 9.2 11.3 60.8 7.7 20.3 24.7 
180-Word-Length Version 


Oral 
Written 


tion for the shorter version of this 
poem. For the Song of the Open 
Road, the results were even more com- 
plex. For the second recall of both 
lengths of this poem, only the method 
of recall produced significant effects 
(both words and sequences were signif- 


TABLE 4 
MEAN NUMBER OF WORDS AND SEQUENCES RECALLED FOR THE Song of the Open Road MATE 


icant for all tests). For the first 
however, there was a difference | 
tween the shorter and longer ver 
of the poem. For the shorter ver: 
no tests for sex or presentation rea 
significance while the method of 

was significant (for both words 


Recall 


80-Word-Length Version 


Oral 
Written 


Oral 
Written 
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sequences). For the longer version, 
both methods of presentation and recall 
were significant and it was possible to 
eliminate only sex as a classification. 

The mean number of words and se- 
quences recalled under the various con- 
ditions identified above for the My 
Rival poem are presented in Table 3. 
Similar data are presented for the re- 
calls of the Song of the Open Road 
poem in Table 4. 

Again, a brief comment on variabil- 
ity is appropriate. For the number of 
words, in all cases but two, the oral- 
recall results had a larger standard 
deviation than that obtained with the 
written-recall results. For the se- 
quence measure, in all cases, the writ- 
ten-recall standard deviation was lar- 
ger than the oral-recall standard devia- 
tion. 

Interaction effects were absent in 
the results of the poems as was the 
case with the prose recalls. 


Discussion 


Prose passages.—lt seems clear that 
with prose passages as the learning mate- 
rial, the method of recall is the only con- 
sistently significant effect. 

An interesting question can be exam- 
ined by the data presented in Table 1. 
This concerns the relationship between 
length of input and the resultant retention 
for the two methods of recall for both 
words and sequences. If the material for 
the number of words recalled, as a func- 
tion of the length of the learning material 
is plotted in a figure, the following de- 
scription seems to hold for both the first 
and second recalls: With original learning 
material of short length (20-60 words), 
obtaining recall by the written method, 
rather than orally, produces a larger 
amount of material. With longer mate- 
rial (140-260 words) the oral-recall 
method produces more material. There 
is a monotonic relationship between length 
of input and output (the slight drop at 
220 is undoubtedly due to some individual 


peculiarity of that specific stimulus story). 
This relationship between amount recalled 
as a function of length of original learn- 
ing material for the two methods of recall 
should not be overemphasized. The dif- 
ferences are small and when these results 
are checked against the number of words 
recalled by the two methods of recall for 
the additional prose passages (see Table 
2) conflicting results are obtained. That 
is, for The Receptors of the Octopus, for 
both lengths and recalls, the oral-recall 
method produced a larger number of 
words than did the written-recall method. 
For The Myth of Addiction, however, 
exactly the opposite results were found. 
Clearly, then, the above relationship would 
appear to have little generality. 

Tf similar figures are constructed for 
the number of sequences recalled as a 
function of the length of the original 
learning material (again, from Table 1), 
a more complicated picture emerges. In | 
general, the number of sequences de- 
creases for recalls from 20 to 80 words 
in length and then: shows a bow-shaped 
curve from 60 to 260 words. On the 
other hand, there is a much clearer dif- 
ference between the two methods of recall 
on this dimension as contrasted with the 
number of words. Under both conditions, 
at all lengths, the written-recall method 
produces a greater number of sequences 
than the oral-recall method. This differ- 
ence is especially marked with two repeti- 
tions of the learning material. Consider- 
able support for the generality of this 
finding can be found in the results for the 
two other prose passages as presented in 
Table 2. For both passages and under all 
conditions the written-recall method pro- 
duces a larger number of sequences than 
does the oral-recall method. 

The summary statistics for the control 
written group fall between the oral and 
written condition but more in the direc- 
tion of the written group. 

There was also a highly consistent in- 
fluence of method of recall on the. mag- 
nitude of the variability of the words and 
sequences recalled. In nearly all in- 
stances, for the number of words, the 
oral-recall method has a greater standard 
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deviation than the written-recall method 
with the reverse being true for the se- 
quence measures. 

Poetry passages—In general, the re- 
sults for the poems are much more diffi- 
cult to interpret than was the case with 
the prose passages. The very consistent 
prior findings about mode of presentation 
and sex differences no longer hold. Mode 
of presentation and sex differences appear 
to be sometimes significant but not for all 
lengths nor for both first and second 
recalls. For some reason, females were 
far superior to males in the recall of the 
shorter version of My Rival. In a similar 
manner, the method of written presenta- 
tion was superior to oral presentation for 
the first recall of the longer version of 
the Song of the Open Road. 

There are still, however, consistent 
findings present in the poems. In all 
cases, the method of written recall pro- 
duces more sequences than does the 
method of oral recall. This finding is 
consistent both between the two poems 
but also, of course, with the prose pas- 
sages. With respect to number of words, 
there seems to be a general consistency 
that the method of written recall is supe- 
rior to oral recall although the magnitude 
of the difference does not approach that 
of the sequence measure and, indeed, 
there is one case where this is not true. 
Finally, consistency was also present with 
respect to variability of words and se- 
quences as a function of method of recall. 
Almost exactly the same results were ob- 
tained with the poems as was found with 
the prose material. 

It seems futile to speculate about pos- 
sible reasons for the additional significant 
effects found with poetry. For example, 
it is tempting to call My Rival a female 
poem and, hence, easier for females to 
learn. Unfortunately, the longer version 
of this same poem, which did not produce 
significant sex differences, seems just as 
feminine. It must be noted that only a 
very small sample of poetic material was 
used so all findings must be considered 
as tentative. Much additional research 
will be needed to specify the nature of 
the differences between prose and poetry 
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with respect to the problem of modes of 
presentation and recall. 

Comparison with previous research.—A 
special comment must be made here in 
contrasting the present results with those 
of Horowitz and Berkowitz (1967) as 
previously quoted. First, it must be noted 
that there are many differences between 
the two studies. For example, they ran 
all their Ss individually while only one 
control group was so treated in the pres- 
ent work. They also used several de- 
pendent-variable measures that were not. 
used in this study but did not use others 
that were used here. Even granting such 
differences, it is difficult to reconcile their 
findings of significant differences between 
the methods of presentation (they found 
that oral presentation produces, among 
other things, both a greater number of 
ideas and a greater number of words). 
While the present research found isolated 
instances of such differences, it seems 
clear in these results that the two modes 
of presentation do not result in consistent 
differences on these measures. 

Horowitz and Berkowitz also found lis- 
tening more closely allied to speaking and 
reading more clearly related to writing. 
This conclusion was supported by the 
finding of two significant interactions. 
They found that those giving an oral re- 
call produced, among other things, fewer 
ideas with visual presentation. They also 
found that "Speakers who listened to the 
story displayed a far more accurate mem- 
ory than speakers who read it [Horowitz 
& Berkowitz, 1967, p. 214]." Again, this 
contrasts with the present results, With ' 
prose material, a total of 96 tests for sig- 
nificance of interaction effects was made, 
Only three (3.1%) of these tests were | 
significant at the .01 level or better. 

The differences between these two 
studies cannot be attributed to some 
unique learning material differences as 
Horowitz and Berkowitz used the War 
of the Ghosts as a stimulus. 
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TIME CHANGES IN THE STRENGTH OF EXTINGUISHED CONTEXT 
AND SPECIFIC ASSOCIATIONS? 


JOHN C. ABRA 
The University of Calgary 


2 experiments employing the A-B, C-D, and A-B, A-Br transfer para- 
digms as well as a l-list control assessed time changes in recall. 
List 1 learning was to a criterion of 1 perfect trial in Experiment H 
and to 7/10 correct on 1 trial in Experiment L. Recall and associa- 
tive matching were measured either following List 2 learning or after 
There was no indication of any gain in List 1 
strength over time. The C-D paradigm showed no initial unlearning 
but some increase in retroactive inhibition over time. 
paradigm showed appreciable unlearning which was also present to 


a 24-hr. interval. 


a lesser extent in matching. 


Several studies (Birnbaum, 1965; Kop- 
penaal, 1963) have shown no spontaneous 
recovery of an extinguished first list over 
time; these failures pose serious problems 
for contemporary interference theory (Post- 
man, 1961) which attributes much interfer- 
ence to recovered habits. However, the re- 
covery studies have generally employed the 
A-B, A-C paradigm, which, according to 
McGovern (1964), produces unlearning of 
both the Stage 1 association formed between 
the list responses and the experimental con- 
text to achieve response availability, and 
the Stage 2 specific S-R association. If 
the rates of loss over time differ for these 
two associations, any recovery occurring 
for one could be masked by a rapid decre- 
ment in strength for the other. To deter- 
mine whether this analysis is feasible, the 
present study examined time changes in the 
A-B, C-D (CD) and A-B, A-Br (ABr) 
paradigms. According to McGovern the 
former paradigm should produce only 
Stage 1 unlearning and the latter only 
Stage 2 unlearning. Houston (1967) found 
no recovery in CD but, since the requirement 
of complete interlist stimulus dissimilarity 
may not have been met in his lists, a re- 
examination of this paradigm appeared to 
be worthwhile. 

Method—All Ss learned the same first 
list, having 10 pairs of low-frequency nouns 


i' This research was conducted under a it from. 
the Thi dents Fund of the University of Calgary. 
This fund is supported by its from the National 
Research Council of Canada. . The author would 
like to thank Don Fields, Shirley Anderson, 
Norma Lipski who collected the data, and Bruce 
Dunn, who helped with the statistical analysis. 
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The A-Br E 


as stimuli and two-syllable adjectives 
responses. The second list for the A 
condition had the same materials as the 
first list but the S-R pairings were changed. 
To achieve minimal interlist stimulus similar- 
ity in CD complex perceptual figures taken. 
from Gibson (1941) were employed as 
stimuli in the second list. The List 2 i 
sponses were again two-syllable adj 
having minimal meaningful similarity 
the List 1 responses. List 1 was learned 
one perfect trial in Exp. H and to 7/1 
correct on one trial in Exp. L. In eac 
experiment there were three paradigms 
(ABr, CD, and a one-list control), Meis 
tention taken either immediately (0 condi 
dition) or after 24 hr. In each experimen 
18 undergraduate students at the Universi! 
of Calgary served in each of the six ex 
perimental conditions. 

Learning was by the  paired-associate - 
anticipation method at a 2:2-sec. rate. 
Ss in the two experimental conditions learned | 
their second lists for 15 trials, while the 
control Ss spent a corresponding length of 
time (about 10 min.) solving a pyram 
puzzle. Recall was by the MMFR techniq 
(Barnes & Underwood, 1959) ; S was sho 
each stimulus and given as much time 
he required to produce as many of the cot 
responding responses as he could. Stimuli 
were presented in the same order to all Ss — 
for recall; the correct responses were not - 
shown on the recall trial. The Li 
stimuli were presented first, then the CD 
Ss were shown the List 2 stimuli for List 2 
response recall Following the recall trial | 
S was shown all the stimuli and responses - 
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panel shows the results for Exp. 


from the lists he had learned and asked to 
match them correctly. 

Results and discussion —Analysis of the 
number of trials required to reach criterion 
on List 1 showed no differences between 
groups in Exp. H (all Fs <1) but revealed 
a significant effect of paradigm, F (2, 102) 
=5.01, p «.01, in Exp. L. Consequently, 
analysis of covariance techniques were used 
to evaluate recall for Exp. L. 

The recall results are presented in Fig. 1; 
the left panel shows the results for Exp. H, 
while the right panel shows Exp. L. Deal- 
ing first with Exp. H, a variation of the 
median test (Winer, 1962) was employed, 
since the high absolute level of recall pro- 
duced evident heterogeneity of variance. 
Comparing each of the experimental condi- 
tions separately with the control, ABr 
showed lower recall than the control x? 
(1) = 32.08, p <.01; the absence of an inter- 
action (x! « 1) showed that this relation- 
ship held at both retention intervals. Thus 
ABr showed unlearning, but no change in 
in strength over time. The CD-control 
comparison showed only a significant Para- 
digm X Interval interaction, x* (1) = 4.53, 
p.05. While there was no initial unlearn- 
ing in CD, there was an apparent increase 
in retroactive inhibition over time. 

Analysis of covariance of Exp. L recall, 
with the original learning score as the co- 
variate, followed conversion of recall scores 
to degrees. For the ABr-control compari- 
son, the only significant effect was paradigm, 
F (1, 67) = 32.05, p<.01. Thus, in agree- 
ment with Exp. H, unlearning occurred in 
ABr, with no indication of recovery over 
time. No effects were significant for the 


H and t 


igm at O and 24-hr. intervals; the left 
ent panel shows Exp. L. 


CD-control comparison. Again CD showed 
no unlearning and an indication of increasing 
retrocative inhibition over time, However, 
while the difference in loss rates is larger in 
absolute terms than in Exp. H, the Para- 
digm X Interval interaction was not sig- 
nificant, F (1, 67) —2.14. 

Thus there was no indication of List 1 
recovery for either paradigm. In fact CD 
showed increasing retroactive inhibition over 
time. However, this effect was small in 
magnitude and may not be reliable, since 
Houston (1967) found no such result. The 
major finding in the recall data was, in 
opposition to both Houston and McGovern 
(1964), a total absence of unlearning in 
CD. The reason for this discrepancy seems 
fairly clear. Both of these studies used 
nonsense syllable stimuli in both lists, with 
an inevitable repetition of letters. Stimulus 
selection of the repeated letters would effec- 
tively produce an A-B, A-C paradigm and 
appreciable unlearning. Such specific stimu- 
lus similarity was eliminated here by using 
perceptual figures in List 2; unlearning in 
CD was thereby eliminated. The empirical 
support would thus appear to be weakened 
for McGovern’s suggestion that aspects of 
the experimental context might act as the 
functional stimuli for Stage 1 learning. This 
notion apparently requires that unlearning 
be manifest in CD. 

Matching was evaluated by 'the median 
test. Since matching is thought to measure 
only Stage 2 strength, McGovern’s analysis 
predicts unlearning in ABr but not CD; 
these expectations were fulfilled. In both 
experiments individual experimental control 
comparisons showed only the main effect of 
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paradigm for the ABr-control comparison to 
be significant, x° (1) =14.37 and 246 for 
Exp. H and L, respectively, p<.01. The 
absence of unlearning in CD, even in Exp. L 
where performance was appreciably below 
ceiling, contradicts Houston's matching re- 
sults. It is possible that the specific stimu- 
lus similarity present in his lists produced 
Stage 2 unlearning. 

Comparisons by the median test showed 
a reliable improvement from recall to match- 
ing for all conditions except CD-0 in Exp. H 
and ABr-0 in Exp. L, x? (1) — 3.8 required 
for significance at the .05 level. Since 
matching presumably measures Stage 2 
strength and recall Stages 1 and 2, decre- 
ments in recall compared to matching could 
be attributed to Stage 1 loss. However, it 
is likely that matching scores are inflated by 
a guessing bias inherent in this technique; 
S might produce several actually unavailable 
associations simply by a process of elimina- 
tion. This bias would tend to exaggerate 
recall-matching differences and therefore 
estimates of Stage 1 loss. This point is 
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given credence by the general improvement 
in matching found in ABr, where Stage 1 
unlearning is unlikely. If anything, List 1 
response availability should be enhanced by 
learning the same responses in List 2. 
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REVERSAL AND NONREVERSAL SHIFTS IN CONCEPT FORMATION 
USING CONSISTENT AND INCONSISTENT RESPONSES? 


ELAINE SMITH anp HENRY LOESS 
College of Wooster 


64 Ss in 8 subgroups formed an exact replication of Harrow and 


Buchwald's (1962) experiment. 


48 additional Ss in 6 subgroups 


learned consistent responses in the Ist and 2nd card sorting tasks 
instead of inconsistent responses as required in the original study. 
Results confirmed the original findings that: (a) reversal shifts are 
learned significantly faster than nonreversal shifts; (b) consistent 
responses are learned significantly faster than inconsistent responses; 
and (c) there is a significant Shift X Response interaction. In 
addition, number concepts were learned significantly faster than posi- 
tion concepts, and the nature of the Task 1 response was found to 
interact both with the type of shift and with the type of Task 2 


response. 


Experiments have repeatedly shown that 
college students learn reversal shifts faster 
than nonreversal shifts (Kendler & Kend- 
ler, 1962). Typically, a reversal shift in- 
MTM al a oie aa 
in part by Grant GY 747 from the National Science 


Foundation Program of Unde luate Research Par- 
ticipation to the College of Wooster. 


volves two successive sorting tasks, with 
correct responses in Task 2 based on the 
same stimulus dimension as Task 1. In a 
nonreversal shift the basis of the Task 2 
response is shifted to a new stimulus dimen- 
sion. It has been hypothesized that reversal 
shifts are learned more quickly because S 
responds, via a previously learned media- 
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tional cue, to the same dimension used in 
Task 1 learning; whereas in nonreversal 
shifts such a mediational response is irrelev- 
ant and may even retard the Task 2 learn- 
ing. Accordingly results have been inter- 
preted as supporting a mediational S-R 
framework. However, Harrow and Buch- 
wald (1962) have pointed out that reversal 
groups in the above experiments, besides 
using the same dimension of stimuli as was 
previously relevant, have also been required 
to make literal reversals. That is, in Task 
2 the correct sorting response was the exact 
opposite of what had been required in Task 
1. They point out that a greater generality 
would be accorded a mediational S-R inter- 
pretation if it could be shown that reversal 
responses that are not the exact opposite of 
Task 1 responses are also learned faster 
than nonreversals. 

Harrow and Buchwald (1962) investigated 
this through the use of consistent and incon- 
sistent responses. Consistent responses 
occur when response cards must be matched 
with stimulus cards that are similar in some 
important respect (e.g, red response cards 
matched with red stimulus cards) and in- 
consistent responses occur when there is 
little or no similarity between the stimulus 
card and its appropriate response card (e.g., 
red response cards matched with nonred 
stimulus cards). In their study, 64 Ss 
learned an inconsistent response in Task 1. 
In Task 2, subgroups learned various com- 
binations of consistent-inconsistent, num- 
ber-position, and reversal-nonreversal condi- 
tions, representing a 2X2X2 analysis of 
variance design. The present experiment 
was partly a replication of this study and 
partly an attempt to determine whether the 
same general results occur when Ss learn 
consistent responses rather than inconsistent 
responses in Task 1. 

Method.—The experiment involved a four- 
category card sorting situation in which re- 
sponse cards were sorted into categories 
identified by stimulus cards. Briefly, each 
response card contained one to four dots in 
one of the four corners. This made a deck 
of 16 cards. Of 4 stimulus cards, 1 card 
contained a single black line in the upper 
left corner, a second contained two lines in 
the lower left corner, a third three lines in 
the lower right corner, and a fourth four 
lines in the upper right corner. The Ss were 
required to sort the response cards accord- 
ing to a position concept or a number con- 
cept. If a position concept, $ might be re- 


quired to make a consistent sorting response : 


response cards matched with stimulus cards 
according to the positions of lines and dots ; 
or, S might be required to make an in- 
consistent sorting response: response cards 
with dots in corner X sorted with. stimulus 
card with lines in the corner one clockwise 
position (or one counterclockwise position) 
from corner X. Similarly, if a number con- 
cept, S might be required to make a con- 
sistent or an inconsistent sorting response. 
Procedural details were followed exactly 
as described in the original article (Harrow 
& Buchwald, 1962). The Ss were tested 
individually. Response cards were presented 
randomly to each S. The criterion for both 
tasks was 12 successive correct responses. 
The S’s score consisted of the number of 
trials to learn a concept, excluding the 12 
criterion trials, The change in pattern of 
reinforcement was made without informing 
S. To avoid partial reinforcement effects 
during Task 2 for nonreversal groups, 4 
cards were eliminated from the deck of re- 
sponse cards. As a control the same re- 
sponse cards were also eliminated for the 
reversal groups. 

The basic design is illustrated in Columns 
1-5 of Table 1. Eight subgroups (Column 
7) represented an exact replication of the 
original experiment; arid six additional sub- 
groups (Column 8) provided a partial sec- 
ond replication in which Ss were required to 
make consistent rather than inconsistent re- 
sponses during Task 1. The latter require- 
ment did not allow a complete replication 
of the original experiment because two of 
the resulting Task 1-Task 2 combinations 
(Groups 3 and 4) involve a continuation of 
the Task 1 concept during Task 2 instead 
of a reversal or a nonreversal shift. A 
total of 112 undergraduate psychology 
students, 8 per subgroup, served as Ss to 
fulfill a course requirement. 

Results and discussion.—Statistical tests 
were performed exactly as in the original 
experiment. For Task 1, analyses of vari- 
ance applied separately to groups learning 
the number and the position concepts failed 
to show significant differences in mean trials 
to learn, indicating equal groups on the 
initial task. For Task 2, the mean trials to 
learn are presented in Table 1 along with 
the results of Harrow and Buchwald (1962). 
The table shows that for all groups in the 
present study (Column 7) learning occurred 
in fewer trials than in the original (Column 
6). Analyses of variance revealed that, as 
in the original, reversal groups learned sig- 
nificantly faster than nonreversal groups 
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TABLE 1 
MEAN NUMBERS OF TRIALS TO LEARN THE SECOND CONCEPT (TASK 2) AS A 
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Function OF Task 1 AND TASK 2 REQUIREMENTS 
Task 18 Task 2 Results 
Group 
Concept Concept Response? Shifte H & Bå Smith & Loess 
[05] (2) (3) (4) (5) (6) (7) (8) 
1 pos. pos. IC Rev. 1 7 39 
2 num. num. IC Rev. 14 4 15 
3 pos. pos. c Rev, 9 7 T 
4 num, num, € Rev. 6 3 = 
5 num, pos, Ic NR 155 36 60 
6 pos. num, IC NR 98 24 25 
7 num, pos. c NR 38 17 11 
8 pos. num. Ç NR 18 7 11 


a For groups represented in Columns 6 and 7, Task 1 involved inconsistent responses for all Ss, for Column 8 
(Pos. = position, num. = number.) 


Task 1 involved consistent responses. 
b IC = inconsistent; C = consistent. 
e Rev, = reversal; NR = nonreversal, 
4 Results of Harrow and Buchwald (1962). 


(p<.01), and concepts requiring consistent 
responses were learned significantly faster 
than concepts requiring inconsistent responses 
(p<.05). There was also a significant 
Shift X Concept interaction in both studies 
($ € 01). 

The nature of the interaction may be ob- 
served by inspecting Table 1. As in the 
original experiment, two-tailed ¢ tests were 
used to explore these differences, The t 
tests verified that nonreversal shifts involv- 
ing consistent responses were significantly 
easier to learn than nonreversal shifts in- 
volving inconsistent responses (p <.01), but 
there was no difference between reversal 
shifts involving consistent vs. inconsistent re- 
sponses (P 2.10). Similarly, t tests indi- 
cated that reversal shifts were easier to 
learn than nonreversal shifts when incon- 
sistent responses were involved (5 «.01), 
but not when consistent responses were in- 
volved (p > .05) even though the difference 
was in the same direction. 

The results support the general conclusions 
of Harrow and Buchwald (1962): (a) that 
groups required to make inconsistent re- 
sponses were more sensitive to the experi- 
mental conditions, and (b) that the relative 
difficulty of reversal as opposed to nonre- 
versal shifts is affected by whether the con- 
cepts used require consistent or inconsistent 
responses. However, the conclusions prop- 
erly should be limited to situations involv- 
ing inconsistent responses during Task 1 
learning. 3 i 

Results of the second replication, in which 
Ss were required to make consistent instead 
of inconsistent responses during Task 1, are 


presented in Column 8 of Table 1. The 
effect of Task 1 upon Task 2 learning was - 
evaluated by two 2X 2 X 2 analysis of vari- 
ance designs in which appropriate subgroups 
in the first replication (Column 7) 
compared with similar subgroups ii 
umn 8. In the first analysis, dealing 
with groups which learned inconsistent 
sponses in Task 2 (Groups 1, 2, 5, and 6), 
all main effects were significant (p's < 01 
However, there was also a significant Shi 
X Task 1 interaction (p<.01). Subsequent 
t tests designed to explore this interaction 
indicated that Task 2 reversal shifts were 
easier to learn than nonreversal shifts when 
Task 1 required an inconsistent response 
(p<.01), but not when Task 1 required 
a consistent response (p > .10). Additional 
£ tests showed that Task 2 reversal shifts 
were more easily learned when Task 1 in- 
volved inconsistent responses than when 
Task 1 involved consistent responses (p< 
01) but for nonreversal shifts, difficulty was 
unaffected by the type of response which 
had been required in Task 1 (p > .10). j 

In the second analysis, dealing only with 
groups learning nonreversal shifts in Task 2 | 
(Groups 5, 6, 7, and 8, in Columns 7 and 
8) the type of response required in Task 1 
did not significantly affect the rate of Task 
2 learning ($ 7.05). However, there was 
a significant Task 2 effect ( <.01) indicat- 
ing that consistent nonreversal shifts were 
easier to learn than inconsistent nonreversal 
shifts, regardless of the nature of the Task 1 
response. None of the interaction terms 
was significant. " 

The above analyses extend the findings of 
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Harrow and Buchwald (1962) and indicate 
that the relationship between reversal and 
nonreversal shifts on the one hand and con- 
sistent and inconsistent responses on the 
other hand is indeed complicated. Not only 
does the relationship vary as a function of 
the Task 2 response requirement, as ob- 
served by Harrow and Buchwald and veri- 
fied in the present study, in addition, the 
present results indicate that the effect of 
the first task response also interacts with 
the type of shift and with the second task 
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response. The results emphasize the neces-. 
sity of systematic controls when consistent 
responses are used in experiments of this 
type. 
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STIMULUS GENERALIZATION FOLLOWING FIXED INTERVAL TRAINING? 


DOROTHY S. KONICK ^^ AND 


Kent State University 


10 pigeons were trained on a fix 


presence of a 555 my key light. 


DAVID R. THOMAS 
University of Colorado 


ed interval 1-min, schedule in the 
Variable duration time out (TO) 


periods averaging 30 sec. followed each reinforcement. After sub- 
stantial training on this schedule, generalization testing was carried out 


in extinction, with 5 different wave! 


length stimuli each presented for 12 


periods of 1-min. duration (regardless of Ss’ responding), each sepa- 
rated by 30-sec. TO periods. The Ss showed a pattern of regularly, 
increasing response rate within the 1-min. interval (i.e a "scallop") 
both during reinforcement training and during generalization testing 


in extinction. A decremental grad 


ient of response rate was obtained, 


with greatest responding to the 555 mp value. During the l-min. test 
intervals, however, Ss responded in a stereotyped proportional manner, 
ie, the increasing proportion of responses in each consecutive 15-sec. 
period was the same for the different test stimuli. 


The purpose of the present investigation is 
to examine the temporal distribution or pat- 
tern of responses which develops in fixed 
interval training and to ascertain whether 
such patterning varies lawfully during stimu- 
lus generalization testing. Changing the 
value of the discriminative stimulus, of 
course, is but one means of producing dif- 
ferences in overall rate of responding; ex- 
tinction and drive level manipulations are 
two obvious alternatives. It is of interest 
to determine whether these different pro- 
cedures for manipulating response rate 
affect temporal response patterning in com- 
parable ways. 

Method—Ten experimentally naive hom- 
ing pigeons were food-deprived to 75% of 
their free-feeding weights and were main- 
sna This study vas pedermed Diversity under the di 


tion of the second author. It was supported 
Beach Grants NSF-GE-5159 and NIH-HD-00903-06. 


tained at these weights throughout the ex- 
periment. They were individually hoüsed, 
with water always available in the home 
cages. 

Two standard Grason-Stadler key-pecking 
boxes with associated automatic program- 
ming equipment were used. Visual stimuli 
were provided by Industrial Electronics En- 
gineers in-line display cells fitted with Kodak 
Wratten filters, No. 65, 74, 99, 73, and 72B, 
which provided relatively monochromatic 
stimuli with peak wavelengths of 501, 538, 
555, 576, and 606 mu, respectively. A Gra- 
son-Stadler white noise generator provided 
masking noise to both boxes at all times, 

An Esterline-Angus event recorder (Model 
AW) provided complete and continuous rec- 
ords of Ss’ performance during training 
and generalization testing; one pen recorded 
responses, another recorded stimulus onset 
and offset. The paper moved through the 


690 


recorder at a speed of .375 in/sec, providing 
a permanent visual record on which every 
key peck could be differentiated and tem- 
porally located within the stimulus interval. 

Following two days of continuous rein- 
forcement training with 40 2-sec, reinforce- 
ments each, all Ss were switched to a fixed 
interval 1-min. schedule (FI-1) for six 
40-min. daily training sessions. During all 
training up to this point the key was con- 
tinuously illuminated by a light of 555 mu. 

On the eighth training day, a time out 
(TO) was introduced by terminating the 
light on the response key following every 
reinforcement. At the completion of the TO 
period the 555 ma key light (SP) was again 
presented, which marked the beginning of 
the next l-min. interval. The TO periods 
varied randomly in duration from 3 to 60 
sec, but averaged 30 sec. across each train- 
ing session. 

The variable TO periods served two func- 
tions. First, the onset of the TO (along 
with the delivery of the reward) signified 
the end of a 1-min. fixed interval; the offset 
of the TO initiated the next interval. It 
was anticipated that the maintenance of fixed 
interval scalloping in extinction would re- 
quire (at the very least) a cue indicating 
the start and the end of the appropriate 
temporal interval, and the TO periods were 
included to serve this purpose. Secondly, 
the use of a variable TO period was de- 
signed to make the delivery of reward an 
unreliable cue to the duration of the sub- 
sequent interval. The onset of the SP re- 
liably signalled the start of a 1-min. fixed 
interval, presumably maximizing the con- 
trol developed over fixed interval behavior 
by this stimulus. 

All Ss were given 40 daily training ses- 
sions under the TO condition, each of 60 
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TABLE 1 


ANALYSIS OF VARIANCE ON TOTAL NUMBER OF 
RESPONSES PER 15-SEC. QUARTER 
TO EACH Test STIMULUS 


Source df MS F 
Between Ss 9 19,773.64 
Within Ss 190 4,078.58 
Stimuli (A) 4 76,539.25 | 9,629 
A X Ss within 36 7,959.90 
Quadratic Stimuli (B) 1 184,113.78 | 16.25** 
B X Ss within 9 11,328.79 
Quarters (C) 3 15,656.80 | 12.23** 
C X Ss within 27 1,280.00 
Linear Quarters (D) 1 46,498.76 | 12.45 
D X Ss within 9 3,735.55 
AXC 12 4,237.94 | 9.199*. 
A XC X Ss within 108 461.39 
BXD 1 29,549.01 9.91* 
B XD X Ss within 9 2,982.06 
* p < 025. 
> c 0l. 


min. duration; during each session, 40 rein- 
forcements were available, and were almost 
always earned. On the day following the 
last training session, each S was given 15 
min. of warm-up training and then tested 
for stimulus generalization in extinction. 
Five stimuli (501, 538, 555, 576, and 606 
m4) were randomized within a series, and 
12 different random series were presented 
to each S. 

Each stimulus was presented for 1 min. 
and was then terminated regardless of S's 
responding (or failure to respond). The 
TO between stimulus presentations was fixed 
at 30 sec. throughout the generalization test. 

Results and Discussion.—The visual rec- 
ords were scored by dividing every 1-min. 
interval into four 15-sec. periods and tabulat- 
ing the number of key pecks in each. If 
"scalloping" is defined as a progressive in- 
crease in response rate during the 1-min. 
interval, then all Ss clearly met this criterion 
by the end of training. A smooth, regular 
(and linear) increase in response rate from 
the first to the fourth quarter was typical 
of the performance of each S for the great 
majority of the intervals; only rarely was 
there an abrupt change from nonresponding 
to a near terminal rate of response. 

During generalization testing, the number 
of responses made in each 15-sec. quarter 
of the 1-min. interval was averaged for 
each S separately for each of the test stimuli, 
and was then averaged across all Ss to pro- 
duce the mean responses-per-quarter func- 
tions presented in Fig. 1. With the ex- 
ception of the function obtained on the last 
day of training (which is also plotted in 
this figure) these data were submitted to an 
analysis of variance and to a number of dif- 
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ferent trend analyses, the results of which 
are presented in Table 1. 

The significant Stimulus effect, F (4, 36) 
—1625, p «01, indicates that the number 
of responses given to the various test stim- 
uli differed reliably, and the result of the 
quadratic trend analysis, F (1, 9) — 1625, 
p € 01, confirms that a decremental rate 
was reliably obtained. 

The significant F ratio for the Quarters 
effect, F (3, 27) —1223, p < .01, indicates 
that rate of responding (averaged across 
stimuli) differed in the four quarters of the 
stimulus interval. A linear trend analysis 
of the responses-per-quarter function yielded : 
F (1,9) —1245, p<.01, and in fact 99% 
of the variance due to quarters was accounted 
for by the increasing linear function. 

The test for the effect of stimuli on re- 
sponse patterning is the Stimulus X Quarters 
component, which was significant, F (12, 
108) =9.19, p<.01. As can be seen in the 
figure, the absolute increase in response rate 
across the four quarters (ie. the slope of 
the responses-per-quarter function) is great- 
est under the 555 my condition and less 
to stimuli increasingly different from it. 
The more specific Quadratic X Linear trend 
analysis of this component, F (1, 9) — 9.91, 
p € 025, confirms that these slopes vary in 
a quadratic fashion. 

Figure 1 reveals that (excepting the 606 
my value, to which virtually no responding 
occurred) the responses-per-quarter func- 
tions are multiplicatively related, such that 
the increasing proportion of responses given 
in the sequential segments of the fixed inter- 
val is essentially the same for all stimuli. 
In fact, even the data from the last training 
session seems multiplicatively related to the 
generalization test data. 

In summary, following fixed interval train- 
ing, stimulus change during generalization 
testing results in a reduced overall rate of 
responding, less absolute increase in response 
rate throughout the fixed interval, but no 
change in the proportional increase. It is 
thus necessary to consider whether the same 
proportional increase in response rate within 
the fixed interval occurs when response level 
is varied by means other than stimulus ma- 
nipulation. Unfortunately little is known 
about the relationship between response 
rate and fixed interval performance. Ferster 
and Skinner (1957) reported data obtained 
from one pigeon during satiation produced 
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by 18 continuous hr. on a previously well- 
established FI 2-min. schedule, No quanti- 
tative analysis of scalloping was made, but 
it was evident that scalloping remained good 
throughout; though few responses were made 
in each interval near the end of the session, 
fhese tended to be made toward the end of 
the interval so that most reinforcements were 
received almost as soon as scheduled. 

In a study with rats and an FI-3 min. 
schedule, Weiss and Moore (1956) examined 
the responses-per-quarter functions of ani- 
mals trained and tested at two different food 
deprivation levels. Though the more de- 
prived Ss responded at a reliably higher rate, 
the slopes of the two functions did not differ. 

In the present experiment the effects of 
rate differences produced by extinction were 
examined by comparing the responses-per- 
quarter function obtained from each S in 
the first half of the generalization test (aver- 
aged across stimuli) with that obtained from 
the same Ss during the second half of the 
test. The result was two parallel, rather 
than multiplicative, curves. 

Analysis of variance of these functions in- 
dicated that though the rate of responding 
differed significantly between the two halves, 
F (1, 9) 2883, p < .025, neither Quarters 
nor the linear component of quarters inter- 
acted significantly with Halves, F (3, 27) = 
182 and F (1, 9) — 1.17, respectively. 

Thus it may be concluded that the fixed 
proportionality of the increase in response 
rate throughout the stimulus interval ob- 
served during generalization testing in this 
experiment is not a necessary consequence 
of fixed interval training. Subsequent to 
such training, some procedures for manip- 
ulating response level lead to parallel re- 
sponses-per-quarter functions, whereas 
stimulus change produces multiplicative ones. 
The investigation of the effects of stimulus 
manipulation on the more molecular attrib- 
utes of operant responding (e.g. the tem- 
poral distribution of responding within 
stimulus intervals) constitutes a challeng- 
ing new area for fruitful research, 
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EFFECT OF INSTRUCTIONAL SET ON KINESTHETIC 
FIGURAL AFTEREFFECTS 


MICHAEL WERTHEIMER anp CHARLES A. SHEETS, JR. 
University of Colorado 


A kinesthetic figural aftereffect (KFAE) was measured on 2 groups 
of 10 S's each, one group receiving instructions designed to produce a 
phenomenological set and the other receiving instructions designed to 
produce a physicalistic set. Presentation of the Müller-Lyer illusion 
under both types of instructions was used to indicate to S what was 
meant by the differing instructions. With the phenomenological in- 
structions, the measured size of KFAE was significantly greater 
(more than twice as great) as with the physicalistic instructions, but 
with the physicalistic instructions the size of KFAE was still signifi- 


cantly greater than 0. 
Müller-Lyer. 


Recent work has demonstrated that set 
(e.g., Gaze & Dodwell, 1965a; Story, 1959; 
Willems, 1967), including instructional set 
(e.g, Gaze & Dodwell, 1965b; Singer & 
Sheehan, 1965), can substantially affect the 
measured size of visual figural aftereffects. 
The present study was designed to investi- 
gate whether kinesthetic figural aftereffects 
(KFAE) (Kohler & Dinnerstein, 1947) 
are also subject to such instructional effects. 
Specifically, the question was whether phe- 
nomenological, apparent instructions (e.g., 
“Report what it feels like”) produce a 
larger measured KFAE than physicalistic, 
objective instructions (e.g, “Report what 
it is really like.”), and whether KFAE sig- 
nificantly different from zero would still 
obtain under physicalistic instructions. 

Method—Twenty students in the intro- 
ductory psychology courses at the University 
of Colorado served as Ss; the sample was 
limited to right-handed males because it was 
previously shown that both sex and handed- 
ness can affect the measured size of KFAE 
(Wertheimer, 1954). Since it is known 
that KFAE may persist for a long period 
(Malhotra, 1966; Wertheimer & Leven- 
thal, 1958), it was decided to use each S 
only under one set of instructions rather than 
using him as his own control. Accordingly, 
one randomly chosen half of the Ss (Group 
I) received the phenomenological instruc- 
tions, and the other half (Group II) received 
the physicalistic instructions. a. 

In order to demonstrate the distinction be- 
tween phenomenological and  physicalistic 
instructions to Ss, measurement of KFAE 
was preceded by presentation of the Müller- 


Similar effects were obtained with the 


Lyer arrowhead illusion under two differ 
instructions (cf. Carlson, 1966). 3 
The Müller-Lyer apparatus consisted | 
two interlocking pieces of matte-black Plexi- 
glas, a horizontally movable slide on whi 


was painted a white, 5-in. line bounded by | 


two concave arrowheads, and a tray on 
which was painted a 10-in. line bounded on 
the side away from the slide by a convex 
arrowhead. The concave arrowhead on the 


edge of the slide formed the convex arrow- | 


head for the figure on the tray. The arrow- 
head lines were each 1.4 in. long and at an 
angle of 45° from the horizontal lines; at 
of the painted lines were vs in. wide. / 
scale on the back of the slide allowed E 
to read error in inches directly. The appa- 
ratus was frontal parallel to S's line of 
sight, 6 ft. from his eyes. Knobs on the 
back of the apparatus allowed E to manipu- 
late the slide on S's instruction. ree 
All 20 Ss were first asked to indicate 
“when the line between the inward point- 
ing arrows looks just as long as the 
line between the outward pointing arrows.” 
There were four trials with these phenom- 
enologically oriented instructions; the start- 
ing position of the slide was systematically 
counterbalanced (variable too long, 
short, too short, too long). The illusion was 


then briefly explained and S was told that - 


it is possible to compensate for the illusory 
effect of the arrowheads. He was then 
asked to indicate “when the line between 
the inward pointing arrows is actually as 
long as the line between the outward point- 
ing arrows.” 
physicalistically oriented instructions with 


"n. 


Four trials followed these —— 
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the slide's starting position counterbalanced 
as before. 

For every S but one, the Müller-Lyer 
effect was smaller under the physicalistic 
instructions than under the phenomenological 
instructions, essentially replicating Carlson's 
(1966) finding. The difference between 
mean Müller-Lyer effects under the two 
kinds of instructions (phenomenological 
mean was 142 in, SD — 22; physicalistic 
mean was .40, SD —.34) was statistically 
significant, F (1, 36) —51.50, p<.001. In 
addition the mean Müller-Lyer effect under 
the physicalistic instructions was significantly 
different from zero, t (19) —5, p < .001. 
Thus the physicalistic instructions signif- 
icantly reduced the measured size of the 
illusion but did not eliminate the effect 
completely. 

Next KFAE was measured. The S was 
seated on a chair placed between two small 
felt-topped tables placed 24 in. apart. The 
table to S’s left held two transparent Plexi- 
glas bars, one 3 in. wide (the inspection 
bar) and one 13 in. wide (the standard 
test bar). The table to S’s right also held 
two bars, one 3 in. wide (the inspection 
bar) and one 4 in. wide, at the end near 
S, and increasing linearly and continuously 
in width to 4 in. at the far end (the 
variable test bar). The first three bars 
were 6 in. long and the last 30 in. long. 
The bars were all oriented so that their 
long dimension pointed straight ahead rela- 
tive to S. All of the bars were ł in. 
thick, and were held by their narrow sup- 
porting columns and felt-bottomed bases so 
that their top surfaces were 2% in. above the 
tabletop. The friction between the felt base 
and the felt tabletop was intended to keep 
the bars in place. 

There were four pre- and four postinspec- 
tion trials. The S sat with his eyes closed, 
and felt the bars with thumb on one side and 
index finger on the other. The starting posi- 
tion on the right-hand test bar was counter- 
balanced in both sets of four trials, beginning 
at the near (narrow) end, then at the far 
(wide) end, then the far, then the near. 
On each trial, S was asked to find the place 
on the right-hand variable test bar whose 
width matched the width of the standard 
left-hand test bar. After the fourth trial, 
the test bars were replaced by the inspec- 
tion bars, and S' was asked to rub back and 
forth on them continuously for 1 min. The 
inspection bars were then immediately re- 
placed by the test bars, and the four post- 
inspection matches were made, The S's 
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matches were coded in inches from the 
narrow end of the right-hand test bar. 

Before the KFAE procedure was begun, 
S was given a brief description of KFAE, 
and told that it involves an illusion much as 
the Müller-Lyer figure does. He was told 
that "the effect of rubbing this bar [the 
left-hand inspection bar] is to make the 
standard feel wider and the effect of rubbing 
this [the right-hand inspection bar] is to 
make the variable bar feel narrower... .” 
and that "the overall effect is to cause you to 
overestimate the width of the standard bar." 

The 10 Ss in the phenomenological group 
were told “report how wide the standard 
bar feels. Please do not try to compensate 
in any way for the effect of the rubbing. 
Pay attention to and report how wide: it 
feels," During the first preinspection trial, 
while E placed S’s hands on the appropriate 
bars, S was told, “find the place on this 
bar which feels just as wide as this bar 
seems to feel.” As soon as S indicated 
“here,” the position on the variable test bar 
was recorded, and S was asked to fold his 
hands in his lap. This procedure was then 
repeated for the remaining three preinspec- 
tion trials; After the inspection, while E 
placed S's hands in position for the first 
postinspection trial, S was told, “find the 
place on this bar which feels just as wide as 
this bar now seems to feel.” 

The 10 Ss in the physicalistic group were 
treated exactly the same as Ss in the phe- 
nomenological group, except that the word- 
ing of their instructions was slightly dif- 
ferent. Before the matches were made, they 
were told, “when you know the effect of 
the rubbing you can compensate for it as 
you did with the Müller-Lyer, and report 
how wide the standard bar actually is. Try 
to compensate for the effect of the rubbing 
and report how wide the bar actually is." 
During the placement of his hands for the 
first preinspection trial, S was told, “find 
the place on this bar which is as wide 
as this bar,’ and these same words were 
repeated during the placement of S’s hands 
for the postinspection trials. 

Results and discussion —The size of 
KFAE was computed as the ratio of the dif- 
ference between the mean postinspection 
match and the mean preinspection match to 
the mean preinspection match, weighted by 
a factor of 100 to give a percentage. The 
mean KFAE under the phenomenological 
instructions, 27.2%, differed significantly from 
the mean under the physicalistic instructions, 
12596; t (18) — 375, p<.001. The mean 
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TABLE 1 


PREINSPECTION AND POSTINSPECTION WIDTH (IN IN- 
CHES) MATCHED TO 1} IN, AND KINESTHETIC 
FIGURAL AFTEREFFECT (IN PERCENTAGE) 

AS A FUNCTION OF INSTRUCTIONS 


Preinspec-| Postinspec-| Figural 


tion tion Aftereffect 
Instructions 
M |SD| M|SD| M | SD 
Phenomenological | 1.62 | .16 | 2.06 | .18 | 27.2 | 11.5 
Physicalistic 1.69 |.18 | 1.90 | .19 | 124 | 9.1 


KFAE under the physicalistic instructions 
was also significantly different from zero, 
t (9) —428, p<.001. Table 1 and Fig. 1 
present these results in greater detail. 
Among other things, the figure shows a 
striking starting position effect in both 
groups in both the pre- and postinspection 
trials, with a wider match accepted when 
the starting position is wide and a narrower 
match accepted when the starting position is 
narrow. 


2.25 


e——e phenomenological set (N=10) 


x——x physicolistic set (NzIO) 


2,00) 


MEAN WIDTH MATCHED TO 1% INCHES (IN INCHES) 


1 2 3 
FRE-INSPECTION MATCHES 


inesthetic figural aftereffect as a function of instructions. 
widths seeded to e Pines 1%4-in. test width under instructions designed to induce a phenomenologi 
the dashed line indicates those matched to the standard under instructions designed to induce a ph: 
Matches were made before and after 1 min. of rubbing the inspection 


and Trials 2 and 3 descending.) 
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The findings clearly indicate that the mea- 
sured size of KFAE can be substantially 
affected by the set, suggestion, or orientation 
induced by the wording of the instructions, 
In the present study, phenomenological in- 
structions produced a mean KFAE which 
was numerically twice as large as the mean 
KFAE produced under physicalistic instruc- 
tions. However, even the physicalistic in- 
struction (to try to compensate for the illu- 
sion) was insufficient to remove the mea- 
sured aftereffect entirely, at least as pro- 
duced and measured in the present study, 

The consistent starting position effect, if 
it is interpreted as due to a rapidly-induced 
figural aftereffect produced by contact with 
the widths felt before the hand has reached 
the point of subjective equality, suggests that - 
a physicalistic, compensatory set is insuffi- 
cient to overcome such a rapid aftereffect, 
since it is present to about the same extent 
under both sets of instructions, b 

In general, the results indicate that the 


Scc d) 


4 I 2 3 4 
POST-INSPECTION MATCHES 

(The solid line indicates the ale 

istic set. 

bars; Trials 1 and 4 were ascending 
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wording of the instructions in a KFAE task 
can greatly affect the measured size of 
KFAE. Thus, in future work with KFAE, 
it would be advisable to examine the instruc- 
tions to see whether they are likely to induce 
the kind of set intended. From the present 
results, it appears that phenomenological 
instructions produce larger ^ measured 
KFAEs. Furthermore, the results suggest 
that, unless the instructions are closely com- 
parable, it is hazardous to generalize from 
one KFAE study to another. 
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PROCEEDINGS 


of the 76th Annual Convention of the — 
American Psychological Association 1968 


The 1968 PROCEEDINGS will contain the majority of the papers presented A 
at the 1968 Annual Convention and for the first time will contain summaries 
of symposia and inviled addresses. f 


The purpose of PROCEEDINGS is to facilitate the dissemination of in- - 
formation regarding current work in major fields of psychology by i 
research reports available in advance of the Convention and with a m 
publication lag. 1 


The 1968 PROCEEDINGS-— Volume III of the series which was begun i 
1966 with Volume I—will contain approximately 350 reports distributed across 
the major subject areas of psychology. The reports, together with the sum- | 1 
maries of symposia and addresses, will comprise about 700 pages. A publication . | 
date of July 1 has been set but you are urged to order your copy of the 
PROCEEDINGS now to avoid a delay in mailing. 


PRICE: $12.00 per copy ($6.00 to APA members) 


American Psychological Association 
1200 17th Street, N. W. 
Washington, D. C. 20036 
Please send me........ copy(ies) of the 1968 PROCEEDINGS 
( ) Remittance Enclosed ( ) Bill Me (when issue is mailed) 


f the 1966 and 1967 PROCEEDINGS are also available at $6.00 and E re- 
m $3. 00 and $4.50 to APA members.) 


FOR TODAY'S SOPHISTICATED TIMING AND COUNTING NEEDS 
1510 SERIES Preset Counter Time Interval Generator 


Preset counters and timers, all solid state instru- 
ments. Time intervals from 0.001 to 999.9 seconds 
can be generated with an accuracy of +0.25%. 
External events up to 9999 can be Preset Counted. 
Available in one, two or four channel models. 


3PDT relays can be controlled to remain closed for 
the duration of the count or preset time interval 
or can be set to close momentarily at the end of 
the period. Thus, a variety of external circuits can 
be controlled. Another outstanding feature includes 
the generation of a positive pulse at the end of the 
period to allow interface with other solid state 
equipment if this mode of operation is preferred 
to relay circuits. 

These instruments can be triggered for one-time 
quum or can recycle automatically. The multi- 
channel units also feature sequential or parallel 
operation of eu cuanna onein Penna oor 
nections are located on the back panel. Intern: 

time intervals can be set in increments of 0.001, MODEL 1514 
0.01 and 0.1 seconds. Counting speeds range up to 


100 Khz. APPLICATIONS 
Where else can you get such versatility and ac- e Timing 
curacy combined in one compact, rugged and eco- Interval Generotion 


nomical instrument. Write for more information or 
for our free catalog describing a full line of time 
tested devices for use in psychological 
Model 1510 (1-channel), $395.00 Pulse Delay 
Model 1512 (2-channel), $525.00 Programming 


Model 1514 '(4-channel), $650.00 X Control 


Hunter Manufacturing Company, Inc. 


P.O. Box 6566, lowa City, lowa 52240 


Preset Counting 


Consider this important new text . . . 


CASEBOOK ON ETHICAL STANDARDS FOR 
PSYCHOLOGISTS 


The application of the 19 principles comprising the Code of Ethics for all psychologists 
is presented. Each principle is illustrated by actual, but, disguised, cases involvi 
alleged violations of the goncin and action taken by the Committee on Scientific an 


Professional Ethics and Conduct. 
Four Appendixes state and amplify the additional principles applicable to specific 
areas of psychology: 
—Ethical Practices in Industrial Psychology 
—Automated Test Scoring and Interpretation Practices 
—Standards for APA Directory Listings of Private Practice 
—Guiding Principles for the Humane Care and Use of Animals 
The arrangement of the Casebook into a general section of cases and four additional 
specific sections increases its versatility as a reference book for individuals and as a text 
for instructional purposes. 
i Price: $1.00 


Order From: 
AMERICAN PSYCHOLOGICAL ASSOCIATION 


1200 Seventeenth Street, N. W., Washington, D. C. 20036 


NEW IN 1969 


DEVELOPMENTAL 
PSYCHOLOGY 


A bi-monthly journal from APA 


The subject matter will be broad in scope. “Development” vill 
be treated as a process extending throughout the life cycle of both 
humans and animals. The majority of the articles will be reports of 
original research, but theoretical discussions will also be included. 


Editor: Boyd R. McCandless 


Frequency of Publication: 


Bi-monthly; one yolume (approximately 400 pages) per calender 
year—six issues per volume. 


Subscription Price: $10.00 per volume 


Subscriptions are now being accepted and should be sent to: 


Subscription Department 
American Psychological Association 
1200 Seventeenth Street, N. W. 
Washington, D. C. 20036 


if you must . Arm decide 
between 


99.99% & 
70.00% data 


you have 


A DIGITAL-LOGIC SYSTEM 
FROM BRS 
.. IT'S AS SIMPLE AS THAT. 


The Navy used a BRS system 
along with a relay system on a 


| 4-month long test. The test was to 


determine human response to 
confinement lasting up to 10 days. 
All subject movements were 


monitored. 


The BRS system had only 5 
minutes of down time and returned 
99.99% of the data during the 

test period. 


The relay system broke down 


¢ 


recorded 


a simple choice! 


16 times, required 15 hours of 
repair, and returned only 70% 
of the data. 


ABRS Digi Bits system is a tool for 
the experimenter... by design. 


You can be a BRS data systems 
expert in a surprisingly short time. 
Send for Bits of Digi, our easy to 
read, easy to understand manual 
that takes the “mystery” out of 
digital-logic. 

If you are already familiar with our 
system, you may wish to ask for 
our current catalog. 


Applying electronics to the research laboratory 


=$: £- JFORINGER 


A DIVISION OF TECH SERV INCORPOP A 


5451 HOLLAND DRIVE, BELTSVILLE, vand 
PHONE: (301) 474-2900 
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the recorders you can depend on in your research 


EVENT RECORDER 
AVAILABLE IN 4 and 6 PEN MODELS 


from $166.00 
fob Arlington, Mass. 


Our Model C-3 and C-3, SHS, © | 
CUMULATIVE RECORDERS 


from $355.00 
fob Arlington, Mass. 


low cost 


STUDENT-LAB type 
CUMULATIVE RECORDER 


$185.00 a sal 
fob Arlington, Mass. jum 


Ralph Gerbrands Coingand EEE] 


8 Beck Road, Arlington, Mass. 02174 Tel. 617 648-6415 
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for systems of your 
own design. 


MODEL C RAT CAGE 


RAT LEVER 


aguz 


A QUALITY LEVER 
MOUNTED IN A 
RIGID CASTING 


$25.00 
fob Arlington, Mass. 


EXTERNAL PELLET TRAY 

e STAINLESS STEEL GRID BARS 

e HINGED TOP WITH LOCKING 
LATCH 

e STAINLESS STEEL DROPPINGS PAN 


$55.00 fob Arlington, Mass. 


e INSTANT DELIVERY OF PELLET 
e Capacity of 3000 Rat Food Pellets 
e Will not Crush Pellets 
e Models Available for CIBA Pellets 
e Low Operating Noise 
* Low Operating Current 
© Small in Size 
© Easy Cleaning 
€ Continuous Duty Components 


$106.00 fob Arlington, Mass. 


Ralph Gerbrands Company 


8 Beck Road Arlington, Mass. 02174 Tel. 617 648-6415 


CONTINUED FROM BACK COVER 


COMPLETELY NEW THREE-CHANNEL 
AUTOMATIC TACHISTOSCOPE 


TECHNICAL SPECIFICATIONS: 


I. Optical System—3-channel Dodge type; Viewing distance, 16 inches; Apparent view- 
ing distance (i.e., focal distance), 30 feet; Field dimensions, 1/2" horizontal, 1” verti- 
cal; Maximum clear field brightness, 20 ft. lambert (min); Mirrors: 45/45 anti-re- 
flection back coated; Mirror mounts: two axes (easily adjustable), Color Corrector— 
fixed, Diffusers: double etched mylar, Masks: Reference Mask: fixed, Matching Mask: 
adjustable (all 4 edges independent); Viewing Head: Beam splitters, adjustable inter- 
occular separation; Holders: 1 stimulus and 3 filter (35mm slide), plus 2 changers, 
Electrical Connections: 3 separate field connections, 1 interlock connection; Material: 
BN Bakelite; Overall dimensions: approx. 2 ft. X 2 ft. X 1 ft.; Weight: approx. 
s. 


Il, Control System—3-channel unit. Timers (3) solid state one-shot bootstrap reset. 
Range: 0.1 millisec. to 110 sec (6 overlapping ranges). Accuracy: +2% of reading. 
Inputs: 2 (A.C. coupled—independent). Outputs: 2 (gate &. gate inverse—D.C. 
coupled). Fan-out: 10 min. 


Lamp drives (2 per channel), active series dissipative current feed-back type. Controls 
(1 per channel): 10-turn precision. Range: approx. 1 log unit. Accuracy (feed-back 
error): 5296. Inputs: normal input (D.C. coupled), inhibit input (D.C. coupled). 


Pulse Generator—I per 3-channel control unit 30 cps rate limited go/no-go bounce 
filter for start pushbutton or external switch closure. 


General Construction—all solid state except for vacuum tube lamp switches trans- 
formers hermetically sealed, (Mil-T-27A) filter capacitors, computer grade, timing 
capacitors mylar or solid, electrolyte tantalum, connectors aircraft (type K), hardware: 
military (Jan) & industrial. 


Ill. Stimulus Changers: Capacity: 36 or 100 (TDC type tray), Speed: 1 sec/cycle max, 
Controls: Input (Direct Coupled with manual pushbutton), Input (D.C.), Inhibit input 
(Direct Coupled with manual pushbutton), Reverse input (A.C. with manual pushbut- 
ton), Display: Manually reset electromechanical frame counter. 


Shipping weight of control system and stimulus changers is approximately 200 Ibs. 
PRICE $5,400.00 F.0.B. New York City. 


SCIENTIFIC PROTOTYPE MFG. CORP. 


Programming and Control Equipment; cages; feeders; manipulanda; 
Lebe and event recorders: soundproof chambers: power kuppia; audio 
equipment; special shock sources; closed circuit T.V. equipment; 2, 3 and 
6 channel tachistoscopes; solid state programming equipment; and special devices 
and systems. 
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COMPLETELY NEW 


THREE-CHANNEL — — 
AUTOMATIC TACHISTOSCOPE 
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The Model GB Auto-Tach consists of a standard 
three-channel tachistoscope control system and a min- 
iature three-channel optical system which is designed 
dai ^ (for transillumination of 35mm transparencies. A beam - 
splitter viewing head with adjustable interocculor sep- 
aration is provided. si Saat 


Two of the three fields are provided with automatic 
stimulus changers. which operate in approxima’ e 
second and locate the slide holders with a precision of 
0.02 inches. The third field (i.e.n "Blank") is ordi- 
narily used for pre- or post-exposure adaptation or fi 
(drop-in) holder. Additional drop-in filter holders are provided 


tion and is equipped with a manual 
in each channel. 


The high brightness of the transillumination systems and the high speed, medium capacity changers — 
(36 slide holder standard—100 slide drum available which will not accept glass or precision slide ` 


mounts) moke this the most convenient commercially available tachistoscope. 
nw" i 
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